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A Simple Method for Solving 
Three-Dimensional Inverse 
Problems of Turbomachine Flow 
and the Annular Constraint 
Condition 


in 1950, Wu [/\ suggested that the three-dimensional inverse (design) problem of 
turbomachine flow may be solved approximately by a Taylor series expansion in the 
circumferential direction based on the known flow variation over an S» stream 
surface in the midchannel of the blade passage. This idea has been realized recently. 
A new coordinate transformation has been developed. The coordinate surfaces are 
coincident with the S; stream surfaces. This transformation leads to a new method 
to calculate the aerodynamic equations of three-dimensional fiow. By the use of this 
transformation, a high-order expansion is realized to determine the shape af the 
biade surfaces from the fluid state on the S>,, stream surface directly, Compuiation 
in this manner Soon leads to the discovery that theoretically the distribution of flow 
Parameter (usually Var) on S;,, prescribed by the designer should satisfy a con- 
straint condition, which guarantees that the S, stream surfaces along the hub and 
shroud obtained Jrom circumferential extension of the S2,, surface are surfaces of 


revolution. An approximate method is suggested to meet this condition. 


Introduction 


In 1950, a general theory of three-dimensional compressible 
flow in turbomachines based on the flow on two intersecting 
families of relative stream surfaces was presented [1]. The 
basic computation 1s that of the flow variation on the blade- 
to-blade S, stream surface or hub-to-tip Sj stream surface 
(Fig. 1). Early computation methods [2-4] have been further 
developed and improved by expressing the basic equations in 
terms of nonorthogonal curvilinear coordinate system and 
corresponding nonorthogonal velocity components. 

In general a complete three-dimensional solution is to be 
obtained by an iterative procedure between the two basic 
computations of 5, and §,. In cases, where the §, stream 
surfaces may be taken as surfaces of revolution, only one S, 
surface, which is usually chosen in the middle of the flow 
channel formed two neighbouring blades, is needed. The flow 
on the S, surfaces of revolution, which are generated by the 
streamlines on the S,,,, surface, may be solved accurately by 
the field method (for example, [3~7]}, or by the approximate, 
but much quicker, mean-strcamline, series-expansion method 
(far example, early work [9, i0] and recent improvement 
[f1-13}}. The solution thus obtained is called quasi-three- 
dimensional solution. 

in cases where the assumption that S, surfaces are surfaces 
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of revolution is not good enough, it is necessary to compute 
flow on general S, surfaces and a number of general S, 
surfaces, and solution for axial compressor by finite dif- 
ference method and for centrifugal compressor by finite 
element method have been obtained according to this 
procedure. More solutions obtained by this method would 
define whether the area within quasi three-dimensiona! flow is 
accurate enough for engineering application. Full three- 
dimensional solution of turbomachine flow may also be 





Fig.1 Two isospace families In three-dimensional space 
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obtained without assuming that S, surfaces are surfaces of 
revolution namely determination by the use of Taylor series 
expansions, and the circumferential derivatives in the series 
are calculated from known values on the S,,,, surface. 

This procedure is different from the two-dimensional, 
mean-streamline, series-expansion method in two important 
respects. First, the circumference derivatives are to be 
calculated from flow variation along one coordinate line other 
than in the circumferential coordinate line but also from the 
flow variation along the fluid coordinate as well. Second, the 
successive S, surfaces in the circumferential directions cannol 
be determined simply by a mass balance on the surface of 
revolution as in the two-dimension case. This type of three- 
dimensional solution was first obtained in [19, 20]. A three- 
dimensional flow field can be obtained very quickly. 

Being an approximate method, the method may be limited 
to a range of value of flow configuration and design 
parameters. It should be very useful in analyzing the nature of 
three-dimensional flow corresponding to certain flow 
parameters and geometrical configurations, and especially, in 
a preliminary three-dimensional design blades of tur- 
bomachines. 


Determination of the Shape of S,,, and the Flow Along 
So, 


z 


The §;,, calculation is the same as that in a quasi three- 
dimensional design procedure. The hub and casing wall 
contours and the projection of the S;,,, surface are specified by 
the designer, The variation of the anguiar momentum of the 
fluid about the machine axis (V,r) along the S;,, surface and 
the circumferential thickness of the S,,,. surface are also 
specified by the designer. (The latter is to be obtained em- 
pirically from a desirable thickness distribution of the blade to 
be designed). Calculation of §,,, flow gives the shape of the 
So», surface, the streamline distribution on the 5;,, surface, 
and the variation of ail flow properties on the $3,.. 


The Partial Derives of Flow Quantities in Nonor- 
thogonal Curvilinear Coordinates 


The flow properties at a point in the flow passage can be 


Nomenclature 





basic metric tensor 

Jacobian composed of a,; 

integrating factor in the continuity equation 
for S, stream surfaces 

nonzero term om the right-hand side of the 
continuity equation for S$. stream surface 
values of stream function’s on isospace of S, 
family 

values of stream function’s on isospace of 8, 
family 

= enthalpy per unit mass of fluid 

= relative stagnation rothalpy, 
U?/2)4 W?/2 

station along x’ coordinate lines 
station along x* coordinate lines 

= distance along streamline 

= any fluid quantity 

= relative cylindrical coordinates 
angular momentum of fluid about axis of 
rotation 

relative velocity 

w = contravariant component of W 

Ww, = covariant component of W 


Qs 
li 


Deis a a ed 


D3, se = 


(h — 
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obtained from the known values on the S;,,, surface by Taylor 
series expansion as follows 


(y-n) 


5 Fm) (1) 


He) = Pn) +E — Onde (Cy + 
. ee 


In [5], the vorticities and continuity equations were ex- 
pressed as fallows 





,;_ 1 faVor) ow, 
o= ={ ax? 4 
2, Lf em _ Ber 
é ake ax! 2) 
i 3.) Oe 
: -a 1 a | 
¥ +(oW)=0 (3) 


When the flow is isentropic and irrotational, the first and 
second derivative of W', W?, W_., p with respect to ~ can be 
obtained in a manner similar to that used in [1] as follows (for 
details, see Appendix A): 











[ a 
—-: — Oy an OV yr) 
ans cose, " r--— = : 
W a3 COS 13 V a> 3 Z ae ax? 
a4 dy an o( Var) 
Va r-—— a Pec, (Nl ee as Oe 
i V/a3;C0S6 axt dep ax! 
a de 1 dy | OW, C 
Vai, ax! Var ax? r 8p 
(4) 
where 
y_ d tf wt a oe ee 
(a= a re G35 A)3) + axe (ro Wa, sind,2)) 
5 
rpviay, ( ) 





W' = physical component of W tangent to x 
é = time 
x! .x*,.x* = general nonorthogonal coordinates 
k = ratio of specific heats 
%, = angle included by the coordinate lines x’ and 


xi 


£€ = absolute vorticity, V x ¥ 
&.4,y = nonorthogonal coordinate system 

p = gas density 

yY = stream function 

© = angular velocity of blade 
Subscripts 

c = casing 

A = hub 

i = inlet 

m = mean (midchannel) 

nH = component in the direction norma! to hub or 

casing 

p = pressure surface of blade 

s = suction surface of blade 
Superscripts 


vy = number of 5, sheets in S, family 
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dp _ oe 142 a2 
5 = GE & 5, OW)? + (0) 

—2(oW' ioW? \oos6 a] +(oW,)? | 6) 
The Annular Constraini 


It is shown in {21}, when the integrability condition is 
considered in blade design, the flow variation a designer is 
free to specify is stream filament thickness, in addition to the 
variation of the thickness only one aerothermodynamic 
relation on the S;,, may be specified, which is usually the 
variation of Vr. 

New computations by equation {1) immediately show that 
the acrothermodynamic flow quantities must satisfy 4 con- 
straint condition in order to guarantee that the S,; stream 
surfaces obtained on the casing and hub wall are surfaces of 
revolution. The velocity component normal to hub or shroud 
(W,,) should be equal to 0 to satisfy the solid wall boundary 
condition. As the same as other fluid properties, the MW’, 
should be obtained from the known values on the 5;,, surface 


by Taylor series expansion 


aw’, 
W,, (Pac — ae (¢,,) + (= Py) ae (Sm) 





Thus all orders of partial derivatives of W’, on S3,, with 
respect to ¢ at the hub and casing should be equal to Q, 
satisfying the solid wall condition. As a first-order ap- 
proximation, the first partial derivative should be equai to 
zero. In nonorthogonal curvilinear coordinate this relation is 


a ae 
= (9) 
Gp Ireswy 





By adjusting the variation of V,r on S3,, in the vicinity of hub 
and casing wall, the values of d(V,r)/dx? can easily be 
modified to satisfy equations (6). This method obtained 
through our numerical caljculations is similar to the con- 
clusion reached in [22]. Table shows variations of aH /d¢g 
and Vy,r at the hub and casing boundaries on S3;,, in our 
numerical caiculation. 


Determination of the Blade Shape 


Determination of the blade shape ts the key part of this 
three-dimensional, mean-streamsurface method. The blade 
shape cannot be determined only by the mass flow relation on 
a surface of revolution as in the two-dimensional mean- 
streamline method. The following function proposed in [23] is 
used in [24] to describe the flow on the two intersecting 
families of S,; and 5, surfaces in a three-dimensional tur- 
bomachine flow passage (see Fig. 1). 


(e—om)? & Wr a Vii xX Ve2.=—W (10) 
2 (ey? =n ¥, = 000 x7 C)=D,,E, . (11 
On the S,,,, surface Ta ¥2(x) 2° = DE... ’ 
W(@mdns =O. (8S) . whereg), @),.....- DE on oho oe are values of stream 
dW? /dy before adjustment 
m/s 
K/J 0 1 z 3 4 5 6 7 
11 135.3 114.9 2.5 — 6.0 — 28.9 —42.0 -12.1 251.0 
10 Jia 81.1 4,8 —6.} — 17,3 — 13.3 — 9.6 18L.9 
2 34.8 32.4 199.2 163.9 145.4 Liza 30.5 —283.8 
1 — 39.6 15.0 216.5 173.6 170.5 127.9 31.3 ~—252,7 
d+ /dp after adjustment ai 
K/J 0 [ Z 3 4 5 6 Fi 
Il 0.03 0.04. 0.05 0.01 — 0.04 — 0.03 0.06 0.08 
10 336 41.2 —21.0 — 15.0 — [6.6 — 10.0 — 8.0 0.7 
2 26.8 —2.8 66.1 63.7 72.4 69.7 29.2 a 
1 —I.1 —J.2 —3.5 —2.3 0.2 2.4 1.4 ~0.1 
¥,, before adjustment site 
K/J 0 1 2 3 4 5 6 ar 
11 Ce, a1 26.9 16.6 8.0 2.1 —0.6 G.0 
10 37.6 35.6 25.8 16.0 7.9 2.0 —0.6 0.0 
fs 35.0 35.9 22.8 13.9 6.9 1.3 —{,9 0.G 
i 35.0 36.6 22.3 13.3 6.4 0.8 —(.7 6.0 
V,, after adjustment az 
K/5 D) 1 Z a 4 5 6 7 
I] 38.0 36,2 26,7 16.6 8.3 2,4 —0.6 0.0 
EO 37.6 35.6 25.8 16.0 re 2.3 —0.7 0.0 
2 35.0 45.9 22.8 13.9 6.9 1.3 ~G.5 0.0 
1 35.6 36.1 aout 14.6 8.) i 0.2 0.2 
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Fig.2 Family of So surface and intersection with inlet plane Z; 


functions on these stream surfaces. In the three-dimensiona! 
space, the distribution of these streamsurfaces is restricted by 
the boundaries of the flow passage as follows: 


i Hub and casing walls are two stream surfaces of ihe $, 
family, 

2 Pressure and suction surfaces of the blade are two stream 
surfaces of the $, family. 


From the known S,,, surface a family of S. surfaces wilk be 
formed {see next section}. In this family, two surfaces are the 
pressure surface and suction surface of two neighbouring 
biades, respectively, and S;,, divides the total mass flow 
through the channel into suitable proportions. 


Forming S, Family by Progressing Circumferentially 
from S;,, 


Similar to the two-dimensional mean-streamline method, 
the » coordinate of S, surface can be obtained by the use of a 
serics expansion trom S,,, as follows 


ap 
Ap=e— m — Ya by n( =~ | 
p=yp—~,, =(Y.— py) ap? m 


+ 5s ~ Yan) ( 2) Fiesee. At2) 


3}} 
where ¢ ts the circumferential coordinate of any S; surface 
and y. 1s the valuc of stream function of that S. surface. 
The integrating factor 8 of [i] is now expressed as follows 
(see Fig. 2) 





B/B,= Ae/ Ay; (13) 
DnB din& dinB 

= yr _—— 14 

Dp ag 7" ax! 4) 


The inlet plane Z; is usually sufficiently far upstream of the 
blade row, where the flow can be considered to be uniform. 
The intersection of the family of 5S, surface with this Z,-plane 
is a family of curves. The variation of the stream function of 
two adjacent S, surfaces is the mass flow passing through the 
small area between the two adjacent curves (shaded area in 
Fig. 2). Then the variation of ¥, can be determined in the inlet 
plane Z, as foliows 


Ay. =(e WAA); =[p Wr? -— r2)Agl, (15) 
From equations (8) and (10) 
B/B,; = Ag/ Ag; = Ag/ Ay, (16) 


When Ay. approaches © as a limit, B/B; becomes the first 
derivative of ¢ with respect to ¥. Using first-order ex- 
pansion, the ¥ of S, surface can be determined siep by step as 
follows 
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Fig.3 Family of So surface formed progressiveiy from Sz) 
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Zz 
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Fig.4 Transformation of coordinates 


Oy 
OY) 
When the y-coordinate of (v + 1)th 8, surface is obtained, the 


fluid properties W', W*, and p on the S\'""! surface can be 
determined by power series from S:,, 


demo") — ots van SF) + (di! vNB/BD" U7 


| aw! 
Ww! (p)= Ww! {2 ,7) +{yp- Cad be (Pr) 


(g— ny a7 wy! 
2 Xr 


etc. Subsequently, equation (14) gives (B/B;)'""'| and the 
position of the next S, surface can be determined. This 
method may be called streamsurface extension method. A 
more accurate method is introduced in the next section. 

Coordinate Transformation and Direct Expansion 


Method 


In computation of the flow along the S;,, stream, the 
meridional plane is usuaily taken as the coordinate surface. 
The coordinate system is (x', x?, ¢)}. The coordinate 


transformation is as follows 
£=x(r,z) 
9 = X(7,Z) 
Yo = v2.9.2) 


The surfaces of revolution obtained by rotating around z- 
axis with the x' and x? curves, and the S;,, stream surfaces are 
the new coordinate surfaces. The following relations may be 
obtained directly 





(Pr) (18) 


(19) 


dg dg dg ag 
ax’ GE? On “ 
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-§ ~4 -2 Q@ 2 4 
Fig.5 Meridional projection of So,, 


an (Bt) ah GE) ae () 
dv, \axt/ ~~ ax! \ay,/ ab, \ax? 


_ 9 ff og 
~ oye a) (21) 
Wi= WW? = Wl wills (22) 


where a bold partial derivative sign is used ta denote the 
partial derivative of a quantity following the motion along 
S>,, with respect to x! or x* on the meridional plane. The w', 
w?, w', w" are used to represent the contravariant com- 
ponents of W in (x!, x*, ~) or (£, 9, y2) system, respectively. 
The derivatives of W', W?, W,,, and y, with respect to ¥2 can 
be obtained, The details are shown in Appendix B. Dift- 
ferentiating the relation (14) with respect to %, and combining 
with relations (21) determines the first derivative of the 
angular thickness 2 with respect to 1, 


ad —-l¢ 1 d _ acy! 
D (mB) NLT 8 (5 sntgr MO) 








Di OY, pP Wa dx! Jv2 
2 /-—. . oh~) dow!) anB 1 
+ axe (Veusinéar ay | a, we a 
a(aW*) ainB 1 
ay, ax? ae} m7) 


The ¢ coordinate of any 5, surface, especially those of the 
suction Or pressure surface of the two neighboring blades, can 
be obtained directly from S>,, by 4 power series expansion 


l aB 
PF Py, + (v2 _ V2 IB m, T, 9 (v2 — Yam)” ( By. ), 


+E 1—Vonl* (FF), 


The fluid properties W?, W?, and p on the suction or pressure 
surface can then be obtained by Taylor series 


on 
W* (Yas) = Wham) + bas — bam) a (Yon) 


(24) 


i a? Ww! 
7 > (Ys ~ Vom)? a PD (Yon) 


; 25 
ENP (25) 
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Fig.6 Distribution of ¥ qr in front of the stator 


tes, 





0 2 A & 3 f. 
Fig.? Oistribution of thickness 


The distribution of S, stream function y¥; in the flow passage 
can also be obtained by Taylor series, determines the shape of 
S, surfaces. This direct expansion method is more accurate 
than the streamsurface extension method described in the 
preceding section. 


Hlustrative Example 


FORTRAN IV programs based on the two methods 
presented herein have been coded to study the three- 
dimensional flow in axial turbomachines. The blade shape, 
the distribution of $, and S, surfaces, and the fluid properties 
in the flow passage can be obtained in less than a minute on 4 
UNIVAC-1100 computer. 

Example 1. The stator of a single-stage CAS research 
compressor reported in [16]. The mass flow is 61 kg/s, the 
rotor tip M is 1.4, the stage total-pressure ratio is 1.5, The 
hub-tip ratio at stator inlet is 0.49, the number of stator 
blades is 37. The projection of S,,, and its streamline 
distribution on the meridional plane is shown in Fig. 5. The 
specified radial variation of grin front of the stator is shown 
in Fig. 6. Figure 7 shows the blade thickness distribution 
specified. The blade shape and distribution of velocity on the 
k=:7 coordinate surface obtained by the two methods are 
shown in Fig. 8. The difference between the two is small. A 
comparison of Mach number distribution with that obtained 
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in the full three-dimensiona! solution [17] ts shown in Fig. 9. 
Except near the leading edge, the result obtained from the 
present method is close to that of the three-dimensional 
solution. The relative twist of §, surfaces at the suction 
surface (J = 4) is shown in Fig. 10. Rm ts the value of X on the 
S.,, surface of the same S$, surface. (2 is the local value to the 
point J=4, k=1,... 11.) It is seen that largest AR/R occurs 
a short distance from the hub wall and the maximum dif- 
ference between the present solution and the full three- 
dimensional solution also occurs there. It may be noted that 
the maximum relative twist is rather smali, being [css than 1 
percent. 

Example 2. A turbine rotor. The mass flow is 2.2 kg/s, the 
expansion ralio is 1.4, the rotor speed is 16,000 rpm, the 
number of blades is 60, and the hub-tip ratio is 0.66. The 
meridional projection of S,,, and its distribution of streamline 
are shown in Fig. 11. The blade shape and the circumferential 
variation of velocity on k=3 coordinate surface obtained in 
ihe direct expansion method are shown in Fig. 12. The in- 
iersections of the /=3, 4, 5 coordinate surfaces with the S, 
surface and S, surface are shown in Fig. 13. The direction of 
twist of the $, surface is basically consistent with the con- 
clusion reached in [22]; i.¢., under most conditions, the $, 
surfaces twist upward at the suction surface in the turbine 
Stator and inward in the turbine rotor. As this turbine is 
designed for radially nonuniform work output so that the 
twist involved is relatively large, but is still under 2 percent. 


Concluding Remarks 
1 In prescribing a desired flow variation on the midchannel 
hub to tip S;,, streamsurface in three-dimensional blade 


300 W m/sec 
250 -—— Stream Surface Extension 


200 ——TDirect Expansion 





“2 =f JO ff 
Fig.8 Blade shape and distribution of Won K =7 coordinate surface 
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—o— COMPLETE 3D SOLUTION 


design, it should be noted that prescribed variation should 
satisfy a constraint condition in order that the resulting blade 
to blade S, flow surfaces along the hub and casing walls are 
surtaces of revolution. 

2 A new coordinate transformation, in which the S, 
stream-surface is taken as a coordinate surfaces, leads to a 
new method for calculating partial derivatives of flow 
variables with respect to the stream function of S$ surface. 
High-order derivatives can be obtained, and consequently 
high-order series expansion can be used to compute the three- 
dimensional flow ficld. 

3 This three-dimensional mean-streamsurface series e¢x- 
pansion method is different from the two-dimensional mean- 
streamline series expansion method in thai the known flow 
variations along two general curvilinear coordinates on the 
mean streamsurfacce are now used to compute the cir- 
cumferential derivatives of the flow variables. Based on the 
three-dimensional flow field obtained by series expansion, the 
blade shape corresponding to a mass flow rate passing 
through the three-dimensiona!l blade channel can be deter- 
mined. The blade to blade surfaces obtained in the solution 
between (and not including} the hub wall and casing wall are 
general S, streamsurfaces, and twists of these S,; surfaces are 
readily available in the calculation. 

4 Two illustrative examples indicate that the method for 
solving the inverse problem of three-dimensional flaw by the 
use of the present method is a feasible quick approximate 
engineering method. The nature and magnitude of the twist of 
S, surface obtained in a high-subsonic, axial-flow compressor 
and a high-subsonic, axial-flow turbine are determined in the 
computation. 


—*i— PRESENT METHOD 





34 em 
Fig-9 Distribution Mach number onk=9 §, surface 
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Fig.10 Therelative twist quantity on the J =3 suction surface 
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Fig.11 Meridional projection of S$2,, 


5 This simple, quick, approximate three-dimensional 
method should be useful to evaluate the nature and order of 
magnitude of the departure of blade-to-blade 5, surface from 
surface of revolution and in different geomeirical con- 
figurations and design specifications, the accuracy of quasi- 
three-dimensional solution as compared toa full three- 
dimensional solution, and for three-dimensional blade design, 
especially where a certain three-dimensional distribution of 
blade thickness is desired. It may also be used as a three- 
dimensional analysis method by successively correcting the 
mean-streamsurface flow. This correction is similar to the 
successive correction of the mean-streamline shape in the two- 
dimensional analysis method. 
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ABSTRACT 

Design and development of propellers for submarines are 
in some ways different from propellers for surface 
vessels. The most important demand is low acoustic 
signature that has priority over propeller efficiency, and 
the submarine propeller must be optimized with respect to 
acoustics rather than efficiency. Moreover the operating 
conditions of a submarine propeller are quite different. 
These aspects are discussed as well as the weighing of the 
various propeller parameters against the design 
objectives. 


The noise generated by the propeller can be characterized 
as thrust noise due to the inhomogeneous wake field of 
the submarine, trailing-edge noise and noise caused by 
turbulence in the inflow. 


The items discussed are demonstrated in a case study 
where a propeller of the Kappel type was developed. 
Three stages of the development are presented, including 
a design of an 8-bladed propeller where the thrust 
fluctuations as well as the thrust noise were significantly 
reduced relative to a 7-bladed propeller. Results of 
measurements are in good agreement with calculations. 


Keywords 
submarine propeller, propeller acoustics, Kappel propeller, 
propeller noise. 


1 INTRODUCTION 

Design and development of propellers for submarines are 
in some ways different from propellers for surface 
vessels. The most important requirement for a submarine 
propeller is a low acoustic signature. In the design 
process the steps in the design procedure include 
estimates of speed and power and weight as the usual 
standard procedure. The main feature for the design of an 
acoustically optimised propeller is the frequent check of 
its acoustic properties. There are generally four 
conditions that must be considered: 


deep diving, turning and diving, 

periscope depth, diving and with acceleration, 
surface operation and high acceleration mode, 
wind milling, near the surface. 


The noise criteria to be considered include thrust noise at 
blade rate, trailing-edge noise, turbulence noise and 
structural noise. 


The parameters that can be varied during the design 
process are: diameter, blade number, skew and rake, 
circulation distribution (pitch and camber) and trailing 
edge geometry. For submarines there are generally fewer 
restrictions on the diameter than for normal surface ships 
and increasing diameter means decreasing specific 
propeller thrust and higher average inflow velocity. 


In the present case several designs were examined. 
Parameters varied include diameter, number of blades, 
blade shape (skew and rake and in particular tip 
geometry) and propeller turning rate. However, the paper 
focuses on the development and application of propellers 
of the Kappel type (non-planar lifting surface). Results of 
calculations, model test and full-scale tests are compared. 


2 DESIGN OBJECTIVES 

The naval submarine is a complicated vessel built to 
fulfill a multitude of conflicting requirements. Leaving 
aside the crew and weaponry contained and supported by 
the submarine, some of the objectives to be targeted by 
the overall vessel and by the propulsion system and its 
propeller are: 


e low acoustic, optical, electromagnetic, thermal 
and communication signatures, 


e large operational window with respect to 
endurance, speed and depth, 


e high manoeuvrability in surfaced and in 
particular in submerged condition, 


e best possible form and arrangement of hull and 
control surfaces to achieve a low hydrodynamic 
resistance and a smooth hull wake, 


e propeller with low excitation level with respect 
to noise and vibrations, 


e propeller totally free of cavitation with adequate 
margins, 


e propeller with high efficiency. 
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Figure 1: Typical U-boat wake. Low-velocity regions can 
be seen close to the hub at 1.30 and 3 o'clock positions 
and symmetrically at the port side. A remarkable region 
of relatively high velocity is present around 12 o'clock for 
the outer radii. 


A typical submarine wake is shown in Figure 1. It can be 
seen that it differs significantly from most single-screw 
surface ship wakes being without the usual low-velocity 
region in the upper part of the propeller disk, but with 
remarkable influence of the control surfaces just upstream 
of the propeller. This, in combination with the demand for 
adequate cavitation-free margin of propeller operation 
together with the strict requirements to noise, as outlined 
in Figure 2, is a challenge for the propeller designer. 
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3 DESIGN PROCEDURE 

The design procedure of the propeller should ideally 
include feedback to the design of the submarine hull and 
its control surfaces. The progress is iterative and can be 
illustrated with the classical design spiral. A similar 
design spiral can be outlined for the propeller design, cf. 


Figure 3. It illustrates the main feature in the design 
process of an acoustical optimized propeller: the frequent 
check of the acoustics. Optimization can be achieved by 
defined parameters which can influence each other. All 
design parameters should be considered simultaneously 
as far as possible. The evaluation of a solution and its 
possible consequences for the general design is done 
under the design step designated "feedback analysis". 
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Figure 3: Design spiral for submarine propellers. 


3.1 Design matrix 

Whereas the design spiral describes the design as a 
process the objectives and parameters must be specified. 
The overall requirements to the submarine leads to 
objectives specifically related to the design of the 
propeller. These objectives are to be achieved in each of 
the operating conditions mentioned in the introduction of 
this paper. 


To meet the objectives the submarine propeller must be 
carefully optimized and balanced with respect to the 
design parameters available. Some of the implications of 
this process can be demonstrated if the objectives defined 
and the parameters available are seen as a design matrix, 
cf. Table 1. 


4 DESIGN CONSIDERATIONS 
The issues presented in the design matrix in Table 1 lead 
to a number of detailed design considerations. 


4.1 Cavitation margin 

Most surface ships experience propeller cavitation during 
normal operation. The design effort for surface ship 
propellers is therefore directed towards avoidance of 
harmful cavitation in way of erosion and towards 
moderate levels of propeller excited noise and vibration. 
However, propeller cavitation must be completely 
avoided for a submarine in submerged condition in order 
to secure the lowest possible noise signature. For this 
reason, for a submarine in submerged condition, the 
cavitation margin that determines the non-cavitating 
operating regime is very important. In general, the 
cavitation margins are defined by inception of: 


Table 1 Design matrix. Influence: X little or none, XX moderate, XXX strong. 


cavit. free range low noise and vibration level 


excita- 


cavit. cavit. 
margin, margin 
PARAMETERS ee 
orthogonal blade length 
section profile 
circulation distribution 
pitch distribution 
camber distribution 
skew 
area ratio 
area distribution 
diameter 
RPM 
blade number 
thickness ratio 
trailing edge thickness 
anti-singing edge form 


material 


e  suction-side cavitation 

e suction-side tip-vortex cavitation 
e pressure-side cavitation 

e pressure-side tip-vortex cavitation 
e hub-vortex cavitation 


Suction and pressure-side cavitation can normally be 
avoided by allowing adequate room in the classical 
cavitation bucket. This may lead to a thicker section 
profile than used for a comparable cavitating propeller of 
a surface ship. 


Tip-vortex cavitation is — apart from ambient pressure 
less vapor pressure — dependent on the vortex intensity 
and the viscosity of the fluid. In turn the vortex intensity 
depends on the gradient of circulation at the outer part of 
the propeller blade. Submarine propellers are therefore 
generally tip relieved in order to obtain sufficient margin 
against tip-vortex cavitation. This leads to a less-than- 
ideal circulation distribution with respect propeller 
efficiency. As tip-vortex cavitation is also dependent of 
viscosity a correction of the results observed in the model 
tank will have to be applied to arrive at full-scale 
expectations, for example according to McCormick. The 
formation of tip-vortex cavitation observed in model tests 
seems very dependent on local tip geometry. An 
explanation could be that the amount of boundary-layer 
material ending up in the vortex core depends on the 
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blade tip geometry. This would in turn have an effect on 
the cavitation margin as more boundary-layer material 
would increase the viscous and turbulent boundary core. 


The hub vortex consists of the combined blade root 
vortices. Similar to the tip vortex, the hub vortex is 
dependent on the circulation gradient at the blade root. 
The intensity of the hub vortex can be reduced by 
relieving the blade loading towards the root. Similar to tip 
relieving this leads to reduction in propeller efficiency. 
Alternative means to suppress hub vortex cavitation may 
be used by applying a boss cap with small lifting surfaces 
which reduces the rotation of the hub vortex core. 
Furthermore, hub vortex cavitation can be suppressed if 
the boss cap is given a form that will generate enough 
turbulence to increase the vortex core and thereby avoid 
cavitation. 


Cavitation performance for submarine propellers is 
generally illustrated in a diagram presenting the cavitation 
number oy as function of the trust loading Ky at which 
the cavitation starts to occur. This presentation is not 
directly suitable when comparing propellers where 
parameters such as propeller diameter are different. 
Instead, it is suggested to compare the relative margin 
against cavitation 100AK7/K; as a function of the 
operational depth T,, for a given speed, where Ky is the 
design value and AKy is the difference between the 
design value and the cavitation limit of Ky, cf. Figure 4. 
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Figure 4: Relative margin against cavitation, i.e. cavitation 
starts at the deviations in thrust coefficient shown. The 
results are based on model tests without scale corrections 
for propellers 4 and 6 discussed in Section 6.2. 


4.2 Reduction of wake-induced propeller forces 

The most important parameter with respect to propeller- 
induced forces is the wake field itself which for many 
reasons should as fair as possible, be it surface ships or 
submarines. However, with a given wake field the 
number of propeller blades can probably be considered 
the most important issue. Increasing the number of 
propeller blades will reduce the unsteady force on each 
propeller blade, but not necessarily the total unsteady 
force. An example of unsteady force calculations with a 
panel method for propellers with various geometries and 
blade numbers are discussed in section 6.3. 


Increase of the propeller diameter may increase or 
decrease the unsteady forces depending on the radial 
distribution of the unsteadiness in the wake field. An 
example of variation of unsteady forces with propeller 
diameter calculated by a panel method is shown in section 
6.3. 


4.3 Skew, pitch and camber distribution 

The application of skew can reduce in particular the first- 
order thrust and torque fluctuations and improve the 
cavitation behaviour as commonly seen in surface ships. 
However, a fairly large skew angle is needed to reduce 
torque and thrust fluctuations significantly. Large skew 
angles lead to fairly complicated higher-order flexural 
modes of the propeller blades — not to mention the blade 
stresses when the propeller is going astern - for which 


reason large skew angles are often avoided in case of 
submarine propellers. 


4.4 Orthogonal 
distribution 

The circulation distribution has been discussed in 4.1 with 
respect to load relieving of tip and root in order to obtain 
sufficient margin against cavitation. However, the 
application of "non-planar lifting surfaces" (Andersen ef 
al. (2005)) changes the circulation significantly. It is no 
longer meaningful to present the circulation as a function 
of the radius, instead circulation as a function over the 
orthogonal blade length is used. This length is measured 
orthogonally to a surface of helicoidal streamlines passing 
the propeller blade. For a conventional propeller with a 
radial generator line the orthogonal blade length is that of 
the radial line independent of skew, whereas the 
orthogonal line with or without skew might be about 10 
per cent longer for the propeller with non-planar lifting 
surfaces. A typical circulation distribution for a Kappel 
propeller with non-planar lifting surfaces and a 
conventional propeller is shown in Figure 5. Note that 
gradient or derivative of the circulation distribution and 
the maximum circulation is smaller in case of the Kappel 
propeller. Both facilitate the adjustment of the load 
distribution with respect to increase of cavitation margins 
and blade-area distribution with respect to reduction of 
unsteady forces. 
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Figure 5: Spanwise circulation distribution of a 


conventional and a Kappel propeller designed for the 
same task. So is the length parameter along the orthogonal 
blade. 


5 ACOUSTICS 

The effectiveness of a submarine is highly dependent 
upon its noise signature, the special task of the boat 
giving it mainly a passive role and control function. The 
hydrodynamic flow noise dominates in particular the 
performance of the sonar system. The own noise of the 
boat correlates with the detection range, with respect to 
detecting other ships and to being detected. Three types 
of noise can be identified: 


e Thrust noise, dominating the noise level in the 
low frequency range (<100 Hz), 


e Trailing edge noise (200-1000 Hz), 


e Broad band noise (100-1000 Hz), caused by 
turbulence in the inflow. 


9.1 Thrust fluctuations 

The oscillating thrust of the propeller is probably the 
dominating acoustic source. On the other hand thrust 
fluctuations are not the only source as far as noise is 
concerned. Measurements in full scale indicate 
differences from theoretical calculations. In general, a 
reduction of thrust fluctuation by 50 per cent yields a 
reduced value for the noise level of 6dB which 
theoretically halves the detection range of the submarine. 
The acoustic level (dipole source) is proportional to the 
harmonic order m, blade-number Z, turning rate n, and 
the m-harmonic fluctuating force and consequently the 
diameter, skew and number of blades of the propeller 
must be selected with care. 
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Figure 6 Reduced noise (80 per cent reduction of thrust 


fluctuation) for the 8-bladed propeller relative to the 7- 
bladed propeller. 


This is illustrated in Figure 6 where a large reduction in 
the thrust-fluctuation noise has been obtained by 
increasing the blade number from 7 to 8. The reduction in 
thrust variation is calculated by the boundary-element 
method, cf. Table 3, and the noise predicted by the 
procedure by Ross (1986) on the basis of these results. 
These results are in accordance with measurements, cf. 
Figure 10. 


Although thrust fluctuations can be reduced by a good 
propeller design it is important to emphasize that the 
wake optimization should begin early in cooperation with 
the propeller designer. A poor flow field is not a good 
baseline for the process of minimizing noise. 


5.2 Trailing-edge noise 
The trailing-edge noise should not be neglected. For 
submarines the whole noise spectrum is of concern, 


whereas for surface ships the immediate problem is 
propeller singing, cf. Carlton (2007). For this reason 
submarine propellers are rarely provided with anti- 
singing edges, unless there are specific problems. 


Theoretical work on trailing-edge noise has been done by 
Lighthill (1952) and Blake (1986), and in addition 
measurements exist, cf. Carmargo et al. (2005). These 
measurements show clear evidence of vortex shedding 
and correlation with the noise level, indicating that the 
frequency increases somewhat with flow speed and that 
the harmonic content also increases with angle of attack 
of the blade sections. 


The concentrated fluctuation intensity may be expressed 
by (1st approach by Ross (1987)) I ~ 5.6*10°°. For a 
blade section the typical sound level is in the range of 
27dB (f ~ 350 Hz). The Strouhal number is estimated as 
0.2 for all conditions. 


In a case with a full-scale propeller with a blunt trailing 
edge the noise level was nearly 50dB indicating most 
probably induced trailing-edge noise in accordance with 
the results of Figure 7. 


The overall blade shaping, the trailing-edge geometry and 
the boundary layer in the local region are decisive for the 
trailing-edge noise, cf. Blake (1986). Reshaping the 
trailing-edge region altogether shows some progress. 
Wang et al. (2006) show results of a numerical optim- 
ization of trailing-edge noise where a_ substantial 
reduction in the vortex shedding was obtained with a 
corresponding improvement in the acoustic spectrum, in 
particular in the low-frequency range. 
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Figure 7 Trailing-edge noise generation and vortex 
strength as function of trailing-edge geometry. The 
Kappel propeller examined generally all have a form 
factor of 0.22 as indicated. 


In Figure 7 results of a simple analysis is presented of the 
correlation between trailing-edge geometry, trailing-edge 
vortex strength and noise. The analysis is made following 
the procedure by Blake (1986) for two-dimensional 


sections where the trailing edge is varied locally, from 
very blunt shapes to very sharp forms. The shapes are 
described by a form factor. From the figure it is seen that 
there is a correlation with the trailing-edge vortex 
strength, the more blunt trailing edge giving more noise. 


In general, the noise starts at frequencies larger than 
200Hz up to 1000Hz, and the strength varies with the 
blade section radius, the wake harmonics and partly with 
the local coherency of the vortices. For high Reynolds 
numbers (larger that 10°) the situation in unclear. 


In the worst case singing will occur where the trailing- 
edge noise gives strong tonal spectra and blade structural 
regions synchronise their vortex shedding in the so-called 
“lock-in effect”. Generally, the structural dynamic 
behaviour should be taken into account in the concept of 
propeller design. The blade geometry influences the 
eigenmodes, but the highly damped material “Sonoston” 
helps in some cases to reduce the peak magnitude 
somewhat. 


5.3 Inflow-induced noise 

The inflow to the propeller is a stochastic flow field 
superimposed the steady inflow field. The turbulence 
intensity and the integral length scale are decisive values. 
The inflow turbulence increases the noise-spectrum, 
especially if the turbulence length is equal or close to the 
blade distance. Although this phenomenon has not been 
experienced with open propellers it occurs for pumpjets 
with high blade numbers. 


5.4 Perspective 

For further developments of understanding of the flow- 
induced noise, as introduced in this paper, a viable and 
promising alternative would be systematic acoustic meas- 
urements, for example using "piezo-technique" on the 
surface of the blade (trailing-edge region), last but not 
least to verify theoretical calculations. The approach by 
Lighthill equations (Lighthill (1952)) can be designated 
as a first estimation. If enhanced predictions are to be 
made the efforts will increase strongly. More compre- 
hensive software packages (hybrid procedure) exist on 
the market and use more or less the principles of Lighthill 
analogies as well. The applicability of this software is de- 
monstrated for some examples, but the applications for 
the propeller design process are most doubtful. What are 
needed first are accurate experiments in model and full 
scale which are essential to scaling the theoretical 
calculations. 


6 EXAMPLE 

The development of a propeller for a particular submarine 
design is described in the following and it demonstrates 
some of the problems involved in the design process. 


6.1 Initial design stage 

For the initial stages of design four designers were invited 
to present their best proposal for the submarine propeller. 
The propellers were tested at Potsdam Model Basin, 
(Schiffbau-Versuchsanstalt Potsdam, SVA), Germany, in 
their cavitation tunnel in open water and behind a dummy 





Figure 8: Model of Propeller 8, the 8-bladed Kappel 
propeller as discussed in Section 6.3. 


model of the submarine. The initial design stage was 
followed by a second design stage in which the design 
restrictions with respect to the first order natural blade 
frequency were relaxed. Two propellers were tested in the 
model tank at SVA in the second design stage. Two 
further designs were tested in the model tank in a third 
design stage. All propellers are fixed pitch and right- 
handed. Propellers 4, 6 and 8 were of the Kappel type. 
The detailed geometries of the propellers were quite 
different. The main parameters of the eight propeller 
designs are listed in Table 2 with Propeller 4 as reference. 
In this paper the particular design work with three Kappel 
designs will be described even though several other 
designs were also developed and tested. 


Propeller 4 (Kappel) of the first design stage was tested in 
full scale with the original trailing-edge geometry and 
with a trailing-edge geometry modified corresponding to 
an anti-singing edge for a surface ship, sloped from the 
suction side. This modification reduced the noise level 
significantly in the frequency range 150 to 350 Hz, as 
indicated in Figure 9. Propeller 4 showed moderate thrust 
fluctuations after the modification. 


6.2 Second design stage 

Based on the model-test and full-scale results of the first 
design stage and on relaxed conditions with respect to 
first-order natural blade frequency, two further designs 
were made in the second design stage. Relaxation of the 
design requirements made possible a_ substantial 
improvement of the cavitation margin and small reduction 
in the thrust fluctuation. Propellers 5 and 6 (Kappel) were 
designed in this stage. 


Table 2. Propeller designs 


1“ design stage 


2" design stage 3" design stage 


a. es — — Propeller — Propeller — Propeller 


Diameter (relative) 


Pitch ratio 
Skew, degree 
Blade area ratio 


No. of blades 
*Propellers 4, 6 and 8 are of the Kappel type. 


Original trailing edge - -----Modified trailing edge 


level (dB) 





Ys ae ee ee ee 
hp * 
a ee ee ee ee 


frequency 


Figure 9: Envelopes of measured full-scale thrust 
fluctuations for Propeller 4 (cf. Table 2) with original and 
modified trailing edges. 


6.3 Third design stage 

The harmonic analysis of the wake of the submarine 
indicated that reduced propeller excitations might be 
expected in case a six-bladed and particularly an eight- 
bladed propeller were selected instead of a seven-bladed 
propeller. For this reason the propeller forces and 
moments were calculated for propellers in the behind 
condition by use of the boundary-element method in the 
third design stage. The propellers were assumed to be 
non-cavitating. The calculations were carried out for the 
seven-bladed versions, Propeller 4, Propeller 6 and 
furthermore for eight-bladed propellers with diameters 
(relative) of 1.0, 1.025, 1.05 and 1.1 (versions of 
Propeller 8). Table 3 gives an overview of forces and 
moments calculated. According to these calculations the 
design changes from Propeller 4 to Propeller 6 resulted in 
a minor reduction of thrust fluctuation. However, the 
change from seven to eight blades reduces the thrust and 
torque fluctuations to about 1/5. The fluctuation of the 
vertical force component is of the same magnitude for the 
blade numbers seven and eight, whereas the horizontal 
fluctuation increases 2'2 times when changing to eight 
blades. 


The thrust fluctuations were furthermore investigated in 
the circulating model tank of SVA. The model tank 





results confirm the significant reduction found by the 
calculations when changing from the seven to eight 
blades. The model tests for Propeller 6 (seven blades) and 
Propeller 8 (eight blades) are shown in Figure 10. The 
difference between the seven-bladed and the eight-bladed 
propellers is significant and the difference is more than 
10dB which correlates well with the theoretical Lighthill 
calculation based on the values of the panel calculation. 


The optimum diameter with respect to efficiency is found 
to be in the range 1.0 to 1.025 for the eight-bladed 
propeller version. The thrust and torque fluctuations 
calculated by the panel method increases with diameter, 
whereas the horizontal and vertical force fluctuations 
decrease marginally. The intensity of the tip vortex will in 
principle decrease as the diameter is increased, thereby 
increasing the margin against tip vortex cavitation. 
However, the cavitation margin in general will decrease 
as the diameter increases with constant RPM. An eight- 
bladed propeller with a diameter of 1.0125 (Propeller 8) 
was designed and tested in the model test facilities of 
SVA. 


Propeller 6 
Propeller 8 





frequency 


Figure 10. Spectral analysis up to 120 Hz of thrust 
fluctuations, Propellers 6 and 8 (respectively 7 and 8 
blades). The propeller model is turning at 13 rps. The 
reduction is more than 10 dB at the lst order blade rate, 
91 Hz respectively 104 Hz, when comparing Propeller 6 
and 8. 


By courtesy of Schiffbau Versuchsanstalt Potsdam (SVA). 


Table 3: Total forces and moments acting on propellers in inhomogeneous inflow 


Propeller aaa — Propeller | Propeller | Propeller | Propeller 

8.1 8.2 8.3 8.4 
D, (relative) 1.0 1.025 1.05 Ll 
No. of blades 


0.25942 
0.0517 
0.0093 
0.0071 
0.04326 
0.0076 


0.0032 
0.0051 


0.25081 
0.0481 

0.00767 
0.00685 
0.04221 
0.00771 
0.00324 
0.0052 


0.25296 
0.0089 

0.00963 
0.01724 
0.04296 
0.00171 
0.00519 
0.00964 


0.23448 
0.01006 
00.0086 
0.01541 
0.03880 
0.00183 
0.00470 
0.00895 


0.21288 
0.00918 
0.00756 
0.01364 
0.03440 
0.00170 
0.00416 
0.00817 


0.16736 
0.00918 
0.00557 
0.01074 
0.02608 
0.00149 
0.00313 
0.00682 


Akx = kX max - KXmin 





Kx*=Akx/Kx+Akmx/Mx 


Kx* 0.3750 0.3745 0.0750 0.0901 0.0952 0.1120 Kyz*=Aky/Kx+Akz/Kx 
Kyz* 0.2551 0.2579 0.4515 0.4541 0.4580 0.4790 +Akmy/Mx+Akmz/Mx 


7 CONCLUSION 

The development work for submarine propellers has been 
described and the differences relative to surface 
propellers outlined. The design process has _ been 
illustrated for a propeller of the Kappel type applied to a 
submarine design. The main concern has been a low 
acoustic signature and one of the means of obtaining this 
has been using eight blades, relative to seven blades used 
initially. Theoretical calculations as well as model tests 
demonstrate that lower thrust fluctuations were obtained 
accompanied by lower thrust noise. 
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Abstract 


Albatrosses have been observed to soar in an upwind direction using what 1s called 
here an upwind mode of dynamic soaring. The upwind mode is modeled using the 
dynamics of a two-layer Rayleigh cycle in which the lower layer has zero velocity and 
the upper layer has a uniform wind speed of W. The upwind mode consists of a climb 
across the wind-shear layer headed upwind, a 90° turn and descent across the wind- shear 
layer perpendicular to the wind, followed by a 90° turn into the wind. The increase of 
airspeed gained from crossing the wind-shear layer headed upwind is balanced by the 
decrease of airspeed caused by drag. Results show that a wandering albatross can soar 
over the ocean in an upwind direction at a mean speed of 8.4 m/s in a 3.6 m/s wind, 
which is the minimum wind speed necessary for sustained dynamic soaring. The main 
result is that an albatross can soar upwind much faster that the wind speed. The upwind 
dynamic soaring mode of a possible robotic albatross UAV (Unmanned Aerial Vehicle) 
is also modeled using a Rayleigh cycle. Maximum possible airspeeds are approximately 
equal to 9.5 times the wind speed of the upper layer. In a wind of 10 m/s, the maximum 
possible upwind (56 m/s) and across-wind (61 m/s) components of UAV velocity over 
the ocean result in a diagonal upwind velocity of 83 m/s. In sufficient wind, a UAV 
could, in principle, use fast diagonal speeds to rapidly survey large areas of the ocean 
surface and the marine boundary layer. Limitations to achieving such fast travel velocity 


are discussed and suggestions are made for further studies. 


Keywords: dynamic soaring, energy-neutral soaring, albatross, robotic albatross, 
Unmanned Aerial Vehicle, UAV 


Highlights: 

e Dynamic soaring of albatrosses and UAVs was simulated with a two-layer 
dynamic model. 

e Observed upwind dynamic soaring of albatrosses was successfully modeled. 

e Upwind dynamic soaring of a robotic albatross UAV was simulated with the same 
model. 

e A UAV is able to soar upwind much faster than an albatross due to superior 
strength. 


e A possible fast UAV survey mode uses parallel diagonal tracks relative to wind. 


1. Introduction 


On a research cruise to the South Atlantic I watched with amazement as a wandering 
albatross soared upwind parallel to our ship, which was steaming upwind at 6 m/s (12 
knots) into a 7 m/s head wind. The bird came from leeward of the ship and caught up 
with us, indicating it was soaring significantly faster than our 6 m/s speed. At first 
thought, it seemed almost impossible that an albatross could soar upwind without 
flapping its wings with a mean velocity through the air faster than 13 m/s, significantly 
faster than the 7 m/s wind. More observations of albatrosses soaring upwind and further 
reflection suggested that such upwind soaring is made possible by using dynamic soaring 


to exploit wind shear, possibly supplemented by updrafts over waves. 


The term dynamic soaring refers to the extraction of energy from the gradient of wind 
velocity. In dynamic soaring an albatross typically climbs across the wind-shear layer 
while headed upwind, turns to head downwind, descends down across the wind-shear 
layer, and then turns upwind again (Baines, 1889; Idrac, 1925; Pennycuick, 2002). Both 
crossings of the wind-shear layer and the turn downwind can provide energy for 
sustained or energy-neutral soaring. Usually an albatross executes these two turns in an 
S-shaped maneuver while soaring across-wind, which is the preferred direction for 


albatross soaring (Alerstam et al., 1993; Wakefield, 2009). 


Despite many studies of dynamic soaring there 1s still uncertainty about how fast a 
bird can soar upwind using dynamic soaring. The uncertainty 1s caused in part by the 
relatively slow upwind velocity predicted by models of dynamic soaring that use an 
average wind profile above a flat ocean surface. There are benefits to being able to fly 
fast upwind such as being able to rapidly travel to a good upwind fishing ground, being 
able to return and feed a hungry chick after being stuck downwind in a wind shift, and 


being able to keep up with a ship and scavenge discarded food scraps or discarded 


bycatch from fishing. That was probably why the albatross was shadowing our ship, 
which had numerous spools of wire and other mooring gear on the fantail and thus 


resembled a commercial fishing vessel. 


The purpose of this paper is to explain how a bird can fly fast upwind by using a 
particular form of dynamic soaring that is somewhat different from the usual across-wind 
dynamic soaring maneuver. In particular a bird is able to achieve fast upwind flight by 
using a Series of 90° turns, starting with a climb across the shear layer headed upwind 
banked to the right (for example), then a descent across the shear layer headed across 
wind, and switching to a banked turn to the left toward the wind direction again. The 
albatross that was shadowing our research ship was doing this maneuver, starting headed 
across wind in a wave trough, turning upwind and climbing over the top of a wave, then 


turning back across wind and diving down into another wave trough. 


The goal of this paper is to explore the upwind mode of travel velocity by using the 
Rayleigh cycle model to predict upwind travel velocities of an albatross and a robotic- 
albatross type UAV (Unmanned Aerial Vehicle). Part of the motivation for this study is 
to investigate the possible development of a robotic albatross UAV that could soar over 
the ocean using dynamic soaring like an albatross but much faster because of superior 
strength and aerodynamic performance. Possibly such a UAV could be used in 
environmental monitoring, search and rescue, and surveillance. In order to evaluate how 
effectively a UAV could survey, it 1s necessary to establish typical upwind and across- 
wind soaring abilities. The UAV soaring was modeled on the observed soaring flight of a 


wandering albatross (Diomedea Exulans). 


2. Rayleigh cycle model of dynamic soaring 


Lord Rayleigh (1883) was the first to describe how a bird could use the vertical 
gradient of the wind velocity to obtain energy for sustained dynamic soaring, although he 
did not call it that. He envisioned a two-layer model with different wind speeds (W) in the 


two layers, the speeds differing by an amount indicated by AW. He described how a 


circling bird could gain airspeed equal to AW by climbing headed upwind across the 
boundary between layers and gaining another AW by descending headed downwind 
across the boundary, a total gain of 2AW in a circle (Fig. 1). If the wind gradient were 
sufficiently large, then it is possible that enough energy could be obtained by this 
maneuver to sustain soaring and even climb upwards in the wind gradient. This maneuver 


has become known as the Rayleigh cycle. 


( 20 m/s (Albatross airspeed) 
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Fig. 1. Idealized example of the airspeeds of a dragless albatross dynamic soaring through a thin 
wind-shear layer, which is assumed to consist of an increase in wind speed from zero below the 
layer to 5 m/s above. The zero speed in the lower layer represents the low wind speed located in 
wave troughs. This schematic is based on the written description of Rayleigh (1883) who first 
suggested that a bird could continuously soar in nearly-circular flight on an inclined plane that 
crosses a thin wind-shear layer. Starting in the lower layer with an airspeed 15 m/s a bird climbs 
upwind a short distance vertically across the wind-shear layer, which increases airspeed to 20 
m/s. The bird turns and flies downwind with the same airspeed of 20 m/s. During the turn, ground 
speed increases to 25 m/s in a downwind direction and consists of the bird’s 20 m/s airspeed plus 
(tail) wind speed of 5 m/s. The bird descends downwind a short distance vertically across the 
wind-shear layer, which increases airspeed to 25 m/s. Flying with an airspeed of 25 m/s the bird 
turns upwind. Thus, one circle through the wind-shear layer increases airspeed and groundspeed 
from 15 m/s to 25 m/s (two times the 5 m/s wind speed increase across the wind-shear layer). By 
descending upwind (dashed line) the bird’s airspeed would have decreased from 20 m/s back to 
15 m/s with no net gain in airspeed. This schematic shows an oblique view of near-circular flight. 


In order to develop a (fairly) simple dynamic model of albatross dynamic soaring 
Richardson (2011) balanced the airspeed gain in the Rayleigh cycle with the loss of 
energy due to the drag of the air acting on the bird. The model is referred to as the 
Raleigh cycle model. Drag was modeled with a quadratic drag law in which the drag 
coefficient 1s proportional to the lift coefficient squared. Two homogenous layers were 
assumed, a lower layer of zero wind speed and upper layer of wind speed W. A thin 
wind-shear layer is sandwiched between the upper and lower layers. The zero wind speed 
in the lower layer represents the region of low wind speeds located in wave troughs. The 
wind in the upper layer represents the fast wind blowing above wave crests. This model 
agrees with the concept of an albatross gust soaring proposed by Pennyquick (2002) in 
which a wind gust is encountered when a bird pulls up from the low-wind region in a 


wave trough into the faster wind (the gust) located just above a wave crest. 


The essential assumptions in the Rayleigh cycle model are that 1) the bird soars in 
nearly circular loops along a plane tilted upward into the wind, 2) the plane crosses the 
wind-shear layer at a small angle with respect to the horizon so that vertical motions can 
be ignored, 3) the average airspeed and average glide ratio can be used to represent flight 
in the circle, and 4) conservation of energy in each layer requires a balance between the 
sudden increase of airspeed (and kinetic energy) caused by crossing the shear layer and 
the gradual loss of airspeed due to drag over each half loop, resulting in energy neutral 


soaring (see Appendix A for a more complete description of the model.) 


3. Dynamic Soaring Maneuvers 


I have observed albatrosses to fly in three typical dynamic soaring maneuvers. The 
first is a circling hover mode with a bird’s wings banked in the same direction similar to 
the basic Rayleigh cycle shown schematically in Figure 1. A minor modification of this is 
a figure-eight-shaped hover mode in which a bird switches the direction of its banked 
wings on every other crossing of the shear layer. The second maneuver is an across-wind 
travel mode, consisting of a series of 180° turns banked successively right then left (say), 


while the bird is headed on an average course across the wind (Fig. 2). Using this mode, 
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Fig. 2. Schematic examples of upwind and across-wind modes of dynamic soaring 
created by linking 90° and 180° turns into snaking flight patterns. The 16.0 m/s cruise 
speed of a wandering albatross was used to calculate a 10.2 m/s mean travel velocity 
through the air in the across-wind and along-wind directions and diagonal travel 
velocities of 14.4 m/s. Arrow heads are placed at points where the bird crosses the wind 
shear layer when headed either upwind or downwind and thereby gains airspeed and 
kinetic energy. 


an albatross can soar quite fast perpendicular to the wind velocity. Tracking of 
albatrosses suggests that this 1s the usual dynamic soaring mode (Wakefield et al., 2009). 


It is not obvious how an albatross could use either of these techniques to soar very fast 


upwind, although some models have suggested that they can, including one based on the 
across-wind mode tilted in an upwind direction (Richardson, 2011). A third maneuver is 
an upwind travel mode in which an albatross can soar quite fast in an upwind direction as 
Idrac (1925) and I have observed (Fig. 2). I believe that this upwind mode better 
represents how an albatross actually soars upwind and is a more efficient way to gain 


energy from dynamic soaring than a tilted across-wind mode. 


3.1 Upwind travel mode 


In this mode a bird first climbs up across the wind-shear layer headed upwind, banks 
and turns 90° to the right (say). The bird then descends across the wind-shear layer 
headed perpendicular to the wind, banks left and turns 90° into the wind again. Using a 
series of 90° turns a bird zig-zags diagonally upwind through the air at an average angle 
of around 45° relative to the wind direction (Fig. 2). By adjusting the direction of the 
turns, either to the left or right after an upwind climb, a bird can effectively tack in any 
direction it chooses including (on average) directly upwind. To soar directly upwind the 
bird would execute a series of turns, first right then left, after upwind climbs (Fig. 2) 
resulting in a series of 180° turns similar to the across-wind mode. This technique could 
be also used for fast down-wind soaring if a bird replaces upwind climbs across the wind- 


shear layer with downwind descents across the wind-shear layer. 


The energy balance in a set of two 90° turns in the diagonal upwind mode is almost 
exactly the same as in one 180° turn. Airspeed gained in an upwind crossing of the wind- 
shear layer is reduced by drag during the two 90° turns, which equal the same amount of 
angular turn as one 180° turn. The main difference is an extra change in banking direction 
between 90° turns. As long as the bird can change its bank angle rapidly compared to the 
whole maneuver then the dynamics would be unchanged from the basic circular Rayleigh 
cycle model. The typical observed period of time for the two 180° turns of the hover 
mode, the across-wind mode, and the direct upwind mode is around 10 s for a wandering 


albatross, and thus the implied period of two 90° turns would be around 5 s. However, the 


albatross I observed soaring upwind executed the two 90° turns in around 10 s. (More 


about this below.) 


3.2 Numerical example of upwind travel mode 


The cruise air speed, V. = 16 m/s, of a wandering albatross is its speed at the 
maximum glide ratio, which is around 21.2 in straight flight (Pennycuick, 2008). Using 
these values, a loop period of 10 s, the aerodynamic equations of motion and the 
Rayleigh cycle model I found that the minimum wind speed in the upper layer that can 
support energy-neutral dynamic soaring 1s 3.6 m/s (Eq. A.4). This value is in good 
agreement with the results of numerical simulations using the full aerodynamic equations 
(Lissaman, 2005, personal communication; Richardson, 2011; Sachs, 2005). The Raleigh 
cycle model predicts that the bird’s airspeed would be 14.2 m/s just before crossing the 
shear layer, would increase by 3.6 m/s on crossing the shear layer, and would reach 17.8 
m/s just afterward. The 17.8 m/s speed would then be reduced by drag to 14.2 m/s by the 
end of a 180° turn or two 90° turns. The average bank angle of the maneuver would be 


around 46° and the average acceleration acting on the bird around 1.4 g. 


The average across-wind travel velocity through the air of an albatross is equal to the 
diameter of a 180° semi-circular turn divided by the 5 s to fly it. Thus the across-wind 
component of soaring flight would be equal to 2V,/z = 10.2 m/s, where V, is the 16 m/s 
cruise airspeed (Table 1, Fig. 3). In the upwind mode of flight consisting of a series of 
90° or 180° turns, the average upwind velocity through the air would also equal 10.2 m/s 
(assuming the same 5 s to fly 180°), as would the downwind velocity. Diagonal travel 
velocity at an angle of 45° relative to the wind is the resultant of the across-wind and 
upwind values and equals 14.4 m/s, almost as fast as the bird’s cruise airspeed (16 m/s). 
The relatively fast diagonal travel velocity is possible because the diagonal series of 
linked 90° turns more closely approximates a straight course than does a series of linked 


180° turns in the across-wind or upwind directions. 


Table 1 
Wandering Albatross and UAV Travel Velocities 


Wandering Albatross (V, = 16 m/s) UAV (V=9.5W) 

Travel Direction Travel Velocity Travel Velocity Travel Velocity Travel Velocity 
Through Air (m/s) = Over Ground (m/s) Through Air Over Ground 

Upwind 10.2 8.4 6.1W 5.6W 
Diagonal Upwind 14.4 13.2 (51°) 8.6W 8.3W (47°) 
Downwind 10.2 12.0 6.1W 6.6W 
Diagonal Downwind 14.4 15.7 (140°) 8.6W 9.0W (137°) 
Across-wind 10.2 10.2 6.1W 6.1W 


Values of travel velocity (m/s) of a wandering albatross were calculated using its cruise airspeed 
(V.) of 16 m/s at the maximum lift/drag ratio of 21.2, a loop period of 10 s, the Rayleigh cycle 
model, a wind speed of 3.6 m/s (the minimum for dynamic soaring), and a combination of 
upwind and across-wind soaring modes as shown in Fig. 2. The average bank angle 1s 46° and 
average acceleration is 1.4 g. The UAV’s maximum airspeed was calculated with the Rayleigh 
cycle model to be 9.5W, where W is the wind speed in the upper layer (Eq. A10, see text). This 
relation, UAV airspeed = 9.5W, is a good approximation for wind speeds greater than around 5 
m/s. UAV travel velocities were calculated using this airspeed and a combination of soaring 
modes as shown in Fig. 2. In order to obtain the travel velocity over the ground, values of travel 
velocity through the air were corrected for leeway, estimated to be one half the wind speed of the 
upper layer. Values in parentheses are the angles of travel velocity relative to the wind direction, 
which was assumed to be from zero degrees. 


3.3 Leeway 


In all three modes of dynamic soaring maneuvers there is downwind leeway caused by 
the portion of flight in the upper wind layer. A bird spends approximately half of its time 
in each layer and thus the average leeway in a maneuver equals around one half of the 
wind velocity of the upper layer. This leeway value needs to be applied to the bird’s 
mean velocity through the air in order to obtain the bird’s travel velocity over the ground. 
In principle, a bird could extend the upwind flight legs to compensate for leeway. This 
would make the hover mode more stationary and make the across-wind travel mode 
perpendicular to the wind. It would require gaining a bit more energy from the wind- 


shear layer and a slightly faster wind than without compensating for leeway. 


In the upwind mode, the upwind velocity of 10.2 m/s would be countered by leeway 
of 1.8 m/s (half the wind speed in the upper layer) resulting in an upwind travel velocity 
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over the ground of 8.4 m/s (Table 1, Fig. 3). Correcting for leeway results in a diagonal 
upwind travel velocity of 13.2 m/s at a direction of around 51° relative to the wind. In a7 
m/s wind the upwind travel velocity would be 6.7 m/s, enough to follow our ship but not 


enough to rapidly catch up to it from astern. 


Albatross Travel Velocity Polar Diagram (m/s) 





Fig. 3. Travel velocity polar diagram of a wandering albatross created using upwind and across- 
wind dynamic soaring modes (see Fig. 2). The square shape is a result of the equal travel 
velocities through the air in the along-wind and across-wind directions. Values of mean travel 
velocities over the ground have been added to characteristic flight directions. The diagram was 
created using the cruise airspeed (16.0 m/s) of the bird and the minimum wind speed (3.6 m/s) 
necessary for continuous dynamic soaring. Values were corrected for leeway (1.8 m/s), which 
was estimated to be one half of the wind speed, based on assuming that the bird spends equal time 
in the upper and lower layers. 


The diagonal travel velocity over the ground is significantly faster than either the 
direct upwind or across-wind travel velocities. This suggests that diagonal flight would 


be the fastest way to travel over the ocean, especially downwind diagonal flight at 15.7 
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m/s, which is very close to the bird’s 16 m/s cruise velocity through the air. An albatross 
could choose a diagonal flight mode in order to make the fastest passage to feeding 
grounds or a fast return. The implication is that the fastest way an albatross could search 


a particular region for food would be to use a diagonal flight pattern. 


3.4. Across-wind descent 


An important detail should be mentioned about the descent across the wind-shear 
layer when headed perpendicular to the wind. As a bird soars in the upper layer on a 
heading of 90° with respect to the wind direction (considered to come from zero degrees), 
for example, the wind causes a downwind leeway to the bird’s right at the wind speed 
(3.6 m/s) so that the direction of flight over the ground is toward 103°. When descending 
on a 90° heading across the wind-shear layer, the apparent wind encountered by the bird 
shifts to the right around 13° due to the leeway component. At this point the bird could 
quickly yaw to the right to head into the apparent wind direction then bank left to start the 
“9O0°” upwind turn, which would be approximately 103°. In order to retain the upwind 
travel mode Raleigh cycle of approximately two 90° turns that sum to 180° a bird could 
descend across the wind shear layer when headed toward 83.5° (after a 83.5° turn), then 
quickly yaw right toward 96.5° (the direction of the bird’s velocity over the ground), then 
bank left and turn 96.5° into the wind. The sum of these two turns is 180°. Presumably, a 
bird would know how to maximize its upwind velocity by optimally adjusting the turns 


somewhat like the maneuver envisioned here. 


3.5 Dynamic soaring in higher wind speeds 


In principle, albatrosses can use winds faster than the minimum (3.6 m/s) required for 
dynamic soaring to gain more energy from the wind and fly faster than the cruise 
airspeed of 16 m/s. They probably actually do this at least for moderate increases of 
airspeed. The albatross I saw using the upwind soaring mode in a wind of 7 m/s was 


completing the two 90° turns in around 10 s. Using these values the Rayleigh cycle 
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model predicts that the bird could soar with an average airspeed of around 24 m/s, an 
average bank angle of 38°, and average acceleration of around 1.3 g, which seems 
reasonable. This airspeed corresponds to upwind and across-wind travel velocities 
through the air of 15.6 m/s. Correcting for leeway gives an upwind travel velocity over 
the ground of 12.1 m/s. This is almost twice as fast as the upwind travel velocity (6.7 
m/s) with a 16 m/s airspeed and 7 m/s wind. This result suggests that the albatross I 
observed had more than enough upwind travel velocity to catch up with our ship (as 
observed). The implication is that as long as the acceleration of an albatross can be kept 
to a reasonable level below the maximum strength of the bird’s wings, the bird could 
increase its upwind and across-wind travel velocity with winds faster than 3.6 m/s. The 
Rayleigh cycle model indicates that an albatross could increase its airspeed as wind speed 
increases and, by increasing the period of the maneuver, maintain a constant average 
bank angle (45°, say) and acceleration (1.4 g) typical of dynamic soaring at the minimum 
wind speed (3.6 m/s). This suggests that the travel velocities in Table 1 and in Figure 3 
could underestimate real albatross travel velocities in wind speeds faster than 3.6 m/s. 
However, there is some contrary evidence against faster airspeeds in faster winds as 


discussed below. 


3.6 Discussion of albatross airspeeds faster than 16 m/s 


Wakefield et al. (2009) found a linear relationship between the ground speed of 
albatrosses and the wind-speed component in the direction of flight. For example, the 
ground speed of a wandering albatross was found to equal an average 10 m/s plus 0.4 
times the wind component at 5 m height in the direction of flight (their Table 1). The 
Rayleigh cycle model predicts a similar 10 m/s travel velocity through the air for a wind 
speed of 3.6 m/s, as described above. The observed speeds used by Wakefield et al. 
suggest that albatrosses generally fly with an average 10 m/s travel velocity through the 
air even in relatively fast winds of 6-9 m/s and large wind shears. The implication is that 
the birds can control the amount of energy in dynamic soaring in order to maintain a 
nearly constant average airspeed. Some possible ways a bird could limit energy gain and 


airspeed in faster winds would be to climb and descend obliquely across the wind-shear 
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layer, to soar in the weaker shear located higher up in the wind-shear layer, and to 


decrease the frequency of shear-layer crossings. 


The average 10 m/s travel velocity through the air found by Wakefield et al. appears 
to contradict the faster upwind travel velocity through the air (15.6 m/s) and faster 
airspeed (24 m/s) calculated above using a 7 m/s wind speed. There are at least two 
possible explanations of this discord. The first is that albatrosses usually prefer the 
across-wind travel mode soaring at a travel velocity of 10 m/s because that is easy for 
them. Therefore, the preponderance of observations used by Wakefield et al. are probably 
from the common across-wind flight with few observations from fast upwind flight. 
Secondly, significant scatter can be seen in the plot (their Fig. 2, top panel) showing the 
correlation between the ground speed of wandering albatrosses measured with GPS and 
the wind velocity component in the direction of flight. In particular, there is a cluster of 
around 6 observations of ground speeds of 10-13 m/s matching a head wind speed 
component of 7 m/s, corroborating my upwind flight observations and the Rayleigh 
model predictions. These GPS data indicate that at times albatrosses are quite capable of 
exploiting moderately fast wind speeds in order to soar significantly faster than the 


typical cruise airspeed and in an upwind direction. 


The albatross tracking data used by Wakefield et al. are fairly infrequent positions 
with no information about the fine-scale details of the dynamic soaring flight such as 
high-resolution trajectories over the ground and the times to fly the various maneuvers. 
This kind of information is needed in order to be able to further evaluate how albatrosses 
use the wind to gain energy in soaring over the ocean and how fast they can fly using the 
different travel modes. A few albatrosses have now been tracked with higher temporal 
resolution GPS positions and flight details are starting to be published (Sachs et al., 2013; 
Weimerskirch et al., 2002). More of these high temporal resolution data including air 
speed measurements are needed to reveal how an albatross uses dynamic soaring and to 


accurately model the soaring techniques. 


4. Possible upwind dynamic soaring of a robotic albatross UAV 
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Recent observations of fast radio-controlled gliders suggest that they might form the 
basis of a robotic albatross UAV (Unmanned Aerial Vehicle) that could use dynamic 
soaring to fly over the ocean much faster than a real albatross. The goal of this part of the 


paper is to explore this concept using the observed albatross soaring modes (Fig. 4). 





Fig. 4. Conceptual illustration of a robotic albatross UAV dynamic soaring over the 
ocean. A photo of a Kinetic 1OODP glider flown by Spencer Lisenby at Weldon Hill, 
California was superimposed onto a photo of a black-browed albatross soaring over the 
Southern Ocean. Photos by Phil Richardson. 


4.1 Observations of fast dynamic soaring radio-controlled gliders 


Pilots of radio-controlled gliders have recently used dynamic soaring to exploit winds 
blowing over mountain ridges to fly gliders at astonishingly fast speeds in nearly circular 
loops similar to the hover mode. In 2012 I observed gliders being flown at speeds up to 


around 200 m/s at Weldon Hill near Lake Isabella, California in wind speed gusts of 20- 
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30 m/s near the top of the ridge. The present unofficial world record is a peak speed of 
223 m/s (498 mph) held by Spencer Lisenby flying a Kinetic 100 DP glider with a 2.5 m 
wingspan, similar to that of some albatrosses 
(http://www.rcspeeds.com/aircraftspeeds.aspx). One almost has to see and hear these 
extremely fast gliders to believe how fast they can go and to appreciate their extreme 
performance. Anecdotal information suggests that peak glider speeds as measured by 
radar guns are roughly equal to 10 times the airspeed blowing over the crest of the ridge 


(Lisenby, personal communication). 


4.2. Model simulations of fast radio-controlled gliders and possible UAVs 


In order to better understand the dynamic soaring of these gliders I used the Raleigh 
cycle model to simulate their flight. The Rayleigh cycle model is appropriate for this 
because the low-wind region downwind of the mountain ridges and the fast wind speeds 
located just above the crest of the ridge match the two model layers. A maximum lift to 
drag ratio of 30/1 at a cruise speed of 25 m/s was estimated for a Kinetic 100 DP glider 


(Lisenby, personal communication). 


Using these values the Rayleigh cycle model predicts that the maximum possible 
airspeed is equal to around 9.5 times the wind speed in the upper layer for wind speeds 
greater than around 5 m/s (Eq. A1O). The 9.5 factor agrees quite closely with the 
anecdotal factor of around 10. These fast gliders need to be very strong since peak 
accelerations estimated from the Rayleigh cycle model and some accelerometer 
observations (Chris Bosley, personal communication) are around 100 g’s. The gliders are 
highly maneuverable as demonstrated by successful dynamic soaring at high speeds close 
to the ground in gusty winds, by numerous remarkable acrobatic tricks, and by safely 
landing on top of a very windy gusty ridge. (See Richardson, 2012 for more discussion 


about high-speed dynamic soaring gliders.) 


4.3 Upwind travel velocities of a robotic albatross UAV 
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The maximum possible upwind travel velocity of a UAV was modeled with the 
airspeed calculated using the Rayleigh cycle model optimized for fast flight and the 
upwind flight mode shown in Fig. 2. Values of maximum travel velocity through the air 
as a function of wind speed (W) are given in Table 1, and maximum possible upwind 
travel velocities over the ground as a function of wind speed given in Figure 5. The range 
of possible upwind velocities plotted against wind speed is indicated in Figure 5 by 
darker blue shading. An important result is the large region of possible upwind travel 


velocities and wind speeds using a dynamic soaring UAV. 


UAV Upwind Travel Velocities 


Maximum Possible 
Upwind Travel Velocity —_, 


Upwind Travel Velacity (m/s) 





0 5 10 15 
Wind Speed (m/s) 


Fig. 5. Envelope of maximum possible upwind UAV travel velocities over the ground (corrected 
for leeway) plotted as a function of wind speed. Velocities were calculated using the upwind 
mode of the Rayleigh cycle model and by assuming a maximum lift/drag ratio of 30/1 at a cruise 
speed of 25 m/s (similar to a Kinetic 100 DP glider). The darker blue area indicates all possible 
upwind UAV travel velocities as a function of wind speed (W). The UAV was assumed to be able 
to obtain the correct amount of energy for each wind speed in order to soar upwind at slower 
airspeeds than the maximum possible. 
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As a numerical example, a wind speed of 10 m/s could in principle provide sufficient 
energy to soar at airspeeds up to 95 m/s (Eq. A.10). The resulting maximum upwind 
travel velocity through the air would be 61 m/s, and the maximum diagonal travel 
velocity would be 86 m/s (Table 1). Correcting for leeway gives a maximum upwind 
velocity over the ground of 56 m/s and a maximum upwind diagonal velocity of 83 m/s. 
The values of the maximum possible travel velocities are illustrated in a velocity polar 


diagram for a UAV (Fig. 6). 


UAV Travel Velocity Polar Diagram 





Fig. 6. Travel velocity polar diagram of a robotic albatross UAV. The square shape is a result of 
the equal travel velocities through the air in the along-wind and across-wind directions. The 
diagram was created by assuming a characteristic lift/drag ratio of 30/1, a cruise velocity of 25 
m/s (similar to a Kinetic 100 DP glider), and by calculating airspeeds based on the Raleigh cycle 
model optimized to give the maximum possible airspeed for a given wind speed. Travel velocities 
have been corrected for leeway and represent velocities over the ground. Specific given values of 
travel velocity represent the linear relation between travel velocity over the ground and wind 
speed (W) for wind speeds over around 5 m/s (see text). For example, a wind speed of 10 m/s 
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results in maximum possible travel velocities over the ground of 56 m/s upwind, 83 m/s 
diagonally upwind, and 61 m/s across-wind. 


4.6 Possible UAV Survey mode 


A main result of these calculations is to show that in principle very fast upwind UAV 
travel velocities are possible over the ocean when using the energy extracted from the 
wind in dynamic soaring. In a 10 m/s wind the maximum diagonal upwind travel velocity 
over the ground is 83 m/s, over six times faster than that of an albatross flying with a 
travel velocity of 16 m/s. These results indicate that the fastest way to survey or search a 
particular part of the ocean using a dynamic soaring UAV would be along diagonal lines 


(Fig. 7). 


Possible UAV Survey Pattern 





Fig. 7. Schematic diagram showing a possible UAV survey mode using a combination of upwind 
and downwind modes of the Rayleigh cycle consisting of diagonal trajectories with respect to the 
wind direction (Fig. 2). Diagonal velocities are the fastest possible travel velocities over the ocean 
using the upwind (and downwind) mode. They enable rapid surveying of an area for any given 
wind direction given sufficient wind for dynamic soaring. For example, in a wind speed of 10 m/s 
the average maximum possible diagonal velocities are 86 m/s (Fig. 6). 
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A hypothetical survey mode using diagonal lines is shown in Figure 7. In principle a 
UAV could complete such a survey more than 14 times faster than could a 6 m/s ship. 
This suggests that UAVs could provide rapid measurements over the ocean and 
supplement much slower insitu sampling by ship. In addition UAVs could be used in 
combination with surface drifters to follow oil spills or other pollutants in the surface 


layer and for search and rescue operations. 


4.7 Previous numerical simulations of the upwind travel mode of UAVs 


Deittert et al. (2009) modeled a UAV dynamic soaring through an exponential wind 
profile over a flat ocean (no waves). Only modest upwind travel velocities were obtained 
reaching around 2-6 m/s in wind speeds of 8 to 20 m/s referenced to a height of 20 m. 
These upwind velocities are much slower than the maximum upwind travel velocities of 
~ 45-112 m/s predicted using the Rayleigh cycle model and the same wind speeds. Two 
major differences between models explain the very different predictions. The first is that 
most (~ 70%) of the increase of wind speed above the ocean in the 20 m exponential 
wind layer occurs in the first 1.5 m above the surface. Thus most of the increase of wind 
speed in the profile was missed by the UAV. The second major difference 1s that the 
UAV remained in the faster contrary winds above the surface and thus had a large 
leeway. Deittert et al. show a detailed plot of the direct upwind snaking mode with 
heights flown between approximately 1-25 m in a wind profile with airspeed of 12 m/s at 
20 m. The estimated increase of wind speed (AW) encountered by the UAV in this 
example is around 4 m/s, the maximum upwind travel velocity is around 6 m/s, and the 
leeway is around 11 m/s. The Rayleigh cycle model predicts a nearly-equal upwind travel 
velocity through the air as the Deittert et al. model when using the same AW, cruise 
airspeed, and lift/drag ratio. The main difference between model predictions is the much 
larger leeway ~ 11 m/s (Deittert et al.) as opposed to the 2 m/s predicted by the Rayleigh 


cycle model. 


Barnes (2004) simulated a direct upwind snaking mode using an exponential wind 


profile over a flat ocean with a wind speed of 7 m/s at a height of 20 m. His predicted 
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UAV upwind travel velocity was 3.6 m/s, also much slower than the predictions of the 
Rayleigh cycle model for the same reasons as mentioned above. Bower (2012) modeled 
dynamic soaring over a flat ocean using linear and exponential wind profiles. With a 12 
m/s wind at a 10 m reference height his UAV could not travel upwind due to a smaller 


maximum lift/drag ratio (~ 20) and a large leeway. 


The relatively slow upwind travel velocities obtained by Barnes (2004) and Deittert et 
al. (2009) emphasize that in order to fly fast upwind it is important to exploit the full 
wind-shear layer located just above wave crests and to remain in the slow winds located 
downwind of wave crests and shielded from the faster winds for part of the flight. The 
Rayleigh cycle model includes both of these features—flight in the wave trough and 
climbs upwind across the main wind-shear layer—similar to the gust-soaring technique 


proposed by Pennycuick (2008). 


Numerical simulations like those mentioned above could be made more relevant to 
soaring over the real ocean by incorporating the dynamic soaring of a UAV in a model 
that resolves wind-wave interactions and features like lee eddies and detached shear 
layers, which albatrosses use for gust soaring. It seems probable that when these features 
are incorporated into a simulation, a model UAV could fly closer to the speeds found 
using the Rayleigh cycle model. Such simulations could also reveal information about 
optimal flight over waves. On the other had, the slower upwind travel velocities found in 
simulations over a flat ocean could be more realistic in practice if UAV gust soaring turns 
out to be much less efficient than predicted by the Rayleigh cycle. Field tests should be 


able to provide information about how well a real UAV could soar over the ocean. 
4.8 Dynamic soaring of a UAV over the real ocean 

The results of the Rayleigh cycle simulations indicate that much faster UAV upwind 
velocities are possible than those of an albatross. How much faster needs to be 


determined from further studies. The modeled fast upwind UAV travel velocities are 


based on the observed albatross dynamic soaring flight patterns. A fundamental question 
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is how well can a UAV mimic albatross maneuvers and also increase the frequency of 


shear-layer crossings in order to gain sufficient energy for fast travel velocities? 


In the low-level part of a swoop in a wave trough, an albatross flies very close to the 
ocean surface, close enough so that the bird’s wing tip often grazes the surface. This 
allows the bird to descend into a lee eddy located in a wave trough and be shielded from 
the full force of the wind. Grazing the water surface with wing tip feathers 1s not a 
problem for an albatross, but touching the wing of a UAV could cause a crash. Thus, a 
UAV must maintain a safe gliding distance above the ocean surface. But, to fully exploit 
gust soaring a UAV must be able to descend below the wind-shear layer into a wave 
trough, and this could be compromised if the minimum safe flying height above the 
ocean surface were greater than the wave height. Therefore, it seems likely that 
increasing a UAV’s height above the ocean for safety could lead to a reduced amount of 
energy gained from the available wind shear, especially with low-amplitude waves. As 
demonstrated by the numerical simulations of dynamic soaring over a flat ocean, 
including a safe minimum soaring height limits the energy gain, airspeed, and upwind 


travel velocity of a UAV. 


Clearly there is a wide range of possible UAV upwind travel velocities using dynamic 
soaring as indicated in Figure 5. It is necessary to establish values optimized for flight 
over real ocean waves in various wind conditions and to refine such a figure. To do so it 
would be beneficial to have experienced pilots of RC gliders take various high- 
performance waterproof gliders to sea and experiment with field trials in order to 
measure how fast dynamic soaring could be accomplished in realistic winds and waves. 
In such trials it would be helpful to have the gliders instrumented to measure high 
temporal resolution series of positions, orientations, accelerations and velocities through 


the air as well as observations of the wind and waves. 


4.9 Design of a robotic albatross UAV 


ZZ 


The UAV discussed here was modeled on the kinetic 100 DP high-performance glider, 
which is optimized for fast dynamic soaring using fast winds blowing over mountain 
ridges and not optimized for slower winds blowing over waves like albatrosses often 
exploit. An appropriate robotic albatross UAV should probably be optimized for slower 
winds, which can often limit dynamic soaring. In slow winds albatrosses often use a flap- 
gliding technique to supplement dynamic soaring, and when the wind dies altogether the 
birds typically land on the water. A useful UAV would also need auxiliary power for 
slow winds and the ability to land on water and take off again. Other important 
developments are sensors to measure winds and waves from a UAV and an autopilot that 


could implement dynamic soaring using these measurements. 


5. Summary of results 


An upwind dynamic soaring mode was modeled using a series of 90° turns linking 
upwind climbs across the wind-shear layer and across-wind descents across the wind- 
shear layer (Fig. 2). The upwind mode can be used to soar downwind by descending 
downwind across the wind-shear layer instead of climbing upwind across it. A series of 
90° turns can be combined to soar in any direction including directly upwind and 
downwind somewhat similar to the courses made good by a sailboat tacking in the wind 


(Fig. 2). 


An albatross travel velocity polar diagram was constructed by combining the upwind 
and across-wind travel modes using the minimum wind (3.6 m/s) for energy-neutral 
dynamic soaring (Fig. 3). The upwind component of travel velocity over the ground is 8.4 
m/s, considerably larger than the wind speed (3.6 m/s). The fastest travel velocity over 
the ground in the upwind half of the diagram is 13.2 m/s along diagonals heading in a 
direction of around 51° relative to the wind direction. The upwind travel velocity of an 
albatross soaring in a wind of 7 m/s was calculated to be around 12 m/s, more than 
enough to explain how an albatross was able to catch up to our (6 m/s) ship steaming 


upwind. 
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The concept of a fast dynamic soaring robotic albatross UAV was explored based on 
the characteristics of high speed radio-controlled gliders, which have exploited dynamic 
soaring to reach speeds up to 220 m/s. Maximum possible airspeeds of a such a UAV 
were simulated by optimizing the Rayleigh cycle model for maximum airspeed. Although 
such fast airspeeds might not be achievable in practice by a UAV soaring over the ocean 
for reasons mentioned above, the speeds and resulting travel velocities give an indication 
of the maximum possible using the Rayleigh cycle. The wide range of possible UAV 


upwind travel velocities as a function of wind speed was shown in Figure 5. 


A polar diagram of UAV travel velocities over the ground as a function of wind speed 
is given in Fig. 6. The maximum possible upwind travel velocity in a 10 m/s wind was 
found to be 56 m/s and the maximum diagonal upwind velocity is 83 m/s. These very fast 
speeds demonstrate the power of using dynamic soaring and suggest that in principle 
UAV travel velocities over the ocean could be many times faster than that of albatrosses. 
Even if the achievable UAV travel velocities over the real ocean turn out to be only half 
of the predicted values, they would still be much faster than those of an albatross. Figure 
5 also indicates that a dynamic soaring UAV could travel in any direction over the ocean 
and not be limited by the small upwind travel velocities predicted by models that include 
a flat ocean. The fast diagonal travel velocities suggest that a rapid search or survey mode 
over the ocean might consist of a series of parallel diagonal tracks relative to the wind 


direction offset from each other as illustrated in Fig. 7. 


6. Conclusions and recommendations 


The results of this study suggest that a robotic albatross UAV could soar much faster 
than an albatross and be useful for many applications. In order to evaluate how 
effectively an AUV could soar over the real ocean in different wind and wave condition 
Several studies could to be undertaken. First, numerical simulations could be made of the 
interaction of winds and waves, of the resulting structures in the wind field including 
updrafts and detached shear layers, and of the optimal dynamic soaring patterns using the 


wind field over waves. Second, albatrosses could be instrumented to better measure their 
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dynamic soaring techniques in various winds and waves. Specifically, high temporal 


resolution time series could be obtained of albatrosses’ positions, velocities, orientations, 


accelerations, and airspeeds, and these related to insitu observations of winds and wave 


Third, expert pilots of radio-controlled gliders who are experienced in fast dynamic 


S. 


soaring could conduct field tests in order to evaluate fast dynamic soaring over the ocean 


with high-performance waterproof gliders and to establish safe minimum soaring heigh 
above the ocean surface. These gliders should be instrumented as would be the 
albatrosses mentioned above. Using the results of the simulations and measurements of 
albatross and radio-controlled glider flight, a dynamic soaring autopilot could be 
developed. Finally, a prototype waterproof, instrumented, dynamic soaring UAV with 


backup auxiliary power could be constructed and tested over the ocean. 
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Appendix A 


A.1. Nomenclature 


Lift coefficient 

Drag coefficient 

Diameter of loop 

Optimum diameter of loop for maximum airspeed 
Drag force 

Gravity 

Lift force 

Mass 

Period of loop (or equivalent sum of angles turned) 
Optimum loop period for maximum airspeed 
Characteristic wing area 

Airspeed of albatross or UAV 

Cruise airspeed at maximum glide ratio (V/ V- )max 
Sink rate due to drag 

Increase of airspeed caused by crossing the wind-shear layer 
Wind speed of upper layer 

Increase of wind speed across the wind-shear layer 
Bank angle 

Density of air 


A.2. Rayleigh cycle model 


In the Rayleigh cycle model the sudden gain in airspeed of a glider climbing across 


the wind-shear layer headed into the wind is AV. This AV is assumed to equal the vertical 


increase of wind speed (AW) across the layer and also the wind speed W of the upper 


layer, assuming zero wind speed in the lower layer. A similar increase of airspeed is 


obtained by a glider descending across the wind-shear layer headed downwind. 


The sink rate V, of a glider at constant airspeed was used to obtain the decrease of 


airspeed at constant height as modeled by the Rayleigh cycle. To do this the rate of 


change of kinetic energy d/dt(mV"/2) = mV(dV/dt) at constant height was equated to the 
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rate of change of potential energy d/dt(mgh) = mgV-, at constant airspeed. The result 
indicates that dV/dt = g/(V/V-), where V/V, represents values of the glide ratio (glide 
polar). Values of the glide ratio are closely equal to values of lift/drag (L/D) for L/D 
values >> 1 typical of glider flight. Lift L = Cl(p/2)V’S and drag D= Cd(p/2)V'S. 


V/V, 1s nearly constant in the relevant glider airspeed range AV centered a particular 
average airspeed, and therefore acceleration is nearly constant and airspeed decreases 
nearly linearly in time due to drag. For example, values of V/V, are within around 1% of 
the mean V/V, in the energy-neutral circle of a wandering albatross soaring with a cruise 


airspeed of 16 m/s. Thus, the total decrease of airspeed AV in a half loop (#/2) is given by 


V = mS (Al) 
Values of V/V, were modeled using the aerodynamic equations of motion for balanced 
circular flight (Lissaman, 2005; Torenbeek and Wittenberg, 2009) and a quadratic drag 
law, in which the drag coefficient is proportional to the lift coefficient squared. In 
balanced circular flight the horizontal component of lift balances the centripetal 
acceleration and the vertical component of lift balances gravity. Specifically, V/V, was 


modeled by 


2(V 1Vz) max (A2) 


V IV. = Gate . aay pt eo 
(V /Vc)” +(VWelV cos@) 


where (V/V, )max 18S the maximum glide ratio at V, the associated cruise airspeed (airspeed 
of minimum drag) of a representative glider in straight flight, g is the bank angle, and 


cos@ 1s given by 


i 
Li. A3 
OPN OAV ety + 1 a 


Combining Equations (A2) and (A3) with (A1) and assuming that AV = W indicates that 
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_ gt 
~ AV /V,) 


max 


AV =W (V/V)? +(VeIV)” + (22Ve/ gt)” |. (A4) 


The (27V,/gt)’ term is due to the centripetal acceleration and bank angle. Equation A4 
indicates that for a particular glider (with a given (V/V, )max at V.) in energy-neutral 
soaring, the increase of glider airspeed (AV) gained by crossing the wind-shear layer and 
the gradual loss in a half loop is a function of both the loop period t and the average 
airspeed V. 

A minimum AV (and minimum W) for a given glider airspeed occurs at an “optimum” 
loop period ¢,,; coinciding with minimum energy loss in a loop (minimum V,f). The 
optimum loop period (to,:) was obtained by setting the derivative d(AV)/dt of (Eq. A4) 


equal to zero and solving for f. 


_ 20V. /g (A5) 


t oy , 
" MVIV.2 4(VIVY 


Equation A5 indicates that f,,; decreases with increasingly large V. In sufficient wind 
flight faster than the cruise airspeed V, can be accomplished by decreasing the loop 
period toward f,,; and increasing the frequency of shear-layer crossings. Substituting Eq. 
A5 into Eq. A4 provides an expression for minimum AV (and minimum W) for a given V. 
Thus the minimum wind speed Win needed for a given glider airspeed V in energy 


neutral dynamic soaring 1s given by 


7 MV (Vive +W.Vy 
W in “WIV V/V.) +VIV) . (A6) 


max 


This equation can be used to calculate the maximum possible airspeed Vax for a given 
wind speed W. Eq. A6 indicates that the absolute minimum Win occurs when V = V, , 


and thus the equation for the absolute minimum Wpin 1S 
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mVerl2 


= VV na 


max 


(A7) 


At fast glider speeds >50 m/s and for V, ~ 25 m/s, (V/ Vy >> (Vel vy and (V,/ vy can 
be neglected. This simplifies Eq. A.5 to 


2nV- 
t,, = ra (A8) 
g 


Substituting Eq. A8 into Eq. A4 provides a simplified expression for minimum AV (and 
minimum W) for V > 50 m/s. Thus the minimum wind speed Win needed for a given 


glider airspeed V > 50 m/s in energy neutral dynamic soaring is given by 


W nin = —* 
(V/V.) 


max 


(V). (A9) 


This equation can be rearranged to provide the maximum glider airspeed Vinax for a given 


wind speed W 
V/V. 
Vi = So (W ). (A10) 


Equation A10 indicates that for fast flight (> 50 m/s) the maximum possible average 
airspeed in a Rayleigh cycle is proportional to wind speed. It is important to note that this 
linear relation depends on flying with an optimum loop period. Other loop periods can 
result in a smaller maximum airspeed for a given wind speed. Maximum lift/drag ratio 
(L/D) max 18 approximately equal to the maximum glide polar (V/V-)max and can be 


substituted into the above equations. 


29 


The diameter of a loop is given by d = V#/z. Substituting into this equation the 
expression for optimum loop period f¢,,; in fast flight (Eq. A8) gives the optimum loop 


diameter dopt 


d,. =2V,°/g. (A11) 


Equation A11 reveals that for fast flight the optimum loop diameter is proportional to 
cruise airspeed squared but is independent of glider airspeed. 

The total acceleration of a glider includes centripetal acceleration and gravity and is 
given by the load factor, which equals |/cos@ (see Eq. A3). For fast dynamic soaring, V > 
50 m/s, (2xV/gt)’ >> 1, and the load factor is approximately equal to 2zV/gt. 
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Inflatable and Warpable Wings for Meso-scale UAVs 


Andrew Simpson* Jamey Jacob! and Suzanne Smith* 
Mechanical Engineering Dept. 
University of Kentucky, Lexington, KY 


The paper presents work on testing of inflatable wings for unmanned aerial vehicles 
(UAVs). Design and construction of inflatable wings is discussed, along with ground and 
flight testing. Predictions and correlations of the forces required to warp (twist) the wings 
to a particular shape and the aerodynamic forces generated by that shape change are 
presented. The focus is on characterizing the deformation of the wings and development 
of a system to deform the wings inflight. Relations between wing stiffness and internal 
pressure and the impact of external loads are presented. Mechanical manipulation of the 
wing shape on a test vehicle is shown to be an effective means of roll control. Finally, 
damage resistance is shown from results of flight tests. 


I. Inflatable Wings 


A. Introduction 


Inflatable wings are a promising solution for many situations where wings need to be stowed when not in 
use. While inflatable wings are conceptually possible in almost any size and have been developed with spans 
as small as 6 inches for missiles and as large as 30 feet or more for LTAs, the most promising scale as that 
of the medium or meso-scale UAV. Not only does the UAV see the most requested requirements for stowed 
wings, it is this size where weight and volume make the inflatable wing a practical solution when compared 
to folding wing designs. One major drawback to inflatable wing use is the lack of roll control actuators 
(ailerons). This deficiency may be dealt with in several ways, however, including the use of the century old 
technique of wing warping originally developed by the Wright Brothers.! 

The current papers discusses the development and testing of inflatable wings for use on meso-scale UAVs 
with GTOW in the range of 10-50 Ibs. After a review of inflatable wing history, wing design and construction 
is discussed. Mechanical deformation of the wing is demonstrated using photogrammetry and predictions of 
the roll control effectiveness is made. Finally, results of flight tests are presented. 











B. Previous Work 


While the concept of inflatable structures for flight originated centuries ago, inflatable wings were only 
conceived and developed within the last few decades. While lighter-than-air vehicles also include inflatable 
structures, our focus herein is on inflatable structures used solely for lift generation. Various aspects of 
inflatable structures are discussed elsewhere? while a review of inflatable wing and related technologies is 
included in Cadogan et al.? To provide context for the results of this paper, a selection of design concepts 
is included below emphasizing recent developments in inflatable wings. 

Manned aircraft include some of the earliest and most recent uses of inflatable wings. Inflatable wings were 
successfully demonstrated in the 1950s with the Goodyear Inflatoplane (Model GA-468 is shown in Figure 
la and is also on display in the Patuxent River Naval Air Museum). The 6.7 m (22 ft) wingspan aircraft 
was developed as a military rescue plane that could be dropped behind enemy lines to rescue downed pilots. 
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Technology development, including delivery of dozens of aircraft, continued until the early 1970s. More 
recently, the Stingray (Figure 1b) single seat ultralight has been flight tested. Using chord-wise spars, the 





vehicle has a 13 m (42.6 ft) wingspan and 70 m® (2,500 ft?) of internal volume. The developer proposes to 
use helium as the inflation gas to provide an additional lifting component. 





(a) Goodyear Model GA-468 Inflatoplane. (b) Stingray. 


Figure 1. Single seat inflatable wing aircraft. 





Inflatable winged UAVs were developed in the 1970s by ILC Dover, Inc. The Apteron UAV (Figure 2a) 
had a 1.55 m (5.1 ft) wingspan, a 373 W (0.5 hp) engine, a 3.18 kg (7 Ib) gross weight and was remotely- 
controlled via elevons mounted on the trailing edge. The Apteron was successfully flight tested, but was never 
put into production. ILC Dover has since developed numerous inflatable wing designs, including the ones 
presented in this paper.°? Flight tests of deployment and low-altitude (800-1,000 ft) glide of the 12000 UAV 
using inflatable wings were conducted in 2001 by researchers at NASA Dryden. The wings were developed 
by Vertigo, Inc. for the Navy as a gun launched observation vehicle. The skeleton of the wing was made 
of inflatable tubes, surrounded with crushable foam to provide the airfoil cross-section. After the aircraft 
was released, the five-foot span inflatable wing was successfully deployed in about one-third of a second. To 
maintain suitable wing strength and stiffness, nitrogen gas pressurization of 1380-1725 kPa (200-250 psi) was 
required.* The deployment is shown in Figure 2b. 











(a) ILC Dover Apteron UAV. (b) In flight deployment of NASA Dryden I2000 inflatable 


wing. 


Figure 2. Previous inflatable wing UAVs. 


C. Inflatable Wings at University of Kentucky 


The University of Kentucky has been working with ILC Dover, Inc. on the development and testing of 
inflatable wings. Two variants have been developed and tested: inflatable only wings that require constant 
pressurization to maintain shape and inflatable/rigidizable wings that harden into a persistent shape once 
inflated. The latter were tested first. UV-curable resins under development for spacecraft applications were 
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considered for the inflatable/rigidizable wings.*° ® The wings are constructed of a composite material that 
becomes rigid on exposure to UV light. ‘The wings are assembled by sewing woven material to create the 
airfoil and internal baffling. For the inflatable/rigidizable wings, a layered material was used consisting of 
an external containment film, layers of resin-impregnated woven fabric selected for handling characteristics, 
and an internal containment layer. An inflatable urethane bladder is inserted, with long finger-like sections 
filling each baffle of the sewn wing. Figure 3 shows the inflatable/rigidizable wing before inflation and the 
wing while inflated during laboratory testing. This multi-spar design does not use foam spacer material and 
so packs compactly. The inflatable non-rigidizable design is shown in Figure 4. ‘The design is similar to the 
rigidizable version, but does include the impregnated resin and external containment film. 





Figure 3. Non-inflated (upper) and inflated (lower) inflatable/rigidizable wings. 


One potential application directing aspects of the development of inflatable/rigidizable wing technology is 
use on a “Mars Airplane.”? !? Current plans for Mars exploration include smaller Scout missions to expand 
the explored area. One concept under consideration is an unmanned spacecraft launched to Mars to release 
an aircraft designed to fly an exploratory mission over a wider range than a lander/rover and from a closer 
distance than an orbiter. Motivated by the requirement for a minimal packed-volume-to-weight ratio, an 
alternate approach for the proposed folding wing designs is an inflatable composite wing impregnated with a 
UV-curable resin. With this approach, wings are pressurized for deployment then rigidized with exposure to 
UV radiation from the sun. Once rigid, the wings no longer require pressurization to maintain their shape.!® 

Feasibility of these concepts are being tested through a series of high-altitude experiments. The flight 
experiments that validated this technology included three stages: 1) balloon-launched ascent to deployment 
altitude, 2) deployment of inflatable/rigidizable wings and continued ascent to near 100,000 ft, and 3) release 
from the balloon and gliding descent under autonomous control. Note that ground-level atmospheric density 
on Mars is similar to that at 100,000 ft on Earth. Balloon-launched high-altitude experiments to date 
include the first-ever demonstration of inflatable/rigidizable wing technology on May 3, 2003 with successful 
deployment of inflatable wings at 55,000 ft, curing on continuing ascent to 89,603 ft and descent to recovery.!4 

More recent efforts considered tailoring the composite layering design for weight reduction.!+ A second 
demonstration experiment on May 1, 2004 resulted in deployment and curing of the second-generation 
inflatable/rigidizable wings. Figure 5 shows the test article after flight inflation and rigidization at the 
recovery site. Flight characteristics, aerodynamic performance, aerodynamic analysis and wind-tunnel testing 
for the inflatable/rigidizable wings are detailed elsewhere.'?!© A third flight using a Vectran non-rigidizable 
wing was completed on April 30, 2005 and reached altitudes of over 98,000 feet (Figure 6). Since constant 
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Inflated Wings in Fuselage 





Figure 4. ILC Dovers multi-spar inflated wing and packed configuration. 


gage pressure was required to maintain wing shape, the inflation system was designed to vent upon ascent 
and included make-up gas to maintain 185 kPa (27 psig) until landing. 





Figure 5. Recovery of second-generation inflatable/rigidizable high-altitude test article. 


D. Warping Inflatable Wings 


The current research efforts are focused on warping an inflatable non-rigidizable wing to provide roll control 
through wing warping. Inflatable wings were developed by ILC Dover, Inc. which were not impregnated with 
the UV curable resin for use in other applications where higher wing loading is required. Since the wings do 
not harden, constant internal pressure is required to resist flight loads. As the wings are entirely inflatable, 
they do not contain ailerons. Thus, vehicles employing inflatable wings must generate a roll moment by 
other means. However, as the wings are non-rigid, it is possible to actively manipulate the shape of the 
wings to provide this roll moment. 

Another motive for altering the wing shape is to improve the airfoil efficiency. Principally, efficiency is 
maximized by increasing the lift ZL and decreasing the drag D or maximizing the lift-to drag ratio (L/D) for 
any configuration. A given airfoil profile has vastly differing lift and drag characteristics over the possible 
ranges of Reynolds Number (Re) and Mach number (M). Thus, airfoils are typically designed for a narrow 
range of flight conditions. Alternatively, airfoils can be designed that perform adequately over a wide range 
of conditions, but do not perform well in any. By altering the shape of the wing, the L/D ratio can be 
changed across the length of the wing. Assymmetric shape changes generates differential lift between the 
two semi-spans while deforming the semi-spans symmetrically would provide an altered lift distribution that 
could be optimized for maximum L/D. The differential loading scenario can potentially be used to generate 
the required roll moments for the aircraft. 
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Figure 6. Inflatable wing high-altitude test article at 98,000’. 


II. Wing Design 


A. Inflatable Wing Construction 


The inflatable wing is designed such that constant internal wing pressure is required to maintain wing shape. 
High stiffness is achieved with low inflation pressure by maximizing inflated sectional moment of inertia. 
Since the wing is constructed of a flexible fabric material, it can be stowed by folding or rolling. Previous 
inflatable wing designs required high gage pressures to maintain wing shape. This typically requires the use 
of heavy seals and gaskets and thicker than desired wall thicknesses for the wing material. It also means that 
any leak, however small, will result in the rapid depressurization of the wing and loss of structural integrity. 

The present wing design uses the presence of internal span-wise baffles or inflation cavities to help maintain 
structural stiffness at lower internal pressures. The outer wing (restraint) and internal baffles are constructed 
from high strength fibers such as Kevlar. The current incarnation as tested herein is made of Vectran, a 
manufactured fiber spun from Celanese Vectra liquid crystal polymer. The fibers have high-temperature 
resistance, high strength and modulus, and high resistance to moisture and chemicals, with good property 
retention in hostile environments. Since Vectran is porous, a polyurethane elastomer bladder is used to keep 
the internal volume pressurized. Design pressure is 186 kPa (27 psi), though the wing has been successfully 
flight tested at values down to 52 kPa (7.5 psi) with sufficient wing stiffness for low speed applications. The 
wing is constructed in semi-span sections and mounted to a plenum that can the be attached to an aircraft 
fuselage. The wing in both uninflated and inflated states is shown in Figure 4. The wing profile is based 
around a NACA 4318 with a 4 degree incidence angle. The taper ratio is 0.65 with an aspect ratio of 5.39 
and a span of approximately 1.8 m (6 ft). The wing planform and root and tip cross-sections are shown in 
Figure 7. Note the ribbed profile and blunt trailing edge; this is discussed in more detail below. Generally, 
low Re airfoils are designed to have thin profiles.‘° Here, manufacturability dictated a thicker profile which 
is typically a poor performer at low Re. However, the airfoil actually has improved peformance in the speed 
regime of interest due to the roughness of the inflated profile, which has been noted in the case of bird wings, 
for example.!”!® This is discussed in more detail elsewhere.!® 











III. Experimental Arrangement 


A. Photogrammetry 


Full deformation measurements were made using photogrammetry. Photogrammetry unobtrusively measures 
spatial deformation of the wing surface. Capturing numerous digital images from a variety of positions, and 
referencing the images to each other allows the accurate generation of a three-dimensional representation of 
the wing surface. High contrast markers are placed on the surface of the wing and referenced in each image 
where they can be seen. The wing was then deformed into a new shape. Changes from the original shape 
to the new configuration can be tracked, as well as hysteresis effects, when the process is reversed. Wing 
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Figure 7. Planform and profile of the inflatable wing. 





surface maps can then be generated and the deflection correlated with the applied force and the internal 
pressure in the wing. 

In order to obtain photogrammetry results, high contrast markers were placed on the surface of the wing. 
Higher concentrations of markers were placed towards the wing tip than at the root as greater movement 
was expected in these areas. The markers were placed in lines from wing root to tip and from leading edge 
to trailing edge and a total of 243 markers were used on the top surface of the wing. Spacing between 
markers was approximately 1 in and each marker was 0.25 in in diameter. Additional lighting was used 
when capturing the images, which ensured maximum contrast. Each image was captured in such a way as to 
encompass all markers and occupy the entire field of view of the camera. An Olympus E-20N 5-Megapixel 
SLR digital camera was used to capture the images for the static measurements. Camera calibration was 
conducted before the image capturing process, to correct for lens distortions. The calibration results were 
then used to adjust the captured images for these distortions. Wing surface maps can then be generated as 
seen in Figure 8 where the wing is loaded under torsion similar to how the wing is deformed in flight. 
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Figure 8. Wing deflection due to applied torsion. 
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B. Static Deformation 


Due to the novel wing design of the inflatable wing, wing stiffness is a function of inflation pressure, and 
thus aero-elastic behavior is a concern. To examine wing deformation, static load tests were conducted. The 
deflection tests were in three groups; point loads at the wing tip, distributed loading patterns, and torsional 
loads. In addition to simple tip deflection measurements, photogrammetry measurements were made so as 
to determine the deformation of an entire wing semi-span. 

Figure 9 shows the deflection of the inflatable wing with a point load at the wing tip. The point load 
was applied at the thickest portion of the wing (approximately the 1/4 chord), where the wing has the 
greatest resistance to bending. Five point loads were applied (10, 20, 30, 40, and 50 N), at seven different 
inflation pressures ranging from approximately 34-138 kPa (5, 7.5, 10, 12.5, 15, 17.5, and 20 psi). The wing 
tip deflection was measured on a scale at the 1/4 chord. At the most extreme loading case (50 N) and the 
lowest inflation pressure (5 psi), the maximum tip deflection is less than 3% of the span. 

For the distributed loading tip deflection measurements, the wing was set-up in a test stand upside down. 
Weight was applied on the surface in three different loading patterns and at four different internal pressures 
ranging from 69 to 172 kPa (10, 15, 20, and 25 psi). The loading configurations simulated flight loads under 
approximately 1, 2, and 2.5 g conditions. While the magnitude of the deflections change depending upon the 
loading conditions, the qualitative behavior is the same. Response appears to be monotonically decreasing 
with increasing inflation pressure. Even in the most extreme case at the highest loading and the lowest 
inflation pressure, the maximum tip deflection is less than 2% of the span. These results can be seen in 
Simpson et al.2° Note that these two loading cases display similar behavior. 


Tip Daflactian, th 
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Figure 9. Tip deflections for point loading. 


The wing was set-up in a test stand and was mechanically manipulated by applying a tensile force to 
the surface of the wing. Vectran fabric, (the same material as the wing) was bonded to the surface of the 
wing on the top leading edge and the bottom trailing edge of the wing. Both pieces were modified to be 
an attachment point for the tensile members. ‘Thin wire was then attached and drawn taut to the opposite 
side of the wing at the wing base. The attachment point for the top surface wing tip, was 8 in above and 
1 in behind the trailing edge of the wing. This made an angle of approximately 15° from horizontal. The 
attachment point for the bottom surface wing tip, was 8.5 in below and 3 in ahead of the leading edge of 
the wing base. This made an angle of approximately -20° from horizontal. The goal was to induce torsion 
in the wing from tip to root. Force transducers were connected linearly along the length of wire between 
these points. The transducers were connected to a DAQ that enabled real-time force measurement. The 
force transducers were connected to an adjustable tensioning mechanism, which could be used to adjust the 
force exerted. Four inflation pressures were examined: 69, 103, 138, 172 kPa (10, 15, 20, 25 psi). At each 
inflation pressure, five different forces (10, 20, 30, 40, and 50 N) were applied to the wing on both the top 
and bottom surfaces of the wing in conjunction with each other. The deformations were then measured 
via photogrammetry. Individual applied forces or combinations of applied forces were not examined. From 
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Figure 10. Undeformed (upper) and deformed wing (lower). 


measurements of typical twist deformations, seen in Figure 10, span-wise load distributions for baseline, 
maximum and minimum cases were generated and the placed into a lifting line code using as discussed 
below. 





C. Dynamic Deformation 


Dynamic deformation testing has taken two primary forms; smart materials and mechanical actuation. Smart 
materials such as piezoelectric’s and shape memory alloys offer a range of potential benefits; e.g., see Kudva 
et al.24 Piezoelectric materials have been used to alter camber, and deform leading and trailing edges. 
Rapid and controlled actuation of the material makes these materials desirable. However, the substantial 
equipment requirements for operation of these materials hampers flight testing. Shape memory alloys offer 
many alternatives as a wide range of shapes and actuation mechanisms exist. The wing has been warped 
in laboratory tests using nitinol actuators. ‘The wing was placed in the test stand, and the nitinol attached 
to the wing tip trailing edge and fuselage near the root. As shown in Figure 11, the wing experiences 
substantial deformation under actuation. Note that under the current configuration, the trailing edge is 
deflected downward while the leading edge remains in the same location. When measured from leading to 
trailing edge at the wing tip, the twist is an effective increase in angle of attack of 3°. If the deflection is 
measured from the first deformation point (approximately 0.75c), the effective flap deflection is approximately 
16°. 

Mechanical actuators have also be applied to the inflatable wings. High torque servos (Hitec HSC- 
5998TG) mounted beneath the wing root were connected to the wing at the wing tip as outlined above. The 
servos are capable of delivering 14.4 kg-cm (200 oz-in) of torque at 4.8 V and actuated using a standard R/C 
controller. As the actuation was dynamic, photogrammetry could not be employed to monitor the shape 
change of the wings. Rather, videogrammetry was used to capture the dynamic shape changes to the surface 
of the wing. The videogrammetry system comprised of two synchronized Pulnix (M-6710CL) one mega-pixel, 
progressive line scan cameras. The cameras capture images at 120 frames per second. The images are then 
fed into Photomodeler Pro for static and dynamic measurements. The true AoA variations across the wing 
were then placed in a lifting line code to predict the lift from the wings. Figure 12 shows the measured 
AoA variation and the predicted lift generated by the variation for two cases. The twist in the wings was 
not linear; higher AoA deflections were measured towards the wing tip. Using a = 0°, a, = 0.14/radian, 
ac,=0 = —4° and a tip twist of 16°, results in the lift distributions shown. As seen, substantial positive and 
negative modifications to the baseline distribution are possible. Note that the left hand semi-span produces 
nominal lift while the right hand semi-span produces increased lift as only one semi-span was warped at 
a time in the positive (increasing alpha) direction. From this, change in roll moment coefficient, AC{, are 
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Figure 11. Wing warping using nitinol actuator. 


calculated and shown in Figure 13. Substantial roll control is available with inflation pressures at or lower 
than 20 psi. Above this value, roll control drops significantly. While still non-negligible, adequate roll control 


may not be available for gust response and rapid maneuvering. 
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(a) Inflatable platform. (b) Flight test with warping wing. 


Figure 12. Lifting line prediction for the warped airfoil. 


IV. Flight Tests 


For low altitude flight testing of the inflatable wings, several configurations were constructed and flown. 
Two have been used significantly throughout the low altitude flight tests. The first consists of a PVC fuselage 
with aluminum boom connected to the empenage. This configuration is shown in Figure 14. The second 
consists entirely of a composite fuselage of woven Kevlar fabric providing superior weight and strength. In 
both cases, the wings were mounted directly to the top of the fuselage while multiple tail configurations were 
constructed to test stability and control characteristics. These included a traditional tail, ‘T-tail, and V-tail 
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Figure 13. Lifting line prediction for rolling moment coefficient. 


designs. ‘The tail volumes are typically larger than usual to increase control at low speeds during launch. 
There is no aileron control on the vehicle. 

For each of the test vehicles, several sets of wings were available for flight testing. In addition to sets of 
inflatable wings, simulated inflatable wings were constructed out of styrofoam. These wings were constructed 
with the same profile geometry and planform as their inflatable counterparts, and then weighted down to 
match the final weight of the rigidized wings. The fully inflatable wings have a mass of approximately 3 kg 
(6.6 lbs), including the aluminum plenum used for inflation and mounting. To achieve altitude, the vehicles 
were outfitted with an electric powerplant mounted in a tractor configuration. The motor was an AXI 4120 
brushless motor with a Jeti controller and 24 cell battery providing up to 70 A of current. At an output of 
16-20 cells and with a rated motor efficiency of greater than 82%, this provided up to 549 W of power to the 
propellor. Various sized folding propellers were used as needed. 

To date, over 50 low altitude flight tests of various configurations have been conducted. These tests have 
been conducted with three goals in mind: (i) to evaluate aerodynamic performance of the morphing wings in 
realistic operating conditions, (ii) determine the handling characteristics of the aircraft, and (iii) to obtain 
appropriate feedback gains for use in an autopilot system. While aerodynamic performance closely matched 
that seen in the wind tunnel, handling characteristics are discussed in detail here. In general, the vehicles 
were very stable and exhibited slight Dutch roll at take-off due to the high dihedral. 


Tail Currier 
oe 





(a) Inflatable platform. (b) Flight test with warping wing. 


Figure 14. Flight platform and picture from flight test with wing warping. 


The present vehicle employs a crude wing warping system. A standard R/C servo delivering 14.4 kg/cm 
(200 oz.-in.) of torque at 4.8 V, warps the wings. This is achieved through a pulley system attached to the 
fuselage. The servo is mounted on the tail boom located under the trailing edge of the wing on the fuselage 
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centerline. Nylon lines are run from the servos to attachment points (similar to those used in the experiments 
above) on the pressure surface of the wing at the wing tip. As the servo arm rotates, one semi-span is warped 
down as the line tightens, while the other side slacks. ‘Thus in the current configuration, only one semi-span 
is warped at a time. Additionally, the wing is warped down only, resulting in a higher than normal lift on 
the warped semi-span. 

The UAV has been test flown in this configuration as seen in Figure 14b. ‘Two wing warping configurations 
have been flight tested.?°:?? In the first, the roll and yaw of the vehicle were coupled though the R/C 
receiver. Qualitative flight stability was greatly improved as compared to the unwarped case. In the second 
configuration, the vehicle was flown without coupling the roll and yaw. Roll control was adequately provided 
by the wing deformation. Unfortunately, the UAV did not have any onboard sensors, thus roll rate could not 
be measured and correlated to servo position. A roll rate sensor has since been developed which is capable 
of measuring roll rate, pitch rate, longitudinal acceleration, lateral acceleration, and servo position. The 
results from these test flights can be used to correlate servo position (and hence wing deformation) to roll 


rate. Preliminary results of the flight testing of the UAV and the sensor payload with wing warping were 
23 








presented elsewhere. 
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Figure 15. Roll response with servo step input. 


Figure IV shows the result of a flight test using the onboard sensor, thus the roll rate of the aircraft should 
respond to the wing warping. Figure IV shows the response of the aircraft to a step input; a nearly constant 
increase in roll rate (or constant roll acceleration) results, and a steady state roll rate of approximately 
160°/s is seen after approximately 3 seconds. Once the servo input has been removed, the roll rate reverses 
until the aircraft stabilizes itself. Due to a slight asymmetry in the wings, a constant roll acceleration of 
approximately -0.75°/s? is seen without active input. 

Considering only a single degree of freedom and neglecting roll due to rudder deflection and sideslip, this 


is simplified to 





I b 
ir 7 on ~ Ch; ba = 0 
where p is the roll rate in radians/sec, I; is the moment of inertia in roll, Cj, is the roll damping coefficient, 
Ci,;, 1s the control power coefficient, and 0, is the effective aileron deflection in radians. For a given flight, 
flight parameters include q, S, 6, U and I,, while measurable variables include p and p. At initial lateral 


control input, we can write 


. C15, 0aqSb 
a 


ie 
while at steady state we have 
pb _ =C],. da 
200 «CO, 


The dimensionless roll-rate, pb/2U, is approximately 0.25. With a reported value of I, = 0.57 ke-m?,° 
one can find that Ci; da = 0.0013. Using the laboratory measured value of dq = 16°, we estimate values 
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of Ci;, = 0.0047 and C;, = 0.0051. This compares favorably to the predicted value of Cj,, = 0.0053 from 
McCormick.?4 


V. Conclusions and Future Work 





Inflatable wings have been demonstrated as viable alternatives for meso-scale UAVs in both lab and flight 
tests. In particular, handling qualities and maneuverability have been demonstrated using wing warping 
across a wide range of inflation pressures. Substantial roll control is available with inflation pressures at or 
lower than 20 psi. Above this value, roll control drops significantly but is still non-negligible; thus, adequate 
roll control may not be available for gust response. While not discussed herein, there are aerodynamic 
benefits at low Re due to the unique wing profile.?° 7° 
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Figure 16. Comparison of damage rates of various components during flight tests. 


Figure 16 shows the comparison of damage rates of various components during flight tests. While damage 
was determined using a broad definition including adjustment, repair, or replacement of component parts 
in the field prior to subsequent flight tests, the results show that the wings are extremely damage resistant. 
The inflatable wings have even survived serious crashes that have resulted in the near total destruction of 
the vehicle. Thus, vehicles requiring high damage tolerance may benefit from such wings. 

Further work is required in numerous areas. The current wing warping system is crude and a more elegant 
solution is currently being developed. In the present work, only symmetric wing warping is demonstrated. 
Antisymmetrically warping the wings will provide greater roll authority and hence greater maneuverability 
of the UAV. Finally, all warping strategies need further wind tunnel and flight testing to experimentally 
verify the improvements in lift and in flight deformation. 
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Efforts are continuously being made to find simple ways to convert wings of aircraft from 
an clficient cruise configuration to one that develops the high lift needed during landing and 
takeoff. The high-lift configurations studied here consist of conventional airfoils with a trapped 
vorlex over the upper surface. The vortex is trapped by one or two vertical fences (hat serve 
as barriers to the oncoming stream and as reflection planes for the vortex and the sink thal. 
forme a separation bubble on top of the airfoil. Since the fuil three-dimensional unsteady flow 
problem over the wing of an aircraft is so complicated that it is hard to get an understanding 
of the principles that govern the vortex trapping process, the analysia is restricted here to the 
low field illustrated in the first slide. It is assumed that the flow field between the two end 
plates approximates a streamwise strip of the flow over a wing. The flow between the endplates 
and about the airfoil consists of a spanwise vortex located between the suction orifices in the 
end plates. The spanwise fence or spoiler located near the nose of the airfoil serves to form 
a. separated flow region and a shear fayer. The vorticity in the shear Jayer is concentrated 
into the vortex by withdrawal of fluid at the suction orifices. As the strength of the vortox 
jncreases with time, it eventually dominates the flow in the separated region so that a shear 
or vortical fayer is no longer shed from the tip of the fence. At that point, the vortex strength 
is Bxed and its location is such that all of the velocity contributions at its cenler sum: to zero 
thereby making it an equilibrium: point for the vortex. This presentation describes the results 
of a theorelical analysis of such an idealized! flow field. 
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This slide presents a two-dimensional idealization of the experimental configuration pre- 
sented tn the previous slide that will be tse in the theoretical analysis. A large trapped-vortex 
bubble is shown over the airfoil] to emphasize the fact that the analysis is most interested in 
those configurations wherein the vortex bubble covers a large fraction of the upper surface of 
the airfoil. Ifsuch a flow field can be established, the lift enhancement by the trapped vortex is 
substantial cnough to yield lift coefficients that are in the range of the value, Cr = G. shown in 
the slide. The two-dimensional flow field is assumed to be inviscid and incompressible so thad 
iL can be represented by potential flow theory. Conformal mapping techniques can then be 
used lo develop the desirec! flow-field configuration from the flow about a circular cylinder. A 
substantial advantage of the conformal mapping technique is that it yields directly the location 
of the equilibrium point for the center of the vortex/source combination, the circulation, I, 
of the vortex, and the source strength, 17. Knowlege of I’ and ajz then yield the lift due to the 
trapped vortex and the crag attributed clirectly to the trapping process which is designated | 
hy Ca. As indicated in the slide. the flow is assumed to depart smoothly from the lip of the 
fence and from the trailing edge of the airfoil in order to satisfy the Kutta condition al. those 
locations. - | | 


Vhe single fence case was first studied, Ref. 1, in order to gain an understanding of the 
nature of the fow field and to obtain an estimate of the magnitude of lift enhancement that 
can be achieved hy ineans of a trapped vortex. 


Ref. l: Rossow. Vernon J.. “Lift Enhancement by an Externatly Trapped Vortex”. AIAA 
Journal of Aircraft, Vol. 15, No. 9, Sept. 1978. pp.b18-625. 
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The results presented here for the single fence case illustrate the location of Lhe cqui- 
libriwm potat for the vortex /source combinations for several different fence lengths and lft 
cocllictents, [is to be noted that the lift coefficient has been specified bul the downstream 
extent of the vortex bubble has not heen Axed. [It was assumed that the length of the fence 
and location of the equilibrium point would be enougl: to fix the size of the voriex bubble. 
llowever, when experiments were conducted in a water channel, it was found that a trapped 
vortex could he formed in some cases but that a large amount of fluid had to be withdrawn 
froin the center of the vortex,to not only form the vortex hut also to sustain it. This result 
was predicted by the theory through the magnitude of the sink required to achieved an eqqui-- 
librium condition at the center of the vortex. Not immediately apparent is the fact that the 
sink How also respresents a drag that is attributable to the vortex trapping process. It was 
then reasoned that not only is the drag undesirable, but a large amount of fluid moving along 
the vortex core can disrupt the vortex formation and, if large enough, can actually occupy thie 
entire trapped vortex region at spanwise stations near the wingtip where the core flow spills 
inte the free stream. Research was then started on finding ways by which the mass flow al 
the souree/vortex location could be made to vanish, 
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A mechaniant whereby the source flow can be made to vanish and still have an equilibrium 
poinl. for tle vortex is illustrated here, The two-fence trapped-vortex configuration in the lower 
part of the figure is divided into three separate flat-plate boundaries. In the first, the horizontal 
(lat plate serves as a reflection plane with an image vortex below the surface which induces 
an upstream velocity on the vortex that is exactly equal to the oncoming free-stream velocity. 
This configuration yields an equilibrium: potnt without a source but requires a fence of some 
sort Lo promote the formation of the vortex. A fence upstream of the vortex provides the shear 
layer trenlioned previously that builds the circulation in the vortex. The vertical hotndary 
also Induces an upward velocity through the influence of the image vortex necded to make 
the surface a streamline. The upward velocity due to the front fence needs to be offset by a 
sink localed beneath the horizontal plane if some other actifice is not used to bring about an 
eqnilibrium condition. Such an artifice is available as a fence clowustream of the vortex. As 
indicated in the figure, the image vortex for the rear fence induces a downward velocity on 
the vortex. Therefore, if the vortex to he trapped is midway between lwo vertical surfaces of 
aboul the same size, an equilibrium condition is achieved for the vortex without the presence 
of a source or sink. : 


Uhe twa-fence concept does several things for the flow field. First, it makes it possible to 
trap a vortex at ifs equilibrium location without the use of a source or sink. The front fence 
serves as al upstream fimit on the drapped-vortes flow field and as a means for reneralinyr 
a shear layer that supplies vorticity to the vortex. The second fetce serves as a downstream 
limit. on the size of the vortex bubble and as a reflection plane for the vortex so thal trapping 
can be achieved without the need for a source or sink. Since a source or sink is not required 
for the establishment of an equilibrium point. the drag duc to vortex trapping is negligible 
which means that efficient lift enhancement has been achieved. Another big advantage is thal 
the flow along the core of the vortex is also negligihle making it much easier to establish and 
maintain the vortex flow field. Mass removal from the core is then only necessary to establish 
the vortox aud to remove low energy fluid generated by viscous losses. 
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Before proceeding to airfoil-type trapped-vortex configurations, consider the simple case 
wherein a vortex is trapped over an infinite plane. As mentioned previously, a source is not 
needed in order to achieve an equilibrium condition. In practice however, fences are needed! 
bo fix bhe upstream and downstream extents of the vortex bubble and to provide a separated 
flow region with a shear layer to supply the vorticity that builds into the circulation for the 
vorlex, Pences can be added to the flow field without disturbing the equilibritun conclition or 
the streamline pattern if Lhe fences are placed upstream and downstream of the vortex on the 
aurftace of the vortex bubble as shown in the lower part of the figure. If the fences are thin atid 
fit, or conform to, the surface of the vortex bubble, the flow field characteristics are unchanged 
hy addition of the fences. A number of the solutions to be presented will be noted to have 
only one fence that is flat and that is needed to make 72 = 0. The other fence ts assumed {o 
he of the conforming type that fits the vortex bubble so closely that no appreciable change in 
the flow field is brought about. 





location 
a Image and physical streamlines for trapped vorter flow field. 


b. Fences fore and aft that conform to shape of vortex separation bubble. 
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The procedure that was used to calculate the trapped-vortex flow field over an airfoil 
wherein a source or sink is nol needed ts ikustraled in the figure below. The first slep in the 
procedure is to calculate the flow field when only the front and rear stagnation poiuls of (he 
vortex bubble are specified. In such a case. the vortex bubble is assumed to have conforming 
fences that do not interfere with the equilthrium condition. Under those conditions, fa sink is 
required in order to achieve an equilibrium condition for a source/vortex coinbination as shown 
in the upper figure, the height of the rear fence (which is approximately flat) is incroascd in 
ateps until the sink How is negligibly small. The sink flow is highlighted in the upper figure 
by cross-hateling the streamtubes entering the sink. When the proper height of the flat plate 


rear fence has been found by such an iterative process, it is retained as the most effictent. or. 


Wr = 0, solution for a vortex bubble of a specified size and location on an airfoil al a given 
angle of altack, Conversely, ifthe flow fteld solution for the conforming-fence geometry had 
required a source rather than a sink. the height of a Mal front fence would have been increase 
nulil 27 = 0. The foregomg procedure was used to obtain all of the am: = 0 lrapped-vortex 
solutions presented here. 
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Vortex trapped on Clark Y airfoil (NACA 4412); o = 0.1. 
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ln order fo obtain a data set of solutions that can be used to study the characteristics 
of airfoils with trapped vortices, a sequence of 232 = 0 cases were calculated for the flow over 
au NACA 4432 (or Clark Y) airfoil at augles of alftack from a = —4% through w= $127 in 
inevements of 2°. Since the streamlines for tde various solutiona do not change very much, 
only the solutions for a = +4° are presented on this slide. The various solutions differ from 
one another in that the size of the trapped-vortex bubble increases gradually from zero to a 
nize. that nearly covers the entire upper surface of the airfoil. It could be imagined that the 
seqquence of figures represents a streamwise cross-section of the flow field as the wing ia changed 
from its cruise configuration (i.e. no vortex) to the vortex-bubble size (and lift) needed for 
landing. Conversely, when the aircraft takes off. the fences are first deployed so as to clevelop 
the size of trappedd-vortex needed for high lift. As the aircraft becomes airborne and increases 
its flight velocity, the fences are changed so that the vortex bubble shrinks in size progressively 
undil ibe cruise configuration is achieved. 
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The various characteristics of the trapped-vortex airfoils are now presented. The lirst 
parameter Hustraied is the height of the flat fences used to bring about the a = 0 condition. 
The parameters that are used lo define the chordwise extent of the vortex bubble are shawn 
in the inset figure. The chordwise beginning or front of the bubble, ry, is taken as the 
intersection of Che hudyble or fence surface with the upper surface of the airfoil. Similarly, the 
rear or downstreain end of the vortex hulble. :c,. is defined as the point where the hbuebhle 
surface intersects the surface of the airfoil. It is uoted that a flat fence length of about O.1e 
is required in order to obtain a vortex hubble that covers 26% of the airfuil. A Mat fonee 
length of about 0.2¢ produces a vortex bubhle that covers about half of the airfoil surface. 
This figure and the previous one clearly show that the size of the vortex bubble is largely 
controlled by the spacing between the front and rear fences. The height of the fences that are 
flat and do not confornt to the shape of the vortex bubble govern the magnitude of the souree 
or sink needed for equilibrium and are used to make a = 0. Conforming fence portions of a 
certain length will likely also be necessary in practice to produce the shear layer needed for 
the developinent of the vorlex and to control the physical limits of the vortex bubble. The 
present study does not include a study of the size of conforming fences that are needed. 
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The lift, coeffictent developed by the various trapped- vortex configurations is presented 
on this slide for the range of vortex bubble sizes that were studied. It is voted that the ffl 
increases slowly at first as the size of the vortex bubble increases from zero. At the larger 
vortex sizes, the lift changes rapidly with the size of the vortex bubble. Also to he noted is 
thal nol all of the curves end at the large vortex bubble sizes. The compulations indicate 
that th ts nol possible to find an equilibrium point for m7 = 0 in certain cases. Aldhough a 
physical reason for the solution failure was not found, it seems reasonahle that fence heights 
above certain values should not be possible solutions because the fences begin to interfere 
with the vortical flow field and cause it to become too distended in the vertical cirection. An 
explanation or criterion for the fence lengths above which solutions can no longer be fount 
was nol. found. 


Even a casual look at the curves of lift as a function of bubble size suggests that the 
curves are about of the same shape and that they might possibly collapse to a single curve if 
the Tift increment due to the (rapped vortex is plotted ag a function of the size of the vortex 
bubble. (2, —wy)/e. Those results are presented on the next slide. 


LIFT COEFFICIENT AS A FUNCTION OF VORTEX BUSBLE SIZE 
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The data on the previous slide collapses to a single curve only for the smaller values of 
(vp, — ay ic. As the vortex bubble size increases the differences hetween the curves increases, 
even though the curves all lave about the same shape. Manipulation of the various parameters 


might provide a better correlation of the data but was not tried. 
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lnorder to cemonstrate that the lift responds in the conventional way to angle of attack, 
(his slide presents Che Tift produced as a function of angle of attack for various sizes of the 
Lrapped-vortex bubble. It is noted that the varlation of lift with augic of allack for duahble 
sizes that are GO% or less of the chord are approximately linear with angle of atlack. The 
alope of Lhe Tift curves increases with increasing size of the vortex bubble but uot dramatically, 
These results tndicate that trapped-vortex airfoils have a conventional response to angle of 
attack. The fignre also provides an estimate of the reduction in angle of attack that can he 
achieved by adding a trapped vortex to the flow field over the airfoil. For example. aslilition 
of a trapped vortex that covers 26% of the airfoil, permits about a 4° reduction in angle of 
aifack for a given section lift coefficient. 


LIFT COEFFICIENT AS A FUNCTION OF 





ANGLE OF ATTACK 
T mY 
) Pp fa 
nh 
5 oo fesoy 
en ieee 
f | | 
t | octrh | 
a . 7 : | [i 
P | . | id gusts. 
— sae 
| F i | aad 
e a7 


i ; ' 
po° 
Ps — ' : i 
4 -,’ a “2 | | i Fr) 4%. 
ox, degrees 


1802 


The pitching moment about the guarter-chord location is expected to vary greally when 
the voriex bubble is large and moves aft. Even though an attempt was made to keep the 
center of the vortex bubble at. ahout the same chordwise station, the pitching moment ts secu 
lo become quite large. Latitude is available, however. for placing the vortex bubble fore or aft 
on the airfoil to influence the pitching moment-see next slice. 
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fn this particular sequence of trapped-vortex cases, the size of the trapped-vortex biudible 
is hetl approximately constant as the chorilwise location of the hubble is move aft in a series 
of steps from a very forward location. The cases presented Hlustrate some of the latitude (hat 
is avaiable for manipulating the characteristics of the airfoil. 
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The characteristics of Lhe lrapped-voriex cases presetited on the previous slice are sum- 
marized here. As expected, the pitching moment can be made as small as desired by moving 
the vortex bubble forward. The lift generated by the trapped vortex does decrease with the 
mere forward location but not disastrously. The mininuo height or length of the fat fences 
also changes a bit with the location of the trapped vortex but not by a large amount. 
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The foregoing slides provide an overview of the characteristics of one airfoil shape which 
has its fit enhanced by a trapped vortex flow field. Results for other airfoil shapes will differ 
tn delail but will genenerally have much the same character. This information provides the 
begining steps i the felfillment of the objective of the research which is to Aud the necessary 
ane sufficient conditions for vortex trapping. Not only should the vortex trapping be efficicul 
and effective for two-dimensional (or airfoil) situations but also in the three-dimenstonal or 
wing situations. Furthermore, the trapped-vortex configurations should be efficient, casy to 
produce and maintain and not too onerous to implement on actual aircraft. With these 
gvicelines for the research program, it is concluded from the investigation presented here 
tial vortex trapping in two-dimensions is reaching a point of good understanding. More 
detailed studies not only with conformal mapping methods but also with other methods nerd 
lo be carried out to fill out the characteristics of trapped-vortex airfoils. As noted in the 
items listed] below, the most pertinent contributions of the present stucly to date include the 
introduction Of a second fence ta help control the characteristics of the trapped-vortex flow 
field. fa particular, the use of fence curvature and height to bring about the equilibrium or zero 
veloctly condition at the center of the vortex with negligible mass removal from the vortex core 
makes the trapped-vortex high-lift concept an efficient one. In this way the two-fence concept 
provides the uecessary tools in two-dimensions at least for producing efficient easily forntable 
high {18 sirfois. Phe other conchisions listed helow are essentially self explanatory. [t shoule 
he remarked, however, that the steps from two- to three-dimensions will require sone good 
ideas if the trapped-vortex flow fields are to be realized on real wings wherein only the local 
flaw fiekls are used as the suction needed for evacuating the vortex core. The special suction 
orifices used i two dimensions will not then be needed, Encouragement is provided however, 
by the saecess achieved with the two-dimensional resulta and i is believed that comparable 
success chi be achieved with three-dimensional configurations. 


CONCLUSIONS 


1. TWO DIMENSIONAL RESULTS INDICATE THAT TRAPPED 
VORTICES CAN PROVIDE LARGE AMOUNTS OF LIFT 
ENHANCEMENT. 


2. AN UPSTREAM AND A DOWNSTREAM FENCE APPEAR 
TO BE NECESSARY PARTS OF THE TWO-DIMENSIONAL 
TRAPPING PROCESS. 


3. FENCE HEIGHTS MUST BE ADJUSTED SO THAT SOURCE 
STRENGTH IS ZERO IN ORDER TO PROMOTE VORTEX 
FORMATION AND TO REDUCE DRAG. 


4. ADDITIONAL DESIGN GUIDELINES WILL NO DOUBT BE 
NEEDED FOR VORTEX TRAPPING ON WINGS IN THE 
FULL THREE-DIMENSIONAL ENVIRONMENT. 


Session XIII. Supersonic Laminar Flow Control 


1807 


THIS PAGE INTENTIONALLY BLANK 


1808 





M. F. Soper 


Toroa / royal albatross 


Native birds 


A toroa in flight is an unforgettable spectacle. The toroa or royal albatross is a graceful giant with a wing span of 
over three metres. Renowned ocean wanderers, they travel vast distances from their breeding grounds to feed 
— as much as 190,000 kilometres a year. They breed mainly on remote islands and spend at least 85 percent of 
their lives at sea, landing on water to feed or sleep. Along with the wandering albatross, royal albatross are the 
largest seabirds in the world. 

There are two species of royal albatross, southern and northern. The southern is slightly larger than the northern. 
At sea it can be distinguished from the northern by its white upper-wings with black edges and tips, whereas 

the upper-wings of the northern are completely black. Both species have a black cutting edge to their upper 
mandible, which sets them apart from adults of the closely related wandering albatross. Juvenile royal albatross 
have black flecks on their upper-parts. 

Toroa have great spiritual significance to many iwi. For example Moriori of the Chatham Islands wear plumes 
(raukura) of hopo (the local name for toroa) to signify their allegiance to the pacifist principles of the chief Nunuku 
Whenua. Taranaki iwi likewise wear toroa feathers to signify loyalty to the Parihaka prophet Te Whiti O Rongomai, 
a pioneer of non-violent civil disobedience. 


Auckland and Campbell islands, which are among the 
world’s most important seabird sanctuaries. 

Toroa breed only in New Zealand waters. The northern 

species nest on the Chatham Islands and at Taiaroa 

Head on the Otago Peninsula, one of only two places 


in the world where albatross breed at a mainland site. 
The southern species breeds on the subantarctic 


Royal albatross range throughout the Southern Ocean 
and are most commonly seen in New Zealand coastal 
waters during winter. 
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Did you know? 


Birds have been 
banded on Campbell 
Island since the 
early 1940s. Bands 
returned from dead 
and live birds caught 
at sea or found on 
beaches show a 
migration of young 
birds and non- 
breeders to southern 
South American 
waters and back 

to New Zealand. In 
other work, satellite 
transmitter packages 
developed by French, 
British and Australian 
researchers were 
used to track the 
birds’ flight paths. 
Signals were 
monitored by satellite 
and the information 
used to learn more 
about the birds 

while at sea. One 
bird travelled 13000 
kilometres during a 
two-week foraging 


trip. 











e While royal albatross may be graceful in flight, their 
large size makes them appear clumsy on land as 
they doggedly walk from their nests to exposed 
sites to take off into the wind. 


¢ Royal albatross usually mate for life, despite long 
separations at sea. Established pairs return to the 
same nesting area each time they breed. 


¢ Toroa spend most of their lives at sea, returning 
to land only to breed and raise their young. They 
start breeding at around 6-10 years old, each pair 
raising one chick every two years. 


e Joroa are some of the longest-lived birds in the 
world, regularly living into their 40s. One bird at 
Taiaroa known as ‘Grandma’, raised her last chick 
at the age of 62! 


Royal albatross males arrive at the nesting site first 
to prepare the nest. Both birds share incubation 
duty in spells of two days to three weeks (eight 
days on average) over a period of about 11 weeks 
— one of the longest incubation periods of any bird. 
After making a hole in the tough shell, chicks take 
about three to six days to finally emerge from the 


egg. 


e When the chick has hatched, the parents take 
turns at guarding and feeding it for the first five or 
six weeks. Chicks are then left unguarded, except 
for feeding visits, until they fledge at about eight 
months. After a successful fledging, the parents 
will leave the colony and spend the following year 
at sea before returning to breed again, completing 
a two-year cycle. If the pair fails to breed they 
may return in consecutive years until they are 
successful. 


e When a chick is fully fledged, it will stretch its wings 
and with the help of a large gust of wind, take off 
for its first major voyage. Young albatross spend 
several years away from New Zealand feeding in 
South American waters. 


e Adolescents return to look for a mate at 3-8 years 
of age. They gather on ridgelines where the male 
birds display their virility by stretching their wings, 
throwing their heads up and screaming raucously, 
while admiring females gather round and take part 
in elaborate courtship rituals. Sound familiar? 


e Although they eat some fish and other marine 
creatures, royal albatross feed mainly on squid 
plucked from the sea. 


Their slow reproduction rates, changes in habitat 

and climate and some fishing practices make toroa 
vulnerable. They have declined to the extent that they 
are listed as endangered by the International Union 
for the Conservation of Nature (IUCN) and ‘nationally 
vulnerable’ by the Department of Conservation (DOC). 


A storm on the Chatham Islands in 1985 destroyed the 
albatrosses’ nesting habitat, reducing the percentage 
of nests producing fledglings to as low as 3% in 

some years. There are encouraging signs that their 
population may be recovering from this event. 


Chatham Islanders have traditionally harvested royal 
albatross. However, since 1921 the birds have been 
fully protected. DOC is working towards a cooperative 
relationship with the traditional owners of the breeding 
islands that combines research with conservation. 


Long-line fishing, drift-netting and trawling are a 
threat to a lot of seabird species. Many albatrosses 
discover that fishing vessels offer an easy food 
source and will follow boats to feed on fish bait and 
discards. They may take the food without coming to 
any harm, but some get caught in fishing gear. While 
most fishing boats catch very small numbers of royal 
albatross, scientists are concerned that because 
there are so many fishing boats around the world, 
the total numbers caught may be having an impact 
on some albatross populations. The New Zealand 
fishing industry is strongly supporting DOC initiatives 
such as Southern Seabird Solutions, which promote 
better fishing practises that do not catch seabirds. A 
combination of regulation and innovative techniques 
such as bird-scaring lines, weighted lines, underwater 
bait-setting devices and retention of offal can reduce 
the by-catch of albatrosses. 


A convention prohibiting fishing with long drift nets in 

the South Pacific was signed in New Zealand in 1989 

and entered into force in 1991. This paved the way for 
a United Nations resolution in 1991 calling for a global 
moratorium on long drift nets on the high seas. 


Thousands of seabirds die in the northern hemisphere 
each year from swallowing small pieces of plastic. 
Although it is thought to be less of a problem in New 
Zealand, regurgitated plastics are often found beside 
royal albatross nests on Campbell Island. 





(oh! Mer-TaMtfolemat=)| ome) gel: We 


Certain fishing practices, such as longline fishing, 
trawling and drift netting, are a major threat to many 
seabird species. You can help by writing a letter to 
authorities around the world encouraging them to 
change unfriendly fishing practices in order to help 
these birds. 
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For further information about the royal albatross, visit 
your local Department of Conservation office or the 
DOC website at www.doc.govt.nz. 


A. Maloney 
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Summary 





¢ A brief introduction to modeFRONTIER 


¢ Description of MOdeFRONTIER direct interface for STAR-CCM+ 


¢ Application problem definition 


¢ Optimization results 
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is an integration platform for multi-objective 
optimization, automation of design processes 
and analytic decision making providing 


seamless coupling with engineering tools 
within various disciplines 
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ESTECO started in 1999 as a University spin-off. 


modeFRONTIER was the first commercial tool that allowed a MULTI-OBJECTIVE 
optimization applied to ANY engineering design area 


Now modeFRONTIER is used worldwide 





modeFRONTIER modeFRONTIER modeFRONTIER modeFRONTIER modeFRONTIER 
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Traditional Design 


Modity 
Configuration 
















Evaluate 
Results 


Yes ) 












Parametric 
models 
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Optimization Approach 


Design Objectives 


and Constraints | 





mode FRONTIER 


¥ 





Optimal trade-off 
Solution 
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The Concept behind modeFRONTIER 


Scheduler: 
(DOE, optimization algorithms...) 


= Wj wode FRONTIER < 
a" 


The Black Box: 


Input Variables: (ADAMS, ANSYS, GT-Suite, etc.) Output Variables: 
Entities defining the Measures from the 
design space. system 





modeFRONTIER can be coupled with most software (CAD, CAE or general application tools) and it 
enables the simultaneous use of a number of such software packages even on different machines 
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Response Surface Tool Statistical Analysis Multivariate Analysis 
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Direct interface with STAR-CCM+: how it works 





¢ Input parameters (simulation or geometry modeled within) are automatically introspected 
¢ Available output results are automatically introspected and can be selected 
¢ Optimization variables nodes are automatically created in the workflow 


¢ Optimization can be run changing the inputs and optimizing the selected outputs 
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¢ Optimization setup with external CAD and Optimate (STAR-CCM+) 


Inputs 
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The boomerang return is due to its interaction with the air that makes it work as a gyroscope. 
There are two kind of precessions: 

¢ @), responsible for the boomerang return 

¢ ©, responsible for the boomerang plane of rotation change 


To simulate accurately its trajectory, it is necessary to write its equations of motions, in which aerodynamics 
coefficients must be provided updated at each time step (since angle of attack and velocity changes) 
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F, , Fy , k, external forces components 





(ci, =2 T,, , Ty ,T, external torques components 
3 
F = (Fc an) V boomerang center of mass velocity 
a r Y boomerang angle of attack 
[Y= —_ (& sin? — EF, cos W) + — 
mV I3@, 
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Purpose of this study is to find a boomerang geometry and a set of launching parameters 
in order to: 


¢ 1. Minimize energy required for the launch obtaining a minimum launch range (>14m) 


¢ 2. Maximize the accuracy of return 
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4 CAD parameterization \ 
A candidate boomerang geometry is “ 
proposed 


STAR-CCM+ analysis 


Boomerang aerodynamic coefficients are found 
for 12 different angles Y and speed U 





/ RSM analysis 
| The 12 samples are used by mF to extrapolate 
| aerodynamic coefficients for any (2 U) pair 
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CAD CFD RSM Matlab - tuning 


Input_Variables 


mF_batch_prj scripts to _create_rsm 


oO 


DOE create _Meteb_RSM launch_parameters_tuning Exit 
ve we EXIT 
oo - =a oh 


transf_rsm_c 


>So 





[2 Cm_x Cm_y Ccmz I3 volume 
Vv Vv Vv v v V 

Lay Lay LA, Lay Lay Lay Range Energy Accuracy 
fs) a) O fs) a) fe) y 


* 


oO 


constr_range min_Energy 
V 


7 
ae # 
~ “a 


The main objective is to find a boomerang geometry which minimizes the Energy required for its 
thrown, satisfying at the same time a constraint on the range 
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The boomerang shape is modified by a CAD parametric model CAD 
9 geometry parameters have been considered, including: 


¢ Blade profiles (9 Bezier control points) 
¢ Dihedral angle 
e Angle between arms 
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The main workflow launches for each candidate geometry a new mF workflow that executes a DOE 
of (12) STAR-CCM+ analysis changing the value of angle Y and speed U 





CFD simulation via STAR-CCM+: Mesh 





¢ Two domains are defined: a sohere around the boomerang which rotates with it at each time step of 
its spin (Y, U are fixed), and a fixed domain in the rest of domain 


¢ The mesh (2.5 millions of cells) is polyhedral within the sphere around the boomerang, with prisms 
layers at the boomerang walls, and hexahedral in the rest of the domain 


¢ The STAR-CCM+ General Grid interface is used to merge the two domains 
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CFD simulation via STAR-CCM+: CFD analysis 


¢ The two-equations RANS SST (Shear Stress Transport) turbulent model, with wall functions, is chosen 
and a segregated solver with constant density is employed 


¢ A full not-stationary analysis is run over a proper interval of time until the flow becomes periodic (after 
about 5-6 spin periods) 
WU fixed 
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RSM 
The set of (12) STAR-CCM+ analysis (yellow points) is used to train a Response 


Surface (Radial Basis Function) available in modeFRONTIER, to extrapolate the 
response for any value of angle Y and speed U 
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modeFRONTIER inner workflow (Follower Optimization 


Launching parameters: 


Matlab - tuning 
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The internal objective for each candidate geometry is to find the launching parameters which 
minimize the arrival distance (returning accuracy) 














modeFRONTIER Optimization Results 


¢Simplex algorithm (39 designs only) is used to find the optimal solutions 
¢ One solution is selected as optimal compromise 


result 


Accuracy 





Energy 
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Optimal geometry 


Optimal launching parameters 


¢ The initial spin is about 4Hz 
¢ The initial velocity is 15m/s 
¢ The tilt angle is about 0° 

¢ The aim is about 20° 
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The boomerang shape optimization here proposed shows how efficiently and powerfully a 
complex and multi-disciplinary optimization problem can be set up in modeFRONTIER 


In particular, the new direct interface with STAR-CCM+ allows to define the automatic 
integration and execution of any STAR model in the optimization workflow 


Any problem of industrial relevance can be optimized with modeFRONTIER, as confirmed by 
many of our customers including many leading companies working with STAR-CCM+ (please 
check www.esteco.com for more details) 





Thank you! 


ESTECO 
Area Science Park 
Padriciano, 99 


34149 Trieste - Italy 


e-mail: engineering@esteco.com 


ESTECO North America 
3955 Orchard Hill Place , Suite 430 
Novi, MI 48375 
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Order PROCELLARITFORMES 


A rather distinct group of some 80-100 species of pelagic seabirds, ranging in size from huge to tiny and in habits 
from aerial (feeding in flight) to aquatic (pursuit-diving for food), but otherwise with similar biology. About 
three-quarters of the species occur or have been recorded in our region. They are found throughout the oceans 
and most come ashore voluntarily only to breed. They are distinguished by their hooked bills, covered in horny 
plates with raised tubular nostrils (hence the name Tubinares). Their olfactory systems are unusually well 
developed (Bang 1966) and they have a distinctly musky odour, which suggest that they may locate one another 
and their breeding places by smell; they are attracted to biogenic oils at sea, also no doubt by smell. Probably they 
are most closely related to penguins and more remotely to other shorebirds and waterbirds such as Charadrii- 
formes and Pelecaniiformes. Their diversity and abundance in the s. hemisphere suggest that the group originated 
there, though some important groups occurred in the northern hemisphere by middle Tertiary (Brodkorb 1963; 
Olson 1975). 

Structurally, the wings may be long in aerial species and shorter in divers of the genera Puffinus and Pel- 
ecanoides, with 11 primaries, the outermost minute, and 10-40 secondaries in the Oceanitinae and great 
albatrosses respectively. The tail varies in length, being forked in Oceanodroma, forked to pointed in other forms, 
usually with 12 rectrices but up to 16 in fulmars. The tarsi are light and cylindrical in aerial forms; strong and 
laterally compressed with legs set far back in aquatic ones. The front toes are webbed; hind toe small or absent. 
The proventriculus is long and glandular; the gizzard small and twisted; and the small intestine often spiral in 
Pterodroma, presumably to aid absorption of the unusual lipids in their food. Chicks are helpless and covered in 
down, with two coats except in some Oceanitinae. Some larger species have a darker immature plumage, and the 
female is often darker than the male in the great albatrosses. The male is usually larger than the female, though 
smaller in the Oceanitinae and some other small species. Otherwise there is little difference in appearance with 
sex or age, except that young birds may have more pronounced pale or dark edges to the feathers. Many have 
simple counter-shaded markings that often appear to have given rise to uniformly dark or, less often, to pale 
derivatives; some species in most groups are dimorphic or polymorphic. The more complex groups have often 
developed distinctive markings of the extremities. 

Breed more or less colonially on offshore islands, coastal cliffs, or on hills and deserts inland, where they 
perform complex vocal and aerial displays. The nest is a simple scrape or cup in a burrow or natural hole, 
sometimes under vegetation. The s. albatrosses build large cone-shaped nests in the open; may be lined with any 
debris available in the area. Smaller species visit it only at night, though larger ones and those breeding on remote 
islands may come to nests in the open by day. Parents incubate for spells of several days in turn and generally leave 
the chick alone soon after it hatches, only returning at long intervals to feed it by regurgitation. In consequence 
the chick is vulnerable to introduced predators and some species are now greatly reduced and at least two are now 
extinct. Some species also periodically liable to have unsuccessful breeding seasons. Many young or even old birds 
may be wrecked ashore and die when they meet bad weather or suffer shortage of food on migration or in the 
winter. Though it has been claimed that they are also vulnerable to all sorts of pollution, the evidence is weak 
(Bourne 1976). There is at present anxiety about the effect of some fishing methods, such as long-lining, which 
may be endangering species such as the great albatrosses. 

All species feed at sea on a variety of fish, cephalopods and small marine invertebrates, either socially or 
alone; larger species may scavenge all sorts of offal or prey on other birds. Most, except perhaps Pelecanoides, can 
digest the complex lipids formed by some marine animals (Clarke & Prince 1976), and may eject them to soil the 
plumage of their enemies with lethal results (Swennen 1974). Some species can digest wax (Obst 1986). Many now 
take wastes from whaling and fishing operations (Fisher 1952). All have long life-cycles in proportion to their size; 
they disperse on fledging and then prospect for nest-sites for 2-12 years in their youth. They usually lay a single 
large white egg annually; though a successful breeding cycle may be completed in less than a year in at least one 
tropical species, Puffinus lherminieri, it may take 2 years in larger southern ones. Before laying, the birds court for 
weeks or months, then go to sea for feeding. Incubation lasts 6-8 weeks, and fledging 2-9 months. Once the fat 
chick fledges it fends for itself, even in species that immediately make a long migration, sometimes to the opposite 
hemisphere. 

Tendency for failed breeders and non-breeders to begin moult before successful breeders. Five strategies of 
wing-moult in breeding adults: (1) In albatrosses, remiges replaced in staffelmauser interrupted while breeding; in 
nearly all other species, primaries moulted outwards; possibly simultaneously in some diving-petrels. (2) In most 
subantarctic and temperate species, moult begins soon after breeding and is completed shortly before next 
breeding season. (3) In most tropical species, moult aseasonal, between breeding attempts; resumption of breeding 
apparently depends on when moult completed. (4) In trans-equatorial migrants, wing-moult delayed until they 
reach non-breeding quarters, where it is completed; moult rapid but no satisfactory evidence for flightlessness. In 
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some species, body-moult also in winter quarters; in others, at breeding grounds. (5) In some species of high 
latitudes, rapid moult completed in summer when they breed: some begin moult long before breeding fin- 
ished. 

The history of the classification of the Order is very confused, as is seen by comparing Timmermann’s (1965) 
discussion of their Mallophagan parasites with that by Klemm (1969) of their leg muscles and that by Harper 
(1978) of their proteins, but it is now widely agreed that the Order is best divided into four families: Diomedeidae 
or large to huge aerial albatrosses; Procellariidae or medium-sized, mainly aerial but sometimes aquatic, petrels, 
shearwaters and prions; Hydrobatidae or small to tiny, aerial storm-petrels; and Pelecanoididae or small aquatic 
diving-petrels. 
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Family DIOMEDEIDAE albatrosses 


The albatrosses are a small group of some 13-16 large petrels with long wings adapted for gliding flight and with 
long powerful beaks adapted for seizing prey at the surface; nine species breed in our region and one other has been 
recorded as a vagrant. Because they are so large, they must breed in the open, where they walk well for petrels. Most 
s. species build substantial conical nests but n. ones, breeding in warm climates, make only scrapes. Young birds in 
some species have a drab plumage but adults of many species develop bolder markings with brightly coloured 
stripes on the bill, used in social displays when breeding. Three distinct eroups occur in the Southern Ocean but 
the distinction between two is blurred by intermediate forms that occur in North Pacific: 

(1) The great albatrosses are huge, long-winged, long- and pale-billed, short-tailed birds that glide round the 
world in Southern Ocean. Until recently, there were thought to be two species: the more pelagic Wandering 
Albatross D. exulans breeding on most of the subantarctic islands, which is dark with a white underwing when 
young, becoming more or less white with dark wing-tips when adult; and the more coastal Royal Albatross D. 
epomophora, breeding round NZ, which resembles the extreme white adult Wanderer throughout its life but has a 
dark cutting-edge to the upper mandible. A few birds breeding in extreme immature exulans-type of plumage on Ile 
Amsterdam in the Indian Ocean have recently been described as a third species D. amsterdamensis but there is 
continuing debate whether this is justified owing to the occurrence of similar populations in South Atlantic and 
round NZ (Bourne 1989). 

(2) The medium-sized albatrosses Diomedea (Thalassarche), often called mollymawks, are a compact group of 
white-bodied, dark-backed species with brightly marked bills in adults, all five species being found in our region. 
They consist of two comparatively coastal species, the Black-browed Albatross melanophrys with main breeding 
colonies round South America, and the Shy cauta with 3-4 rather well-defined subspecies, sometimes treated as 
separate species, breeding in A’asia. There are also three pelagic species: Grey-headed chrysostoma to the south, 
Yellow-nosed chlororhynchus in subtropical South Atlantic and Indian Oceans, and Buller’s bulleri in equivalent 
parts of South Pacific. 

The differences between Groups (1) and (2) are rather marked and they would doubtless be treated as distinct 
genera if it were not that four other albatrosses with intermediate characters breed in North Pacific: Black-footed 
nigripes with plumage resembling that of sooty albatrosses, though shape differs; Laysan immutabilis with plumage 
like that of the medium-sized albatrosses (Group 2); Short-tailed albatrus with a sequence of plumages rather like 
those of Wanderer, though smaller; and the Waved irrorata with dark plumage except for pale head, neck and 
underwing. Because it is hard to make any clear distinction between these birds they are normally all included in an 
unusually wide genus Diomedea. 

(3) The sooty albatrosses Phoebetria. Two extremely aerial, highly pelagic and rather aggressive or predatory 
species with fairly small bills with a groove along the lower mandible; long wings; long pointed tails; and dark 
plumage; nest on steep places and have vocal aerial displays. 
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General characters are: body, short and broad; head, large; neck, short. Wing, long and narrow, folded in 
three almost equal parts, 11 primaries, p10 longest, p11 minute; up to about 40 secondaries, diastataxic. Tail, short 
and square in Diomedea, longer and wedge-shaped in Phoebetria, 12 feathers. Bill, heavy and composed of several 
horny plates; hooked; nostrils in tubes on either side. Legs, strong; three front toes joined by web; hind toe absent 
or vestigial. Oil gland, feathered. Sexes similar; male larger on average. Plumage mainly white except in Phoebetria, 
in which it is dark grey. Juveniles and immatures generally separable but mostly not very different from adults 
except in D. exulans and D. albatrus; fully adult plumage attained only after several years. Stance upright and able to 
walk much better than most other Procellariiformes. Swim and rest on sea buoyantly with head held high. Feed 
mostly on fish and squid by surface-seizing or shallow diving, but sooty albatrosses also take birds. Follow ships for 
scavenging. 








Fig. 1 Dorsal view of base of bill of small Diomedea 


Fig. la Black-browed Albatross D, melanoprys Fig. 1d Yellow-nosed Albatross, D. chlorohynchos, subspecies bassi 
Fig. lb Grey-headed Albatross D. chrysostoma Fig. le Shy Albatross D. cauta 

Fig. lc Yellow-nosed Albatross, D. chlorohynchos, subspecies Fig. lf Buller’s Albatross D. bulleri 

chlorohynchos 


Long-lasting monogamous pair-bond. Breed colonially, pairs often returning to same site. Defend small nest- 
territories. Perform spectacular agonistic and sexual displays at nest in Diomedea; vocal aerial displays in Phoebetria. 
Eggs, white, minutely spotted reddish. Clutch-size; one; no replacement laying. Incubation by both sexes in long 
alternate spells. Incubation period, 2 or more months. Nestling, semi-altricial, nidicolous; hatched in down. 
Brooded for a short time after hatching; then left alone in nest, parents returning only to feed chick by incomplete 
regurgitation. Nestling period long, up to 12 months, and so in some species successful adults cannot breed 
annually. Young independent on fledging. Maturity reached only after several years. Some populations were 
reduced in the past, notably by egg-collecting, but there appear to be few threats now except that some great 
albatrosses are caught by long-line fishing. 


REFERENCES 
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Diomedea exulans Linnaeus, 1758, Syst. Nat., ed. 10, 1: 132; based chiefly on ‘The Albatross’ of Edwards, 
1747 — Cape of Good Hope ex Edwards. 


The genus is named after Diomedes, King of Argos, next to Achilles the bravest hero in the Greek army at 
Troy. When he died, his companions were inconsolable and were metamorphosed into birds (aves 
Diomedeae). The specific name refers to the wandering habits of the species (exsulans, living as an exile). 


OTHER ENGLISH NAMES Cape Sheep; Man-of-War Bird; Great Wandering, Snowy or White-winged Albatross. 
POLYTYPIC Nominate exulans Linnaeus, 1758, Tristan da Cunha & Gough IL. chionoptera Salvin, 1896, S. 


Georgia, islands in s. Indian Ocean and Macquarie I.; perhaps two other subspecies on Antipodes Is and 
Auckland and Campbell Is (Robertson & Bell 1984). 
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FIELDIDENTIFICATION Length 110-135 cm; wing- 
span 250-350 cm; weight 6-11 ke. Huge full-bodied albatross 
with extremely long wings and short, gently wedge-shaped 
tail; bill, huge and pink with bulbous tip. Similar only to other 
great albatrosses — Royal D. epomophora and Amsterdam D. 
amsterdamensis. Much larger than the black-backed alba- 
trosses (mollymawks). Juveniles have mostly dark plumage 
which gradually whitens over along period. Extent of white of 
breeding adults varies between individuals, sexes and geo- 
graphically (n. populations tend to be darker). Harrison (1979, 
1985, 1987; on which account based) proposed stages of plu- 
mage for field identification; from 1 (darkest and youngest) to 
7 (oldest and whitest). Sexes similar; males average larger and 
whiter than females. No seasonal variation. 
DESCRIPTION STAGE 7. Head, neck and body, 
white; often with pink to orange-pink stain on ear-coverts; 
some with trace of light-grey vermiculations on hindneck and 
saddle. Tail, white; black outer tail-feathers retained on some. 
Upperwing, mostly white with black remiges and scattered 
white patches on mostly black primary coverts; some darker 
patches and mottling on outer-half of inner wing-coverts, cre- 
ating chequered transition between white inner forewing and 
dark trailing-edge and outerwing. Underwing, white with 
black remiges forming thin dark trailing-edge and large dark 
tip. STAGE 1 JUVENILE). Head, neck, body and tail, wholly 
chocolate-brown except for conspicuous white mask. 
Underwing, as adult except for thin black margin between 
carpal joint and base of outermost primary and chocolate- 
brown pre-axillary notch, conspicuous at base of inner leading 
edge of wing. STAGE 2. Like Stage 1 but with white mottling 
appearing on hindneck, saddle, rump, upper tail-coverts and 
centre of inner upperwing behind elbow. Belly and flanks, 
mostly white with brown breast-band and under tail-coverts. 
STAGE 3. Head and neck, mostly white except for brownish 
crown and mottling on sides of neck. Body, mostly white with 
brown and grey vermiculations on saddle and flanks and in- 
distinct breast-band. Tail, white with black sides and tip. 
Upperwing, mainly blackish brown with small whitish patch 
on centre of innerwing behind elbow. Underwing, as Stage 1 
and 2; pre-axillary notch still present but may be paler. STAGE 
4. Like Stage 3 but head and body whiter and breast-band 
paler, greyer and less distinct; tail, whiter with dark sides and 
tips of outermost feathers usually retained. Underwing, like 
Stage 3 but lacks pre-axillary notch. STAGE 5. Like Stage 4 
except head and body mostly white; tail mostly white, usually 
with dark outer feathers. On upperwing, white central patch 
behind elbow linked to white of back, forming white wedge 
extending from body into innerwing, leaving broad black 
leading- and trailing-edges. Underwing, as Stage 4 but black 
leading-edge between carpal joint and base of outermost pri- 
mary, thinner and not continuous. STAGE 6. Like Stage 5 
except dark leading-edge of upperwing heavily blotched with 
white or mainly white; white central wedge extends forwards 
and outwards, merging with white of leading-edge. Unlike in 
Stage 7, usually retain some dark inward of carpal joint, thus 
less white on innerwing; transition between white of forewing 
and dark of trailing-edge and outerwing chequered, as on 
Stage 7. Tail usually retains some dusky markings though 
some birds have wholly white tail. Bare parts of all birds. Bill, 
large, with bulbous maxillary unguis; pinkish horn with 
yellowish-horn ungues forming contrasting paler tip; flushes 
to brighter pink, especially in breeding birds. Nostrils, small, 
on sides of bill, opening slightly upwards. Iris, brown. Legs 
and feet, flesh-pink with bluish-coloured webs and horn-col- 


oured claws. Feet project well beyond tip of tail in flight; 
sometimes tucked forward into belly feathering. 

SIMILAR SPECIES Most likely to be confused with 
Royal Albatross. At close quarters, Royal recognized by dark 
cutting-edge of upper mandible. Head and body always clean 
white except for a little brown mottling on cap of some imma- 
tures (sanfordi only) and black speckling on lower back and 
rump in juveniles; never has brownish body plumage and ver- 
miculations of Wandering Albatross Stages 1-4 nor greyish 
vermiculations of Wandering Stages 5-6 (sometimes 7) that 
give dirty appearance to body. In Royal Stages 2-5, tail en- 
tirely white; in Stage 1, tail mostly white with narrow black 
terminal band, never with dark sides as on nearly all Wan- 
dering. Main confusion lies between Wandering Stages 4-7 
and Royal Stages 1-5 of southern subspecies, which differ in 
progressive whitening of upperwing: in Royal, wing whitens 
from leading-edge of innerwing backwards (in Wandering, 
white patch in centre of innerwing links with white of saddle 
to form white wedge between dark leading- and trailing-edges; 
white wedge then expands forward and outwards, merging 
with white of leading-edge until inner forewing mostly white); 
in Royal, white of inner forewing merges evenly into dark 
trailing-edge by way of white dusting on marginal coverts and 
delicate white fringing to lesser and median secondary coverts 
(in Wandering, transition between white of forewing and dark 
of trailing-edge and outerwing, heavily blotched giving more 
chequered appearance). Separation of Royal and Wandering 
in whitest stages (5 and 7 respectively) difficult because up- 
perwing pattern very similar; apart from diagnostic black 
cutting-edge of bill in Royal, black in tail indicates Wander- 
ing. Subtle distinctions in jizz often helpful: Royal has longer 
bill and more sloping, less angular forehead profile giving 
more rounded, slender-headed appearance (Wandering tends 
to look flatter-crowned with shorter, often pinker bill). Very 
rare Amsterdam Albatross, similar to Wandering Stages 1-3 
but has diagnostic dark-brown cutting-edge to upper man- 
dible, which appears black at distance, and greenish-brown 
ungues, forming dusky tip contrasting with rest of bill. All 
other albatrosses of s. oceans have black backs; are much 
smaller with proportionately smaller bill and shorter wings; 
feet do not project beyond tail; black leading-margin on inner 
underwing. Much greater size, mostly white underwing, white 
mask and pink bill separate brown-bodied Wandering Alba- 
trosses from giant-petrels Macronectes spp and sooty alba- 
trosses Phoebetria spp. 

Circumpolar in s. oceans; adult females, juveniles and 
immatures occur farther N than adult males which tend to 
range in higher latitudes. Occupy wide range of marine habi- 
tats; occur in pelagic waters but also concentrate in shelf- 
break and continental shelf waters. Commonly seen from 
land. Graceful in flight: long sweeping glides and soaring on 
stiff outstretched wings, rarely flapping except in calm con- 
ditions, when flight laboured with deep pliable wing-beats. 
On calm days, spend much time sitting on sea. Take-off and 
flight in light winds, laboured; long run-off before becoming 
airborne, heavy flapping to stay aloft. Feed by surface-seizing, 
occasionally surface-plunging, pursuit-plunging and surface- 
diving. Habitually follow ships; attend fishing vessels, squab- 
bling over offal. Solitary or gregarious at sea. Give harsh 
croaking at sea and croaking, groaning and whining calls at 
colonies. 


HABITAT Marine, pelagic and aerial. In Antarctic, oc- 
cur in open water, rarely entering belt of icebergs (Falla 1937: 


Hicks 1973), but in late summer birds may approach edge of 
pack-ice (Darby 1970); in winter, uncommon § of Antarctic 
Convergence indicating northward shift in distribution (Szijj 
1967). At Iles Crozet, males feed over Antarctic waters; fe- 
males and immatures in Subtropics (Weimerskirch & Jouven- 
tin 1987). Observed over waters with surface-temperatures 
from -2 to 24°C (Bierman & Voous 1950; Grindley 1981); in 
s. Indian Ocean prefer waters of 6.3-7.7 °C (Rand 1963); off 
South America, 8-12 °C Jehl 1973); in South Atlantic Ocean, 
two zones of increased abundance at 0 °C and 9-20 °C (Bier- 
man & Voous 1950). Versatile feeders in shelf or pelagic 
waters; some birds concentrate over shelves off breeding 
islands or continental land masses, in productive waters at 
upwellings and boundaries of currents; others remain in pe- 
lagic waters all year (Weimerskirch et al. 1985). In Antarctic, 
concentrate near submarine plateaux, banks and ridges (John- 
stone & Kerry 1976). In A’asian region, occur inshore, off- 
shore and in pelagic waters (Norris 1967; Barton 1979, 1980; 
Blaber 1986), regularly feeding in sheltered harbours and 
straits (Secker 1969). Gather at outfalls of unmodified sewage 
(Milledge 1977}. 

Breed on subantarctic and Antarctic islands in Indian 
Ocean, Atlantic Ocean and seas S$ of NZ. Nest on coastal or 
inland ridges, slopes, plateaux and plains, often on marshy 


ground (Falla 1937; Bailey & Sorensen 1962; Rand 1954, 
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1963; Warham & Bell 1979; Weimerskirch et al. 1986). Prefer 
open or patchy vegetation (tussock, fern or shrubs) for easy 
access, near exposed ridges or hillocks for take-off (Warham 
& Bell 1979). 

Fly within 15 m of sea surface, using updraft from wave 
fronts for lift; circle over breeding islands to heights of at least 
1500 m (Murphy). Feed from surface or just below, or make 
shallow dives from heights of 2-5 m (Voisin 1981; Harper 
1987). 

Decline in breeding populations at Iles Crozet and 
Macquarie I. attributed to accidental and deliberate killing at 
commercial fishing grounds (Tomkins 1985b; Weimerskirch 
& Jouventin 1987). At Iles Crozet, females and immatures 
suffer higher mortality because they feed over offshore sub- 
tropical waters where fishing grounds concentrated (Wei- 
merskirch & Jouventin 1987). 


DISTRIBUTION AND POPULATION Circumpo- 
lar; in Antarctic, subantarctic and subtropical waters of Atlan- 
tic, Pacific and Indian Oceans. 

Pelagic range in summer, mid-ocean in Indian, South 
Pacific and Atlantic Oceans; occasionally to 67°S in Weddell 
Sea and § of 68°S in South Pacific (Gain 1914; Falla 1937; Siple 
& Lindsey 1937; Fleming 1950; van Oordt & Kruijt 1954; 
Holgersen 1957; Dell 1960; Gill 1967; Ozawa 1967; Darby 
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1970; Tickell & Woods 1972; Johnstone & Kerry 1976; Zink 
1981; Jouventin et al. 1982; Thurston 1982; Ainley et al. 1984; 
McQuaid & Ricketts 1984; Meeth & Meeth 1986; Stahl 
1987). Small numbers occur off South Africa, N to 26°S off 
sw. coast (Lambert 1971); in s. Aust. and NZ waters (Gibson 
1967); w. South America, N to 33°S (Murphy); e. South Amer- 
ica to 38-43°S (Bennett 1948; Ozawa 1967: Tickell & Woods 
1972) and occasionally to 24°S off Brazil (Dixon 1933). 

In winter and spring; pelagic range farther N: rare S of 
Polar Front (Falla 1937; Harrison 1962; Ozawa 1967; Szijj 
1967; Summerhayes 1969; Jouventin et al. 1982; Tickell & 
Woods 1972; Thurston 1982; Bourne & Curtis 1985; Eakin et 
al. 1986; Clark 1987; J-C. Stahl). Common off sw. Africa, 
Mar.-Oct., N to 17°S (Summerhayes et al. 1974, Hansen 
1978). Maximum abundance: w. Tasman Sea, May; e. Tasman 
Sea, May-June; NSW, mid-June to mid-Sept.; NZ coastal 
waters, June-July; central Tasman Sea, Aug. (Gibson 1967). In 
Humboldt Current, normally N to 33°S (Murphy), though 
one recovery 21°S (Derenne et al. 1972). 

In s. Indian Ocean, juveniles present only in n. part of 
range; occuring to 35-45°S in summer (Jouventin et al. 19872, 
Stahl 1987; Weimerskirch & Jouventin 1987), 30-38°S in 
winter and spring (J-C. Stahl). In South Atlantic, n. records 
mostly subadults (Tickell & Woods 1972). N. records in sw. 
Pacific mostly dark birds, either subadults or dark adults from 
Antipodes Is (Summerhayes 1969; Jenkins 1980, 1986). Males 
range farther S (mostly 50-60°S) than females (35-45°S) in s. 
Indian Ocean in summer (Weimerskirch & Jouventin 
1987). 

Occasional: Fiji, May—Oct. (Jenkins 1986); Tonga, July- 
Nov. (Jenkins 1980). Accidental: St. Helena (Wilkins 1923), 
Réunion (Milon & Jouanin 1953), Mauritius (Murphy), 
Marquesas (Pratt et al. 1987), Tuamotu Arch. (Robertson 
1972a). Records in n. hemisphere: immature Panama, [?] Aug. 
1937 (Murphy 1938); female California, July 1967 (Paxton 
1968); immature sighted off Portugal, Oct. 1963 (Bourne 
1966); immature male killed off Palermo, Sicily, Oct. 1957 
(Orlando 1958). 
BREEDING 


Localities and populations given in 
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Table 1. Last bred Tristan da Cunha in 1907, almost certainly 
eliminated by man (Hagen 1952; Fraser et al. 1988). Last bred 
Falkland Is, 1959 (Woods 1975). Recent annual declines re- 
ported: Iles Crozet, 6.2%, 1969-76; Ile de la Possession, 2.5%, 
1977-86 (Weimerskirch & Jouventin 1987); S. Georgia, 1.1%, 
1961-86 (Croxall et al. 1988a); Tle aux Cochons, 1974-82, 
12.5% (Voisin 1984). At Iles Kerguelen, colonies under study 
decreased from 224-255 pairs 1971 to 125-129 pairs in 1985 
(Weimerskirch et al. 1989); at Macquarie I., 29 pairs 1968 to 
5-7 pairs 1979-82 but fluctuations from year to year (Roun- 
sevell & Brothers 1984). Declines probably because more and 
more immatures and adults caught in long-line fishing oper- 
ations; possibly also by trawling Jlouventin et al. 1984, Tom- 
kins 1985b; Croxall et al. 1988a,b). At Inaccessible I., popu- 
lation decreased from c. 200 pairs in early 1870s to two pairs 





Table 1, 
Locality Year Population Reference 
(breeding pairs/‘y) 
Inaccessible I. 1982-87 3 2 
Gough I. 1982 798+ 9 
Prince Edward Is 1984 2461+ 9 
Iles Crozet 1981-82 1962 4 
Iles Kerguelen 1984-8 1095 10 
Heard I. 1980 1 3 
Macquarie I. 1982 fi q 
5. Georgia << 1984 4300* I 
Antipodes I. 1969 L000 8 
Auckland Is (250 5 
1975-77 1 6 


Campbell I. 


*" Estimate of total population. 

References: (1) Croxall et al. 1984; (2) Fraser et al. 1988; (3) Johnstone 
1982; (4) Jouventin et al. 1984; (5) Robertson 1975: (6) Robertson 
1980; (7) Rounsevell & Brothers 1984; (8) Warham &. Bell 1979; (9) 
Watkins 1987; (10) Weimerskirch et al. 1989, 


1938, because pigs took eggs and chicks; population stable at 
very low level since (Fraser et al. 1988). 


MOVEMENTS Migratory or dispersive. 

DEPARTURE Chicks fledge: Iles Crozet, 25 Nov.- 
2 Feb. (Fressanges du Bost & Segonzac 1976); [les Kerguelen, 
21 Nov.-5 Jan. (Paulian 1953); $8. Georgia, 17 Nov.-18 Jan. 
(Tickell 1968. Croxall & Prince 1987); Marion I., mid-Dec- 
mid-Jan. (van Zinderen Bakker 1971); Campbell I., late Dec- 
mid-Feb. (Bailey & Sorensen 1962); Gough I., Jan-early Feb. 
(Swales 1965); Auckland Is, mid-Jan-mid-Feb (Bailey & Sor- 
ensen 1962). Antipodes Is, late Jan-Mar. (Warham & Bell 
1979). Non-breeders leave Macquarie I., early Jan.-early Apr., 
failed breeders leaving within a few days of failure (Tomkins 
1985a). 

NON-BREEDING Banding returns suggest adults 
and young disperse or migrate E (Tickell 1968; see Banding). 
Birds from Gough I. apparently confined to South Atlantic 
and South African waters; chicks recovered off Cape (one) 
and Tripp Seamount 30°S 14°E (one); one adult Port Elizabeth 
(Cooper 1988) and those breeding Antipodes, Auckland and 
Campbell Is apparently restricted to Tasman Sea and South 
Pacific Ocean (Robertson 1972a,b, 1973; Warham & Bell 
1979; see Banding). Movements of birds from s. breeding sites 
probably circumpolar, but definite proof still lacking. Some 
birds breeding S$. Georgia and Iles Crozet migrate regularly to 
NSW (Tickell 1968; Weimerskirch et al. 1985). Birds breed- 
ing Iles Crozet recovered NSW either during non-breeding 
year after successful cycle, or during same season after loss of 
egg; one bird recovered NSW in seven different years (Wei- 
merskirch et al. 1985). Frequency of beachcast birds in NZ 
highest Jan., coinciding with fledging of chicks from nearby 
subantarctic islands, and June, which coincides with arrival of 
young birds from breeding sites on opposite side of Antarctic 
and moult in adults, making them more vulnerable to bad 
weather (Powlesland 1985). Juveniles have separate but over- 
lapping range to adults; 177 birds seen in Apr. off se. Tas., 
almost entirely brown-plumaged birds (15% chocolate-brown, 
85% stages 4-6; Blaber 1986). In Indian Ocean, juveniles 
found only N of 45°S (Falla 1937; Rand 1963; Jouventin et al 
1982; Stahl 1987; Weimerskirch & Jouventin 1987). Imma- 
tures also appear to have more northerly distribution in South 
Atlantic (Tickell & Woods 1972) and Tasman Sea in winter 
(Summerhayes 1969). Banding at feeding concentration off e. 
NSW indicates that birds of all age groups from all s. breeding 
colonies visit area, principally July-Nov. (Gibson 1967), but 
most appear to stay for only a short period (Tickell & Gibson 
1968). Two feeding sites, 56 km apart, appear to hold largely 
independent populations (Tickell & Gibson 1968) and 
suggestions that pairs or members of same colony travel to- 
gether (Tomkins 1984a) at present unsubstantiated (Sudbury 
et al. 1985). 

RETURN Between each nesting cycle, adults that 
fail to breed before June usually absent from colonies for 
about 5-12 months; those that fail after June for about 12-17 
months (Tickell 1968). Mean date of return to Iles Kerguelen, 
17 Nov. (Paulian 1953); Iles Crozet, 19 Nov. (14-23 Nov; 
Fressanges du Bost & Segonzac 1976); 5. Georgia, males 29 
Nov., females 10 Dec. (Tickell 1968); Macquarie I., late Nov. 
with non-breeding birds returning late Nov.-late Jan. (Tom- 
kins 1985a); Auckland Is and Inaccessible I., early Dec. (Elliott 
1957; Bailey & Sorensen 1962). Most birds return to breeding 
site but non-breeding male banded Macquarie I. recovered 
breeding at Heard I. 13 years later (Johnstone 1980); juveniles 
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from Marion [., and juveniles and adults from Iles Crozet 
recovered at other breeding localities (Weimerskirch et al. 
1985; Gartshore et al. 1988). Only known instance of gene- 
flow between populations, one fledgeling from Iles Crozet 
breeding at Marion I. three years later (Mougin 1977). 

BREEDING Travel up to 15 200 km between incu- 
bation bouts. One bird from Iles Crozet tracked by satellite 
recorded flying SW to Antarctica and back past Heard and 
Kerguelen Is; another flew 3664 km to NW, travelling at 62.7- 
81.2 km/h, mostly (87.1-90.9%) during daytime. Distance 
travelled related to wind speed; birds were almost stationary in 
centre of high-pressure zones for 1-7 days. After hatching, 
foraging distance 330 and 381 km for one male (Jouventin & 
Weimerskirch 1990), Other records: adult with young on Iles 
Crozet recovered 1800 km N (Weimerskirch & Jouventin 
1987), incubating bird dyed on Iles Kerguelen seen 1420km5 
(Weimerskirch et al. 1988); adult with young on 5. Georgia 
sighted off Falklands and Argentinian coast (Croxall et al 
1988b); birds banded Macquarie I. recaptured 1600 and 2240 
km N during breeding season (Tomkins 1985a); estimated for- 
aging range from 5. Georgia 1478 km (Croxall & Prince 1987). 
However birds breeding Iles Crozet frequently forage over 
neritic waters (Weimerskirch et al. 1986). Tropical cephalo- 
pods in diet of S. Georgian birds suggest most foraging N of 
Island (Prince & Morgan 1987); fish taken all characteristic of 
S. Georgian shelf (Croxall et al. 1988b). 

BANDING Returns from NZ subantarctic islands 
(NZNBS) summarized Fig. 1; from Iles Crozet (CRBPO), Fig. 2; 
Prince Edward Is (SABRU), Fig. 3; NSW, Aust. (ABBBS), Fig. 
4. A’asian returns of birds banded S. Georgia (BBL) summar- 
ized Fig. 5. Recovery of adults not significantly different from 
that of juveniles (Weimerskirch et al. 1985). 


FOOD Mostly cephalopods, many probably scavenged, 
also fish, rarely crustaceans. BEHAVIOUR. Most food taken by 
surface-seizing (Griffiths 1982), also recorded shallow-plung- 





Fig. 1. 345 151E 10X10 “> ABBBS 
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Fig. 2. 41S 176E 10X10 NZNBS 





Fig. 3. 465 51E 10x10 CRBPO 


ing and pursuit-plunging (Harper et al. 1985). Usually alight 
near food and swim towards it with outstretched neck and 
partly open wings; if food sinks, try to retrieve it by partially 
upending like ungainly duck (Harper 1987). Usually hunt for 
cephalopods at night, in winds > 15-20 knots gliding low over 
surface and dropping on prey, occasionally maintaining 
height by pattering feet along surface. In still conditions, sit 
quietly with wings closed, snapping suddenly at prey that 
appears nearby. Have been recorded pursuit-plunging from 
about 2 m with folded wings, then swimming under water 
using wings for propulsion (Voisin 1981) or plunging from 5 


Fig. 4. 465 37E 10X10 SABRU 









Fig, 5, 54S 38W 10X10 BBL 


m to just beneath surface so that only wing-tips visible (Gib- 
son & Sefton 1959); immature twice seen dropping from 
about 3.5 m and completely disappearing underwater (Harper 
1987). Of 261 observations of feeding, 98.1% were surface- 
seizing, 1.1 surface-diving, 0.8 shallow-plunging, 78.9 during 
day, 21.1 at night; of natural prey not originating from obser- 
vation vessel, 93.3% of 119 observations were at night and 
only 6.7% during day (Harper 1987); have also been seen 
carrying wild-caught cephalopods at night off NSW (Barton 
1979). Cephalopods killed and eaten in 30.1 s (14.5-156; 11). 
Often feed in association with other seabirds (Jenkins 1979: 


Table 2, Cephalopods in the diet of the Wandering Albatross. 


Loligo 

Ancistrocheirus lesueuri 
unident. Enoploteuthidae 
Octopoteuthis rugosa 
Octopoteuthis 

Taningia 

Onychoteuthis banksi 
Moroteuthis ingens 
Moroteuthis knipovitchi 
Moroteuthis robsoni 
Kondakovia longimana 
Unident. Onychoteuthidae 
Cyeloteuthis sirventt 
Cycloteuthis 

Discoteuthis discus 
Discoteuthis 

Gonetus antarcticus 
Gonatus 

Unident. Gonatidae 
Psychroteuthis glacialis 
Psychroteuthts 
Lepidoteuthis grimaldii 
Pholidoteuthis boschmai 
Architeuthis 

Histioteuthis 
Alluroteuthis antarcticus 
Batoteuthis skolops 

Illex 

Todarodes 

Martialia hyadesi 
unident. Ommastrephidae 
Chiroteuthis imperator 
Chiroteuthis macrosoma 
Chiroteuthis picteti 
Chiroteuthis veranyt 
(Chiroteuthts 
Mastigoteuthis 

Taonius belone 

Tdaonius cympoctypus 
Taonius pavo 

Taonius 

Fgea inermis 
Teuthowenta pellucida 
Teuthowenta 
Megalocranchia maxima 
Megalocranchia 
Helicocranchia 
Galiteuthis armata 
Galiteuthis 
Mesonychoteuthis hamiltoni 
Unident. Cranchiidae 
Vampyroteuthis infernalis 
Argonauta nodosa 
Argonauta 

Alloposus mollis 
Unident. cephalopods 


Number of beaks 
Number samples 





(1) Near NZ (Imber & Russ 1975); (2) S. Georgia (Clarke et al. 1981); (3) (Rodhouse et al. 1987); (4) Imber & Berruti (1981). 
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Harper 1987), repelling all competitors (Weimerskirch et al. 
1986) except flocks of White-chinned Petrels Procellaria 
aequinoctialis, which distract Wanderering Albatrosses by 
dive-bombing, then stealing food (Harper 1987), and occa- 
sionally giant-petrels Macronectes spp (9% of 64 observations; 
Weimerskirch et al. 1986). Also follow schools of dolphins, 
feeding on remnants of their prey (Gibson & Sefton 1955) as 
well as Southern Right Whales Balaena glacialis, Pilot 
Whales Globiocephala and Southern Right Whale Dolphins 
Lissodelphis peronii (Enticott 1986). 

NON-BREEDING Diet of non-breeding birds 
poorly known. Off e. NSW, appear to concentrate on large 
cephalopods Sepia apama, which caught alive or scavenged 
(Gibson & Sefton 1955, 1959; Gibson 1967); two from Drake 
Passage contained many cephalopod beaks with some fish and 
Euphausia superba (Linkowski & Rembiszewski 1978). 

BREEDING At Bird L, S$. Georgia (79 samples; 
Croxall & Prince 1987; Prince & Morgan 1987) fish 41.5% 
wt., cephalopods 39.5, crustaceans 0.2 and higher proportion 
of penguin and seal carrion 18.8 than for any other albatross as 
well as many cephalopods that were probably too large to have 
been caught alive. 

At Iles Crozet, cephalopods predominated in 68% of 31 
regurgitated samples (actual freq. 71.0%, mostly Onychoteu- 
thidae; Weimerskirch et al. 1986) with rest mostly fish 32 
(35.5); carrion (3.2) was penguin feathers (Voisin 1969; Mou- 
ein 1970). Cephalopod diet described in Table 2. 

At Bird L., 8. Georgia (Rodhouse et al. 1987) cephalo- 
pods estimated to constitute 35% of diet by wt. Loligo 0.18- 
0.30 cm lower rostral length, Ancistrocheirus lesueuri 0.62- 
0.82, Octopoteuthis rugosa 0.95, Taningia danae 0.90-1.57, 
Onychoteuthis banksi 0.32-0.67, Moroteuthis ingens 0.90-1.32, 
Moroteuthis knipovitchi 0.52-1.53, Moroteuthis robsoni 0.76- 
0.85, Kondakovia longimana 0.23-2.00, Discoteuthis 0.41- 
0.78, Gonatus antarcticus 0.49-0.81, Psychroteuthis glactalts 
0.35-0.66, Lepidoteuthis grimaldii 0.91-1.58, Pholidoteuthis 
boschmai 0.27, Architeuthis 1.33, Histioteuthis atlantica 0.25- 
0.67, Histioteuthis eltaninae 0,25-0.45, H sp. 0.29-0.53, Allu- 
roteuthis antarcticus 0.40-0.60, Batoteuthis skolops 0.29-0.54, 
Iilex 0.33-0.82, Martialia hyadesi 0.35-0.88, Chiroteuthis im- 
perator 0.52-0.61, C. spp 0.34-0.82, Mastigoteuthis 0.48-0.79, 
Taonius pavo 0,70-1.05, Taonius 0.65-1.10, Egea inermis 0.51, 
Teuthowenia 0.70-0.90, Megalocranchia 0.82, Helicocranchia 
0.30-0.35, Galiteuthis glacialis 0.32-0.64, Galiteuthis 0.40- 
0.42, Mesonychoteuthis hamiltoni 0.59-1.00, Vampyroteuthis 
infernalis lower hood length 0.70, Octopodidae 0.48-1.14, 
Alloposus mollis 0.35-2.10. Differences noted between years: 
more Ommastrephidae and number of cephalopod species in 
winter of 1984 than 1983 and fewer Onychoteuthidae and 
Antarctic species, at time when krill populations in region 
declined. Some deep-water cephalopods may be caught only 
when they come to surface, spawn and die. 

Composition of fish diet at S$. Georgia by weight: Pseu- 
dochaenichtys georgianus 54.5% (length 377 mm [127; 225- 
634)]), Chaenocephalus aceratus 25.3% (469 [88; 319-604)), 
Muranolepsis microps 13.0% (237 [40; 241-389]), Notothenia 
gibberifrons 4.2% (322 [27; 294-358]), plus five other species. 
Fish taken were pelagic and benthic-demersal species charac- 
teristic of S. Georgian shelf; species taken by commercial 
fisheries poorly represented, which indicates that most fish 
probably taken naturally (Croxall et al. 1988b). 

Samples from near NZ (Imber & Russ 1975) also con- 
tained otoliths of fish Halargyreus johnsonti and Macruronus 
novaezelandiae. Near [les Crozet also seen feeding on jellyfish 


(Weimerskirch et al. 1986). 

INTAKE Chicks may consume up to 100 kg of food 
during rearing period (Rodhouse et al. 1987). At S. Georgia, 
size of meals 750 g and weight gain 56 g/day (Tickell 1968) at 
rate of 0.36 meals/day (Croxall & Prince 1987) or every 5-6 
days/adult (Croxall & Prince 1980). At Ile de la Possession, 
[les Crozet, gain 85 g/day (Weimerskirch et al. 1986) with 0.35 
meals/day (Mougin 1970). Between end of brooding and fledg- 
ing chicks receive a total of about 80 meals of 1-2 kg (0.11- 
2.11 kg) (Tickell 1968). 


SOCIAL ORGANIZATION At sea, usually solitary or 
in small groups (2-10 individuals) at food near vessels; occa- 
sionally up to 50 birds together (Dixon 1933; Gibson 1967). 
Gregarious on breeding grounds. Associate with other pro- 
cellariiforms at food and when following cetaceans (Griffiths 
1982. Enticott 1986). 

BONDS Monogamous, lifelong; no known divorces 
(Tickell 1968). At Macquarie I. and Iles Crozet, sex-ratio prob- 
ably imbalanced in favour of males, survival of males being 
higher than that of females (Tomkins 1985a; Weimerskirch & 
Jouventin 1987); at Iles Crozet, sex-ratio of non-breeding birds 
1 fernale:2.5 males (Weimerskirch & Jouventin 1987). Age at 
first return to colonies: Iles Crozet 7.8 years old (4-14; 187) 
(Weimerskirch & Jouventin 1987); 5. Georgia 3-11 years old, 
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Plate 13 


Wandering Albatross Diomedea exulans 
. Dorsal, Stage 5 

. Dorsal, Stage 6 

. Dorsal, Stage 7 

. Wentral, Stage 7 

. Adult head, Stages 5, 6 or 7 
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most 5-7 years old, females slightly earlier than males (Pick- 
ering 1989). At S. Georgia, interval between first return and 
first breeding 2-8 years, median value in females, 4 years 
(n=51), in males, 5 years (n=45); establishment of pair-bond 
over one to three seasons, most (75%; 28 pairs) over two sea- 
sons (Pickering 1989), Age at first breeding: Iles Crozet, 
females 11.2 years (2.0, 7-16; 64), males 12.1 years (9-16; 24) 
(Weimerskirch & Jouventin 1987); 5. Georgia minimum 7 
years, av. females 10.6 years, males 11.0 years (Croxall 1982). 
Pair-bonds of breeders re-established during pre-laying 
period. During first season ashore, pre-breeders present at 
colonies early Feb.-early Mar. (median values); after five sea- 
sons, arrival synchronized with that of breeders; departure 
late Mar. (Pickering 1989), Nesting, incubation and chick- 
rearing by both sexes although share of male slightly higher. 
Chicks independent after fledging. 

BREEDING DISPERSION NESTING. Colonial 
when breeding. Nest density: Ile de la Possession, Iles Crozet, 
35-101 nests/km? (Weimerskirch & Jouventin 1987): 
Georgia, 40-106 nests/ha in most favourable areas (Tickell 
1968); Antipodes 1, 26 nests/10000 m* (Warham & Bell 
1979). Distance between nests, three to hundreds of metres; at 
Iles Kerguelen, very dispersed, minimum distance 50 m 
(Milon & Jouanin 1953). TERRITORIES. Limited to area close 
round nest; 1 m* to radius of 2 m oe 1968; Voisin 1969). 
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Plate 14 


Wandering Albatross Diomedea exulans 
1. Juvenile, ventral, Stage 1 

Z. Juvenile, dorsal, Stage 1 

3. Ventral, Stage 3 

4. Dorsal, Stage 3 

5. Dorsal, Stage 4 
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Territorial only when breeding. 

ROOSTING Solitary when roosting on land (Voi- 
sin 1969). Unknown at sea. 


SOCIAL BEHAVIOUR Descriptions by Matthews 
(1929) and Murphy, partly re-interpreted by Richdale (1950) 
and by van Zinderen Bakker (1971); functions of courtship 
displays not analysed. Displays conspicuous, diurnal and easy 
to observe at close range. Royal Albatross D. epomophora has 
most complex sexual behaviour of all albatrosses; agonistic 
and sexual displays of Wandering similar to Royal, except that 
Head-shaking not described for Wandering and Bill-vibrating 
not described for Royal. Individual distance usually not less 
than 2-3 m, except round food or between individuals of 
mated pairs. 

AGONISTIC BEHAVIOUR THREAT. Displays as 
Royal Albatross: Snapping, which becomes Clappering with 
increased aggression. When threatened, unguarded chick sits 
back in nest, neck stretched up, facing intruder; as threat 
increases, bill snapped repeatedly with gobbling noise and, 
ultimately, vomiting of oil or stomach contents. APPEASE- 
MENT. No appeasement or submissive displays described. 

SEXUAL BEHAVIOUR COURTSHIP. Displays 
similar to those of Royal Albatross and include Aerial Activ- 
ity, Sky-call, Yapping (=Bowing; Matthews 1929), Gawky 
Look, Billing, Scapular Action, Wing-stretching and 
Mutual Preening. Also occasional AERIAL DISPLAY: in flight, 
wings stiffened, downturned, while head swung from side to 
side (Warham & Bell 1979). Ecstatic Ritual. Apparently dif- 
fers from displays of Royal Albatross by performing Bill- 
vibrating and Whine. Bill-vibrating (=Braying; van Zinderen 
Bakker 1971): standing bird first bends neck forwards, bill 
held below horizontal, mandibles slightly apart and begins to 
vibrate mandibles rapidly against each other while bill slowly 
raised to vertical; clappering ended with single snap as bill 
reaches highest position; performed by both sexes; reported 
to follow Billing and precede Wing-stretching. Whine. Not 
described as being accompanied by Head-shaking as in Royal 
Albatross. Head thrown back so that bill points vertically, bird 
emitting powerful expiratory braying whistle followed by in- 
haled sigh; usually performed in conjunction with Wing- 
stretching; climax of Ecstatic Ritual. Whine usually followed 
by Display Meeting (van Zinderen Bakker 1971): bird places 
bill in breast feathers while producing loud bill-clappering 
sound. Parties, as in Royal Albatross. GREETING Apparently 
similar to mutual Gawky Looks performed by birds of op- 
posite sex when meeting (van Zinderen Bakker 1971). At nest, 
Yapping observed at nest-relief during incubation (Murphy). 
Once pair formed, displays become less complex with much 
time spent Yapping and Mutual Preening (van Zinderen 
Bakker 1971). COPULATION, occurs 1-26 days before laying 
(Tomkins 1983). 

RELATIONS WITHIN FAMILY GROUP Nest- 
building, incubation and chick-raising by both parents, with 
male taking slightly greater share. 


VOICE Not well known; no detailed study. Described by 
Matthews (1929), Murphy and van Zinderen Bakker (1971). 
Noisy at colonies during pre-laying period (re-establishment 
of pair-bond by breeders), incubation and chick-guarding per- 
iod (parties of pre-breeders establishing pair-bond). Sexual 
differences reported only for Croak where, in duets, croaks of 
males lower than those of females (Warham & Fitzsimons 
1987). No individual differences or geographical variation re- 
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ported. Calls similar to those of Royal Albatross, but Croak (= 
Yapping) apparently harsher and less musical. Numerous 
non-vocal sounds made by vibrating, snapping or clopping 
mandibles together. 

ADULT Croak (Warham & Fitzsimons 1987): 
harsh groaning sound uttered repeatedly during Yapping dis- 
play, often in duet. Whine: powerful expiratory braying 
whistle or shriek (van Zinderen Bakker 1971) (sonagram A; 
with preceeding inhalation), emitted at climax of Ecstatic Ri- 
tual. Also Deep Gurgling, audible at close range, emitted 
during Sky-call (van Zinderen Bakker 1971), and a guttural 
vocal sound uttered with each bill-clap during agonistic Clap- 
pering. NON-VOCAL SOUNDS: Bill-snaps: mandibles snapped 
together during agonistic Snapping; by adults and chicks. Bill- 
clappering: loud rapid clappering given a few times during 
agonistic Clappering; also given during Display Preening. Bill- 
clicks caused by clashing of bills of pair during Billing. Rat- 
tling: peculiar rattling sound with slight musical ring 
(Matthews 1929), also described as rubbery burring, rising 
from low to high note with increasing rapidity of vibration, 
filled lungs acting as sound-box, whole terminated by single 
snap; given during Bill-vibration display. 

YOUNG Newly hatched chick emits soft chirping 
when hungry; older chick, gobbling cry when begging for 
food (Matthews 1929), 


BREEDING Detailed studies at S. Georgia (Tickell 
1968) and Iles Crozet (Voisin 1969; Mougin 1970; Fressanges 
du Bost & Segonzac 1976). Information supplied by J-C. 
Stahl. Colonial breeder on vegetated exposed flats on gentle 
slopes on subantarctic islands and 5. Georgia. 

SEASON ARRIVAL AT COLONIES: 5. Georgia, first 
arrival 9 Nov., mean date: males 29 Nov. +2 days, females 10 
Dec. + 9 days; Iles Crozet, first arrival 19 Nov. (14-23 Nov.; for 
7 years), males arrive first; peak of arrival late Nov.-early Dec; 
(Mougin 1970); Iles Kerguelen, 17 Nov.-5 Jan. (Paulian 1953); 
Marion L., first arrival 22 Nov. (van Zinderen Bakker 1971); 
Auckland Is, first arrival 10 Dec. (Bailey & Sorensen 1962). 
PRE-LAYING PERIOD: Iles Crozet, 26 days (15-44); S. Georgia, 
26.6 days (8-45; 8). At 3. Georgia, male present 73.6% of days, 
female 7.4%. Pre-laying exodus noted at Marion, but period 
undetermined (van Zinderen Bakker 1971). At S. Georgia, 
females return 0-4 days before laying after absence of 5-18 
days. Copulations 1-26 days before laying; fertilization prob- 
ably 1-5 days before laying (Tomkins 1983), DEPARTURE 
FROM COLONIES: Fledgelings: 5. Georgia, mean date 10 Dec. 
(17 Nov-18 Jan.; 63). Iles Crozet, first departure 25 Nov. (20- 
30 Nov.), last 19 Jan. (9 Jan.-2 Feb.) over seven seasons; average 
date 14 Dec. Marion I., most mid-Dec. to mid-Jan. (van Zin- 
deren Bakker 1971). Iles Kerguelen, first departure 21 Nov. 
(Paulian 1953). Campbell I., late Dec.-mid-Feb. (Bailey & Sor- 
ensen 1962). Auckland Is, 22 Jan.-5 Feb.+ (Bailey & Sorensen 


1962). Antipodes Is, end Jan.-12 Mar.+ (Warham & Bell 
1979). Pre-breeders: Mar.-Apr. in most localities. At S. 
Georgia, pre-breeders at colonies early Feb.-early Mar. during 
first season ashore; arrival date progressively earlier during 
following seasons, synchronized with that of breeders by sixth 
season; stay ashore progressively longer, median 3 days first 
season; median 34 days (males), 24 days (females) during sixth 
season; median departure date, sixth season ashore, 20 Mar. 
(Pickering 1989). 





(S. Georgia) 


SITE Selection of nest-site restricted by availability 
of vegetation for building material and of nearby open ex- 
posed site for taking-off (Tickell 1968). Nests established 
among short vegetation (among tussocks at Antipodes; War- 
ham & Bell 1979) on coastal plains, swampy valley floors, 
crests of broad ridges or gentle slopes; usually 0-300 m asl but 
above 300 m at Gough I. to avoid shrubby vegetation of 
lowlands (Swales 1965), At Iles Crozet, same nest used in suc- 
cessive season by 23.3% (n=176) of breeders, 37.9% (n=66) of 
failed breeders; only 2.5% (n=810) of breeders have changed 
colony in 15 years (Weimerskirch & Jouventin 1987}, with 
average displacements of 3,2 km (0.6; 0.6-14.0; Fressanges du 
Bost & Segonzac 1976). At S. Georgia, 20% of pairs used same 
nest in successive season; rest moved by 7 m on average (max- 
imum 23 m; 150 m if remating). Slight association and com- 
petition with giant-petrels, mostly Southern Giant-Petrels M. 
giganteus, at Iles Crozet and 5. Georgia (Voisin 1968; Hunter 
1984). Nest-site selected by male, who arrives first on col- 
Ones. 

NEST, MATERIALS Nest, a truncated cone, 
usually surrounded by circular patch almost bare of veg- 
etation. Material: grass, Acaena twigs, moss, roots and soil. 
MEASUREMENTS: diameter of base 70-100 cm; diameter at top 
40-50 em; height 20-50 cm. Building started by male soon 
after arrival, but female builds actively 1-2 days before laying; 
material dropped over shoulders (Tickell 1968; Voisin 1969; 
van Zinderen Bakker 1971). Nest maintained by addition of 
material during incubation and by chick; new nest besides 
natal one sometimes built by chick (Voisin 1969), 

EGGS Sub-elliptical to oval; rough-shelled, mat; 
white with diffuse ring of red-brown dots round broad end 
(Tickell 1968). 

MEASUREMENTS. 

Gough I. 127.0 (117-130.8; 87) x 77.0 (71.6-82.3) (Ver- 
rill 1895) 

Marion I. 133.4 (123.5-142.5; 47) x 81.0 (75.5 x 85.5; 47) 
(Rand 1954) 


Tles Crozet 
Iles Kerguelen 
Macquarie I. 


133.2 (130.7-136.0; 3) x 80.9 (80.4-81.4) 
136.2 (122.4-140.0, 6) x 80.1 (76.4 x 83.4) 
129.1 (5.9; 121.3-139.0, 16) x 81.5 (2.8; 78.1- 
87.2) (Tomkins 1984b). 

125.1 (5.1) x 78.1 (2.7) (Warham & Bell 
1979) 


Antipodes Is 


Auckland Is 126.9 (120.8-134.4; 12) x 77.3 (73.0-81.7) 
(Robertson 1975). 

WEIGHTS. 

Marion I. 484 (437-551; 9) (Rand 1954) 

Macquarie I. 476.7 (46.9; 415-520; 4) (Tomkins 1984b) 

S. Georgia Median, 448.5 


CLUTCH-SIZE One; one record of two abnormally 
small eggs (Tickell 1968). At Iles Crozet, no successful breed- 
ers lay in next season after success; 89% lay again in second 
season. Of failed breeders, 84% lay again in next season after 
failure and 10% not until second season (Jouventin & Wei- 
merskirch 1988). At S. Georgia, 73% of birds that fail lay again 
next season but, if loss occurs after June, do not do so till 
second season (Tickell 1968). Development of gonads blocked 
throughout period of rearing chick (Hector et al. 1986). 

LAYING 5. Georgia, 24 Dec. (6 days; 10 Dec.-17 
Jan.; 3 years, 260 eggs), no significant difference between years 
(Tickell 1968). Iles Crozet, starts 20 Dec. (17-24 Dec.), ends 19 
Jan. (7-25 Jan.) for seven years. Iles Kerguelen, 10 Dec.-15 Jan. 
(Paulian 1953; Thomas 1983). Marion I., 22 Dec.-12 Jan., peak 
1-6 Jan. (van Zinderen Bakker 1971). Antipodes Is, start mid- 
Jan. (Warham & Bell 1979). Auckland Is 42% of eggs laid by 22 
Jan. (Bailey & Sorensen 1962). Campbell L., starts 13 Feb. 
(Bailey & Sorensen 1962). Most eggs (70%; n=20) laid during 
daytime (Tickell 1968). In successive seasons only 4.2 days 
difference in laying dates (9.0; 0-20; 50), irrespective of change 
of mate; seems to indicate genetic control of laying date (Tick- 
ell 1968). 

INCUBATION By both sexes in alternate shifts. At 
Iles Crozet, average of 7.4 shifts (4-10); shortest shift, first by 
female, 2.4 days; longest, third by female 13.8 days; share of 
male, 52.4%. At S. Georgia, 9.7 shifts (1.8; 5-14); shortest 
shift, frst by female, 3-6 days; longest, fourth by male, 11.5 
days; share of male 57.7%. INCUBATION PERIOD. Gough L, c. 
68 days (Swales 1965); Marion I., 78.6 days (75-81; 12) (van 
Zinderen Bakker 1971); Iles Crozet 79 (77-83); S. Georgia, 
78.4 (1.1; 75-82; 163). 

NESTLING Semi-altricial, nidicolous. Hatching 
lasts 3.2 days (1-5) (Tickell 1968). Down entirely white at 
hatching (Voisin 1969). Brooded and guarded by both par- 
ents. BROOD-STAGE: Marion I. 29 days (24-33; 5) (La Grange 
1962); Iles Crozet 31 days (24-43; 11); brooding shifts, male 4.0 
days, female 4.2 days; 5. Georgia 32 days (21-43; 74), c. 11 
brooding shifts of 2.6 days (1-10; 735). At Iles Crozet, guard- 
stage of up to 6 days (Voisin 1969). Chicks fed by both parents: 
at S. Georgia, 53% (n=118) of feeding by male. Not fed first 24 
h, only small amounts of food given for next 3 days. Feeding 
usually lasts 10-30 min, but parent may stay for up to 2 days 
(Voisin 1969). Meal consists of 5-7 regurgitations; chick in- 
serts mandibles across those of parent. Most (94%) of feeding 
during daytime, 61% during morning hours (n=166) (Tickell 
1968). NESTLING PERIOD. At Iles Crozet, 271 days (258-288; 
31); S. Georgia 277.7 (16.7; 263-303; 35). 

GROWTH At Iles Crozet, weight at hatching c. 400 
g, maximum weight 13 100 g (137% ad. weight) after c. 210 
days, weight at fledging 9980 g (104% ad. weight; 12 chicks in 
two seasons) (Mougin 1970). At S. Georgia, weight at hatching 
352 2 (303-410; 6); max. weight 11 700 g (135% ad. weight), 
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after 221-230 days, weight at fledging 8930 g (103% ad. 
weight). Culmen and wing almost full grown at fledging (Tick- 
ell 1968. Voisin 1969). 

FLEDGING TO MATURITY Independent at 
fledging. Age at first retura to colonies: Iles Crozet ay. 7.8 
years (4-14; 87); S. Georgia, 3-11 years, most 5-7 years, males 
at slightly older age than females (Pickering 1989). Age at first 
breeding: Iles Crozet, females 11.2 years (2.0; 7-16; 64), males 
12.1 years (9-16; 24) (Weimerskirch & Jouventin 1987); mean 
age of first breeding has decreased from 13 to 10.5 years since 
1975 following population decrease; minimum age of first 
breeding has remained unchanged Jouventin & Weimers- 
kirch 1985); S. Georgia, minimum 7 years, average females 
10.6 years, males 11.0 years (Croxall 1982). Males physiolog- 
ically mature at 5 years old; females at 4-7 years; gonadal 
maturation blocked by high progesterone levels, thus female 
condition decisive in determining when a pair first attempts to 
breed (Hector et al. 1986). 

SUCCESS At Iles Crozet, hatching 66.9% (10.3; 
54.2-79.3; 166 eggs, three seasons), fledging 89.4% (5.0; 84.2- 
96.9: 188 chicks, five seasons), total success 64% (13; 25-77; 15 
seasons) (Weimerskirch & Jouventin 1987); 0.37 chicks pro- 
duced per year per female (Weimerskirch et al. 1987). At 
Marion I., hatching 50-59% (44 eggs, two seasons), fledging 
38.5% (13 chicks, one season), total success 22.7% (22 eggs, 
one season) (van Zinderen Bakker 1971). Macquarie I., hatch- 
ing 61.4% (25.9; 14-89, 57 eggs, five seasons), fledging 85.7% 
(39.7; 0-100; 35 chicks, five seasons), total success, 52.6% 
(26.7; 0-78; 57 eggs, five seasons) (Tomkins 1985a). At 8S. 
Georgia, hatching 72.2% (385 eggs, one season), fledging 
81.3% (278 chicks, one season), total success 64.8% (6.5; 1185- 
1523 eggs, ten seasons) (Croxall et al. 1988a). ANNUAL SURVL 
VAL: Iles Crozet: age 0-5 years, 71.5%; 5-11 years, 91.8%; over 
11 years, 92.2% (males 93.8%, females 89.9%) (Weimerskirch 
& Jouventin 1987). S. Georgia: adult average 96% (Croxall 
1982), males 94.6%, females 96.9% (Tickell 1968), Macquarie 
L.: males 85.4%, fernales 73.2 %o (Tomkins 1985a). CONTROL- 
LING FACTORS: egg loss mainly in first 10 days of incubation, 
mostly from trampling by incubating birds, delayed return of 
partner, infertility (van Zinderen Bakker 1971; Fressanges du 
Bost & Segonzac 1976). Loss of chicks c. 50% in first 10 days; 
mainly by starvation, predation by skuas, giant-petrels, rarely 
by rats at Ile de la Possession, [les Crozet, bad weather (Fres- 
sanges du Bost & Segonzac 1976). Recent population decline 
at Iles Crozet mainly because fewer adults have survived; 
90.5% during period of rapid decline 1968-76 (6.2% /year), 
94.4% during more gradual decline 1977-86 (2.5% /year); sur- 
vival of adults apparently affected during breeding cycle, as 
breeding success strongly correlated with adult survival; lower 
survival of females attributed to their more northern foraging 
area where more exposed to long-line fishing operations (Wei- 
merskirch & Jouventin 1987). At Macquarie I., decline attri- 
buted to reduced survival of adults and possibly immatures; 
adult survival dropped from 94.6% in 1965-68 (Carrick & 
Ingham 1970) to 78.7% in 1974-78 (Tomkins 1985b), while 
breeding success remained similar (Tomkins 1985b). 


PLUMAGES Except for juveniles and downy young, no 
two Waridering Albatrosses look alike. Starting with mostly 
dark plumage, birds become whiter over a long period. Rate of 
whitening not known precisely, nor whether whitening is 
continuous, interrupted or briefly reversed. Degree of white- 
ness of breeding adults varies individually, geographically and 
sexually (Tickell 1968; Tomkins 1984b). Not known whether 
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whitening halts at a given age, or whether rate of whitening 
geopraphically and sexually varies. The descriptions below are 
civen under stages suggested by Harrison (1979, 1985, 1987) 
for guidance in the field; Stage 1 is darkest and probably 
youngest; Stage 7, palest and probably oldest. These are not 
distinct plumages defined by different generations of feathers, 
and many intermediates occur. The ‘Gibson Code’ (GC), a 
more detailed classification for use in the hand, was developed 
by Gibson (1967) and is also given below. His scheme has been 
modified and expanded by Jouventin et al. (1989). 

D.e, chionoptera., 

ADULT Stage 7. Probably only old males. At 5. 
Georgia, 43% of breeding males, and no breeding females, 
showed this condition (Tickell 1968). Recorded on a non- 
breeding male at least 16 years old (Tomkins 1984b); age of 
attainment unknown but probably after age of first breeding, 
which has been given as c. 10 years (Tickell 1968) and 9-13 
years (extremes 7-17) (Hector et al. 1986; Pickering 1989). 
HEAD AND NECK (GC=6), white; hindneck feathers have trace 
of light-grey (85) vermiculation at tips; only visible at close 
range; can be lost with wear and may be totally absent. Vary- 
ing pink (-) to orange-pink (c5} stain at the edges of feathers 
just behind ears; often absent or restricted to one side of head; 
when most extensive, stain extends round back of neck, form- 
ing narrow to broad connecting band. Large stains sometimes 
partially or completely brown (-). At 5. Georgia, stain tends to 
be larger when feeding young than when incubating or brood- 
ing. Stain often wet, although underlying skin usually dry 
(Tickell 1968); often lost shortly after death or after feathers 
are plucked (e.g. Gibson & Sefton 1959), but feathers from 
Antipodes I. showed pink stain 8 years after plucking (War- 
ham & Bell 1979), and stains retained in some skins (Murphy). 
It apparently occurs in all Stages except downy young and 
perhaps juveniles, both at breeding grounds and at sea. UPPER- 
PARTS (GC=6), white. Feathers of mantle and back can have 
trace of light-prey (85) vermiculation at tips. TAIL (GC=4, oc- 
casionally 3}, white; feathers have cream (54) shafts. UP- 
PERWING (GC=5), looks mostly white, with mottled transition 
to broad dark tip and narrow dark trailing-edge. At, and in- 
side, elbow, all coverts white. Other upper wing-coverts and 
secondaries white with inconspicuous dark-grey (83) vermi- 
culations; outer quarter of median coverts and lower lesser 
coverts, alula and all greater coverts have large grey-black (c82) 
spots at tip of outer web causing a rather mottled transition to 
dark trailing-edge. In alula, primary coverts, outermost sec- 
ondary coverts and most secondaries, grey-black (c82) spot, 
large, extending halfway down outer web, and encroaching 
onto tip of inner web; in these feathers, only grey-black areas 
are usually exposed. [Innermost secondaries and inner sec- 
ondary. coverts have smaller grey-black (82) spots at tip of 
outer web. Visible parts of primaries, grey-black (c82) with 
cream (c54) shafts that grade to dark brown (c23) tips; all pri- 
maries have concealed white bases to inner web. In palest 
birds, this narrows to sharply defined, concealed white wedge 
that almost meets tip of inner web. Grey-black (82) areas of 
remiges fade to black-brown (c119) with wear. UNDERPARTS, 
white. UNDERWING. Remiges as above; only grey-black (c82) 
areas are exposed when wing spread; reflect grey (84) in some 
direct light. Under wing-coverts, white. 

DOWNY YOUNG Protoptile, white on head and 
neck, tinged grey (-) on body; described as pure white and silky 
(Murphy), and as dirty white (Watson 1975). Mesoptile, long 
and woolly; grey (Murphy) or brownish grey (Matthews 1951), 
fading to grey-white in older birds. 


STAGE 1 All juveniles and probably no other ages, 
have this plumage; unknown when lost. HEAD AND NECK 
(GC=1). Chin to mid-throat, forehead and face, white. Rest, 
dark brown (121); all feathers have concealed white bases, 
narrowest on crown and nape; those of nape have fine white 
tips. UPPERPARTS (GC=1), dark brown (121); feathers of 
mantle and upper back, and shorter scapulars, have narrow 
open pennaceous brown (119B) tips, perhaps becoming larger 
with wear. TAIL (GC=1), black-brown {c19). UPPERWING 
(GC=1). Coverts, black-brown (119); marginals have narrow 
white fringes. Remiges look grey-black (82) when fresh, be- 
come black-brown (119) with wear. Shafts, cream (54), grading 
to brown (23) tips; in flight, pale primary shafts contrast 
strongly with rest of dark upperwing. UNDERPARTS, dark 
greyish-brown (cl21); feathers have white bases, smaller on 
upper breast, occasionally exposed in centre of lower breast 
and belly but concealed elsewhere. Axillaries, white; upper- 
most sometimes have dark greyish-brown (cl21) upper web. 
UNDERWING. Most coverts, white; on leading-edge between 
carpal joint and p10 have grey-black (82) outer web. Marginal 
coverts and lesser subhumerals at base of wing, sometimes 
dark greyish-brown (c121), giving appearance of dark pre- 
axillary notch. Remiges, grey-black (82), reflecting grey (84) in 
some direct light. 

STAGE 2 Does not occur on breeding birds at 5. 
Georgia (Tickell 1968). At Macquarie I., not yet found on 
breeding birds, but recorded in 4-, 6- and 14-year-old females 
(Tomkins 1984b), Age of attainment, unknown; probably at 
1-2 years because brought about by wear and replacement of 
juvenile plumage. HEAD AND NECK (GC=1-2), Pattern at first 
similar to Stage 1, but lower neck and hindneck later develop 
white mottling, causing brown appearance that does not con- 
trast with white face so strongly as in Stage 1, Feathers of 
crown and nape remain dark brown (121), giving dark-capped 
appearance to older birds. In the one skin examined, change 
in appearance was caused by wear, dark tips of feathers of neck 
having worn away to expose some white bases. UPPERPARTS 
(GC=1), beginning to obtain some white mottling, partly 
because wear exposes some white bases of feathers, especially 
on rump, and then because white feathers with heavy grey (84) 
or brown-erey (grey 121) vermiculations at tip appear to do so 
last on rump. TAIL, as Stage 1. UPPERWING (GC=?). Traces of 
white mottling appear in centre, most commonly at elbow; in 
the skin examined, caused by slightly exposed white bases to 
median and secondary coverts. Belly begins to whiten before 
upperwing. No white tips to feathers, contra Harrison (1985). 
UNDERPARTS, mostly off-white to white; younger birds often 
have dark-brown (121) under tail-coverts, flanks and band 
across upper breast, formed by remnant juvenile feathers. 
Dark tips of juvenile feathers on rest of underparts wear 
faster, last traces of tips causing off-white tinge. Emergent 
feathers, white; those on flanks and upper breast have dark- 
brown (cl21) or grey (84) vermiculations near tip. UN- 
DERWING, as Stage |. 

STAGE 3 At S. Georgia, not recorded on breeding 
males, on about 9% of breeding females; also recorded on two 
5-year-old and four 6-year-old pre-breeding females (Tickell 
1968). At Macquarie L., recorded on 8-year-old non-breeding 
male (not recorded in breeding birds), a 14-year-old female 
non-breeder and a female breeder at least 13 years old (Tom- 
kins 1984b). HEAD AND NECK (GC=2-3), mostly white, with 
dark-brown (121) cap on hindcrown and nape; feathers of 
crown, hindmost ear-coverts, and lower neck have dark- 
brown (119A) vermiculations at tip. UPPERPARTS (GC=3-4), 


white, heavily vermiculated dark brown (121) to grey (84) at 
tips of feathers; vermiculations tend to be fewest on mantle, 
rump and upper tail-coverts. TAIL (GC=2). Feathers, black- 
brown (c19) with white bases, largest on inner webs for about 
half length. When closed, tail looks white with broad dark 
sides and tip. UPPERWING (GC=2). Distinct but small white 
patch at elbow formed by white bases to secondary and 
median coverts. Elsewhere, white bases to secondary and 
lesser coverts are smaller. UNDERPARTS, white; upper breast 
and flanks vermiculated dark brown (c121) at tips of feathers; 
very slight vermiculations at tips of feathers on thighs and 
under tail-coverts. UNDERWING, as Stage L. 

Saas = At 5. Georgia, recorded on about 8% of 
breeding males, and 69% of breeding females. Three 5-year- 
old pre-breeding males were in this condition (Tickell 1968); 
at Macquarie I, found in 15- and 16-year-old breeding males, 
and on two 14-year-old non-breeding males; recorded in five 
of 13 females, including a 16-year-old non-breeder and four 
breeding birds (Tomkins 1984b). HEAD AND NECK (GC=3-4), 
generally whiter than Stage 3; dark-capped appearance lost, 
although feathers of crown and nape tend to have larger dark 
vermiculation at tips than rest of head and neck. UPPERPARTS 
(GC=4), white, with dark-brown (121) to grey (84) vermicu- 
lations at tips of feathers, smaller than in Stage 3. TAIL(GC=2- 
3), usually looks white, with dark sides and tip narrower than 
at Stage 3. Some feathers in centre can be wholly white, and 
outer feathers may have more white at base, especially on 
inner web. Order in which white feathers appear varies, and 
tail can have streaked appearance by alternating white and 
dark feathers. UPPERWING (GC=3). Distinct white streak, 
broadest at elbow, extending about halfway to carpal joint, 
and halfway to body; formed by white bases to secondary and 
median coverts, largest on inner webs. Inner marginal coverts 
have smaller white bases, giving appearance of slight white 
mottling on inner leading edge. UNDERPARTS, white, with 
dark vermiculations at tips of feathers of upper breast and 
flanks. UNDERWING, as Stage 1, but no dark pre-axillary 
notch. 

STAGE 5 At S. Georgia, occurs on about 15% of 
breeding males, and 20% of breeding females (Tickell 1968). 
At Macquarie I., recorded on five of 17 males, including one 
19-year-old, and in a bird at least 2 years old; not recorded in 
13 females (Tomkins 1984b). HEAD AND NECK (GC=5-6), 
white, with narrow dark vermiculations on tips of feathers of 
neck. UPPERPARTS (GC=5), white; dark vermiculation at tips 
of feathers, indistinct; can be lost with wear. TAIL (GC=3)}, 
mostly white; outer feathers have grey-black (82) to black- 
brown (c119) markings on outer webs. UPPERWING (GC=4-5}) 
White elbow-patch larger than in Stage 4, forming broad 
white wedge, widest at junction with body, which runs down 
centre of wing, petering out near carpal joint. Broad dark line 
separates white wedge from narrow whitish leading-edge; rest 
of wing, grey-black (82), becoming black-brown (c19) with 
wear. White wedge formed by white humeral coverts, white 
secondary and median coverts at elbow, and by white inner 
webs and large white bases to outer webs of secondary coverts, 
median coverts and lower rows of lesser coverts. Except at 
elbow, white areas usually vermiculated grey (c84) at tips. 
Grey-black (c82) spot at tips of these coverts is larger in out- 
ermost feathers and in those closer to leading-edge; near 
carpal joint, and in upper rows of lesser coverts, these dark 
spots are the only exposed parts of the feathers, forming dark 
leading-edge to white wedge. Marginal coverts, white, with 
varying narrow grey-black (c82) tips, forming pale leading- 
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edge. UNDERPARTS, white. UNDERWING, as Stage 4. 

STAGE 6 At S. Georgia, occurs on about 23% of 
breeding males and less than 3% of breeding females (Tickell 
1968). Recorded in 37% of Macquarie I. males, including two 
birds at least 21 years old; not recorded in Macquarie I. females 
(Tomkins 1984b). HEAD AND NECK, UPPERPARTS, UNDER- 
PARTS, as Stage 7. TAIL (GC=3, occasionally 4), white; usually 
some outer tail-feathers have grey-black (c82) markings near 
tip of outer web that give appearance of dark sides to tail. 
UPPERWING (GC=5). Dark spots at tips of outer webs of lesser 
coverts smaller than in Stage 5; almost absent in some, so 
white wedge looks continuous with white leading-edge. 
White area does not extend beyond carpal joint. UNDERWING, 
as Stage 4. 


BARE PARTS Except where stated, based on photo- 
eraphs in Lindsey (1986), NZRD, and unpublished (D.W. 
Eades). 

D.e. chionoptera and D.e. exulans. 

ADULT Plumage Stages 2-7. Iris, black-brown 
(119) to dark brown (22). Eye-ring white, sometimes with pale- 
blue (-) or pale-green (-) tinge. Pink (108D), scarlet (Falla 1937) 
and lacquer red (Murphy) also reported; these colours prob- 
ably associated with bill-flush described below, but they have 
also been attributed to blood infusion following injury or 
death (Falla 1937). In 15 photographs examined, on birds at 
Macquarie I. in Jan. or Feb., on others in Aust. waters in win- 
ter, and some at breeding grounds without dates: bill, mostly 
light pink (pale 7) to light orange-pink (light 5), sometimes 
with deeper pink (7-108D) culminicorn, nares and base of 
ramicorn. Ungues usually cream (92), occasionally pink- 
white, sometimes with green-blue (65) to light grey (85) base to 
maxillary unguis. These photographs assumed not to be of 
adults in early breeding season, when Murphy described bill 
as yellow-buff and Matthews (1951) described bills as grading 
from pink at base to yellow at tip. Tomkins (1983) regarded 
whitish-bone as normal colour at Macquarie I. between late 
Nov. and early Mar.; noted that bill of many breeding birds 
rapidly became pink or deep purple when handled, returning 
to original colour after an hour or two. This bill-flush may 
have been more common after laying; 19 non-breeders and 
three fledgelings did not flush. Photographed birds described 
above assumed to show various stages of flush. Flushing ot bill 
at Macquarie I. may occur for only short period; for, at 8. 
Georgia, bill is salmon-pink (Murphy) or rose-pink (Matthews 
1951) in winter when large chicks are being fed, and at Anti- 
podes I. some adults have bright pink bills throughout incu- 
bation and nestling periods. Flushing not given by non- 
breeders in Antipodes I. (Warham & Bell 1979; Tomkins 
1983). Flushing appears to be caused by increased flow of 
blood below ramphotheca; perhaps breeding condition, stress 
and heat shedding may bring it on (Warham & Bell 1979; 
Tomkins 1983). Pink colour is lost shortly after death. Light 
brown (-}) lower tomia have been seen on a bird from Anti- 
podes I. (C.J.R. Robertson), an inconspicuous marking that 
has not been looked for in other populations. Tarsus and toes 
vary from pale horn (92) or pale brownish-grey (c85), at least 
sometimes with pinkish (-) spot at heel, to pink-white or pink 
(3). In dark footed birds, front of tarsus and tops of toes often 
paler. Webs, pale brown-grey (c85) or pale blue-grey (88) to 
pink (-). Pink coloration of feet apparently associated with 
flush (Murphy; Tomkins 1983; Warham & Bell 1979). Soles, 
grey (84) to pinkish (-). Claws, pale horn (92) to pink-white or 
white. 
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DOWNY YOUNG In protoptile: iris, dark grey 
(Rand 1954); bill, pinkish white with cream tip, at least some- 
times with green-blue (65) base to maxillary unguis; feet, 
brown (Murphy) or bluish grey with white claws (Rand 1954). 
Feet later become whitish or flesh with darker webs (Murphy); 
large chicks have iris and bill colour similar to juvenile. 
JUVENILE Similar to adults, but much pink flush- 
ing of bill, eye-ring and legs not reported. However, several 
juveniles photographed at breeding grounds or at sea had 
mostly pink-horn to light-pink (pale 7) bill contrasting with 
pale horn (c92) unguis. 


MOULTS Based on skins of 34 birds, including nine 
breeding adults from Auckland and Antipodes Is, five juven- 
iles and 20 birds at plumage Stages 2-7 from Vic. and NZ 
waters, mostly beachcast (MV; NMNZ). 

ADULT Stages 3-7. Remiges replaced in staffel- 
mauser, interrupted, at least by breeding birds, during breed- 
ing season. Non-moulting birds have a jumble of primaries of 
different ages, at least three generations of feathers being rec- 
ognizable in primaries of some. Fifteen wings from 12 skins, 
including ten beacheasts, have been examined with active pri- 
mary moult; mean number of growing primaries = 2.0 (15; 
1-4. 1.0). As with primary wear, often asymmetry in number 
of growing primaries in each wing, the most extreme case 
being two beachcast D.e. exulans with three growing pri- 
maries in left wing, one in right. Of nine wings examined with 
more than one growing primary, all had 2-3 adjacent pri- 
maties in growth, and only two had two active moult-centres. 
In most of these, waves of moult were moving outwards, but 
in two birds, both with moult formula O7172!N!0?, waves of 
moult may have been moving inwards. Sequence of moult of 
secondaries, unknown; usually occurs at same time as pri- 
maries; may be up to four moult-centres in secondaries of one 
wing (D,W. Eades photograph). Tail-moult generally occurs at 
same time as primary moult; waves begin centrifugally from 
two points, usually t2 and t5, Body-moult has only been 
checked on five non-juveniles, all of which had heavy body- 
moult; two were adults with chicks, collected from Antipodes 
Is. in late Nov.; others collected at sea or beachcast between 
May and Aug, 

Adults moulting near nest at Marion I. in late Mar. (Rand 
1954). Of nine adults collected on breeding grounds at Auck- 
land or Antipodes Is between late Nov. and Feb., only one had 
a growing primary; may have been accidental moult. These 
birds had between Oand 8 old primaries; even in those with no 
old primaries, subtle differences in wear of different primaries 
contrasted with uniform wear of juvenile wing. Of 20 birds 
beachcast or collected at sea between Apr. and Nov., 11 had, 
and nine had not, active primary moult; perhaps, as claimed, 
moult occurs in years when breeding does not take place. 
Active moult of remiges recorded in D.e. chionoptera in Feb. 
Piccuteeel OnLy 

POST -JU VENILE Unknown when primary moult 
begins; beachcast in Wellington on 10 June still had fresh 
primaries, but had begun moult in head, neck, back and under 
wing-coverts. 


MEASUREMENTS D.e. exulans: (1) Auckland Is, 
freshly dead adults (NMNZ). (2) Auckland Is, adult skins 
(NMNZ). (3) Auckland Is, live birds; methods unknown 
(Robertson 1975). (4) Antipodes I., freshly dead adults 
(NMNZ). (5) Antipodes I., adult skins (NMNZ). (6) Antipodes 
L., live adults; WING = flattened chord; other methods as here 


(Warham & Bell 1979). 


MALES FEMALES 
WING = (1) 635, 642, 660 636 
(3) 649 (630-665; 5) 642 (625-657; 11) 
(4) 645, 640 626 (16.0; 602-636, 4} 
(6) 655 (17.0; 8) 626 (5.0; 6) 
STHP = (2) 342, 355, 354 
(5) 347 339 (10.9; 326-352; 4} 
TAIL (1) 203, 204 203 
(4) 205, 193 192.0 (7.48; 182-200, 41 
(6) 203 (2.85; &) 199 (6.11; 6) 
BILL (1) 152, 151, 155 143.5 
(3) 152.1 (150-154, 5) 142.8 (138-147; 11) 
(4) 152, 148.5 141.5 (5.2; 135.5-148.1; 4) 
(6) 1481 (2.52) 8 138.0 (4.55; 6) 
BILLD (6) 414 (1.53, 8) 37.0 (1.74; 6 
TARSUS (1) 120, 117.1, 118.5 
(4) 119, 122 112.6 (4.46; 110-119.2; 4) 
(6) 116.6 (3.24; 8 108.7 (2.87: 6) 
TOE (1) 169, 164.7, 169 158.5 
(4) 168, 169 162.3 (4.67, 159-169.2; 4) 
(6) 176.6 (2.34; 8) 167.0 (7.00; 6) 


Unsexed birds (7) Auckland Is, live birds; methods 
unknown (Robertson & Jenkins 1985). 


WING (7) 


647 (185-205, 12) 


BILL (7) 147.6 (140.4-156.7, 12) 
TARSWSe (7} 114 (108.8-119.5; 12) 
(7) 161.3 (154.0-176.5; 12) 


TE 


D.e. chionoptera: (1) Macquarie L., live birds at Stages 3-7; 


measurements for breeders and non-breeders, which do not 
differ significantly in this sample, combined. Bill length from 
junction of skin and culmen to most distant point of curve of 
maxillary unguis; bill depth at shallowest point; other 
methods as here (Tomkins 192@4b). (2) Indian Ocean, skins, 
including specimens from Marion IJ. and Iles Kerguelen and 
Crozet, and adjacent waters; methods probably as here 
(Murphy). (3) S. Georgia, adults; methods unknown (Tickell 


1968). 
MALES FEMALES 
WING {1} 671 (16.7; 635-701; 17) 635 (14.0; 618-657; 10) * 
(2) 640 (625-679: 10) : 
(3) 679 (14.4; 655-710, 21) 657 (13.7; 630-680; 22) * 
TAIL (1) 220 (9.0, 201-228; 17) -204 (8.0; 193-215; 10) * 
(2) 197 (190-205; 10) 
(3) 227:(7.40; 215-246; 21) 215 (6.23; 20,6-22.7; 23) * 
BILL (1) 165.8 (5.98; 158-181; 17) 158.3 (2.95; 156-166; 10) “* 
(2) 171 (162-177; 10) ' 
(3) 169.1 (4.03; 163-180: 21) 163.8 (3.62; 195-171; 23) * 
BILLD = (1)._—( 37.0 (1.39, 34.8-40.4; 17) 33.5 (1.73; 31.1-36.7; 10) * 
(3) 40.1 (1.66; 37.2-44.6; 21} 35.8 (1.92; 32.0-40.0; 22) * 
TARSUS (1) 126.7 (2.02; 123-132, 17) 120.6 (1.78; 117-123; 10) * 
(2) 123.2 (118-128; 10) 
(3) 118.0 (4.61; 110-127: 21) 35.8 (1.92; 32.0-40.0; 22) * 
TOE (1) 164.8 (5.12; 148-172; 17) 159.8 (5.22; 151-169; 10) * 
(2) 171.7 (167-176; 10) 
(3) 184.4 (6.93; 172-193; 21) 174.7 (4.73; 165-181; 22) * 


Unsexed birds: (4) Iles Crozet; methods unknown (Voi- 


sin 1969). 


WING (4) 678 (4.0; 640-788; 40) 
BILL (4) 154.7 (0.9: 141-165; 40) 
WEIGHTS All in kg. (1) S. Georgia, adults (Tickell 


1968). (2) Macquarie I., Stages 3-7, breeders and non-breed- 
ers. (3) Antipodes I., breeders and non-breeders, 19, 23 Feb. 
(Warham & Bell 1979). 


MALES FEMALES 


(1) 9.768 (0.875; 8193-11907; 20) 7.686 (0.559: 6.719-8.703;, 22) 
(2) &.0, 8.6, 8.6 6.8, 6.4, 75 
(3) 7.35 (0.75; 8) 5.67 (0.94; 6) ae 


Unsexed birds: (4) Marion I.; combined data for unsexed 
adults and incubating females (Brown & Adams 1984). (5) Iles 
Crozet (Voisin 1969). (6) Auckland Is (Robertson & Jenkins 
1985). 


i4) 8.02 (0.96; 9) 
(5) 9.58 (0.2; 6.80-11.5; 40) 
(6) 5.96 (5.0-7.1; 10) 


During incubation shifts at $. Georgia, males lose c. 85 
g/day, and females c. 80 g/day, amounting on average to 17% 
of body weight per shift; weights at beginning of shifts, esti- 
mated from individuals weighed three times during shifts, in 
males 10.5 (0.80; 9.49-11.94. 10), in females 9.00 (0.91; 8.18- 
9.83; 9). No evidence for general decline of weight during 
incubation period (Croxall & Ricketts 1983). Little other in- 
formation on temporal variation; feeding birds off NSW in 
winter 8.3 (5.9-11.3; 108; Gibson 1967), but sex and origin of 
birds unknown. NZ beachcasts, ages combined, believed to be 
D.e.exulans on basis of length of bill: males 6.16 (1.261; 3.560- 
8.520; 14), females 4.87 (1.31; 3.33-8.02; 7); these include 
emaciated birds of 3.33 (fernale), 3.56 (male), males with no fat 
of 7.76, 5.74, and 5.95, and a fat juvenile female weighing 
5.03. 


STRUCTURE Wing, very long and narrow. Eleven pri- 
maries; p11 minute. In five Auckland Is birds, p10 longest, p9 
14-26 shorter, p8 41-69, p7 81-117, p6 136-180, p5 189-243, 
p4 241-301, p3 293-336, p2 340-369, p! 368-398. In six Anti- 
podes Is birds, p10 longest, p9 7-20 shorter, p8 35-53, p7 
80-103, p6 127-150, p5 175-207, p4 237-268, p3 287-321, pz 
329-360, pl 357-386; wings possibly less pointed than at 
Auckland Is, but few data. About 38 secondaries, including 
seven tertials; 12 humerals, about same length as secondaries. 
Tail, short and rounded; 12 feathers. Bill, long and heavy, least 
deep in centre; maxillary unguis, hooked and mandibular 
unguis slightly decurved. No sulcus; plates of bill sharply de- 
fined. Nostril tubes, short with straight sides; nares point 
obliquely forwards and slightly upwards (differing from D. 
epomophora in which sides of nostril tubes bulge outwards and 
nares face slightly inwards). Tarsus slightly laterally com- 
pressed; scales hexagonal, 2-4 mm in diameter. Feet webbed, 
webs cancellate. Middle toe longest, outer 96-97%, inner 83- 
88%; no hind toe. 


GEOGRAPHICAL VARIATION Mostly in size and 
plumage whiteness attained; juvenile plumages similar in all 
populations. Slight evidence for geographical variation in 
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flushing of bill is discussed in Bare Parts. D.e. chionoptera, 
described in Plumages, breeds 5. Georgia, Prince Edward, 
Crozet, Kerguelen and Macquarie Is. N. populations, pres- 
ently combined as one subspecies D.e. exulans, breed at Tris- 
tan da Cunha, Auckland, Antipodes and Campbell Is (Peters). 
All smaller than chionoptera, notably in length of bill. Plu- 
mages are similar to those of chionoptera, but the snowy 
plumage, Stages 6 and 7, is seldom, perhaps never, attained; 
breeding apparently begins in darker plumages. Antipodes I. 
birds have been said to be darker than in other populations 
(Falla 1937; Warham & Bell 1979) but adequate data have yet 
to be published. Ten birds collected or photographed from 
Antipodes 1. (six breeding birds from NMNZ, two pairs of 
unknown status in Warham & Bell 1979 and NZRD) had 
cumulative Gibson scores ranging from 5-11 (plumage Stages 
3-5); in nine collected or photographed at Auckland Is 
(NMNZ; Lindsey [1986]; ].A. Bartle), cumulative Gibson 
scores 10-17 (plumage Stages 3-5). No comparable data avail- 
able for extralimital populations at Tristan da Cunha and 
Gough I., which used to be classed as separate subspecies on 
basis of small size; published measurements in Murphy, 
Hagen (1952) and Elliott (1957) are too few to support any 
conclusion. Birds from Ile Amsterdam, once combined with 
D.e. exulans (Peters), are now classified as separate species D. 
amsterdamensis (Roux et al. 1983; Jouventin & Roux 1983; 
Jouventin et al. 1989). Their plumages are similar to younger 
stages of D.exulans, most birds breeding at plumage Stage ?; 
measurements published in Jouventin et al. (1989) seem simi- 
lar to those of D.e. exulans. The major difference is in bill 
colour: D. amsterdamensis have conspicuous dark-brown 
lower tomia, similar to those of D. ebomophora, and ungues 
are dark brownish-green, darker than rest of bill. 

DIR 
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Plate 15 


Royal Albarrass Diomedea epomophora epomophora 
. Juvenile, dorsal, Stage 1 

. Juvenile, ventral, Stage 1 

. Dorsal, Stage 2 

. Dorsal, Stage 3 

. Adult, dorsal, Stage 5 

. Adult, ventral, Stage 5 
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High-Fidelity Optimization of Flapping Airfoils and 
Wings 


Matthew Culbreth,* Yves Allaneau* and Antony Jameson! 
Stanford University, Stanford, CA 94305, USA 


We present the results of several optimizations for 2D airfoils and 3D wings undergoing 
periodic, flapping-type motions. Optimizations are achieved by coupling high-fidelity 2D 
and 3D Navier-Stokes solvers with a gradient-based optimization algorithm. In 2D we 
consider a pitching and plunging NACA0O012 airfoil and investigate the maximization of 
the propulsive efficiency. In 3D we consider a rectangular wing that is hinged at the root. 
The motion of the 3D wing is parameterized by spline control points that allow span-wise 
variation of both the dihedral angle and twist angle, allowing complex wing motions and 
deformations with relatively few parameters. Propulsive efficiency is maximized for the 
3D wing in an non-twisting case as well as using one, two and four of these span-wise twist 
control points. The results of the optimizations lead to several conclusions, including that 
pitching and twisting can significantly improve the attainable propulsive efficiency, can 
delay the onset of leading edge separation, and that solutions that maximize propulsive 
efficiency appear to operate at the limit of leading edge separation. 


Nomenclature 
a Frequency of oscillation 
Qu Pitch Angle (2D) and twist angle (3D) 
6 Dihedral Angle (3D) 
h Plunging amplitude (2D) 
) Phase offset corresponding to a 
w Phase offset corresponding to 6 
ys Modified propulsive efficiency 
P Aerodynamic power 
Cr Lift coefficient 
Cp Drag coefficient 
x Vector of optimization parameters 
f(x) Optimization objective function 
g(x) Non-linear constrain functions 
Ax Linear optimization constraints 
] Lower optimization constrain bound 
u Upper optimization constrain bound 
Subscript 
U9 ,2 Coordinate directions 
k,l Value at k®” or I: control points for the dihedral or twist angle, respectively 
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I. Introduction 


There has been a significant increase in research interest in the aerodynamics of flapping wings within 
the last decade. Flapping wing flight has become a significant engineering research direction, with the 
miniaturization of electronic components and increases in battery energy density allowing for the creation of 
a class of small-scale unmanned flying vehicles generally referred to as micro air vehicles (MAVs). Flapping 
wings are of particular interest for this application both because of the numerous, highly-successful examples 
of flapping wing flight found in nature and because flapping wings provide an attractive flight system that 
provides lifting, thrust and control in a single package. Investigation of flapping wing flight is also of 
significant importance to the fundamental science of aerodynamics. The development of aircraft over the 
last century has provided a significant and mature body of understanding of the physics of steady-state 
fluid dynamics across a wide range of flight speeds. This work generally has little application, however, to 
the complex and unsteady flapping wing flight regime. Fortunately, high-fidelity computational tools and 
associated powerful computational hardware now allow for accurate simulation of flapping wings, making 
it possible to begin to understand the physics of this type of flight. These high-fidelity simulation tools 
have also opened new possibilities for research into the fluid dynamics of bird and insect flight within the 
biology community. These tools now allow detailed examination in areas such as the relationship between the 
kinematics and the resulting forces such as lift and thrust and the flow physics associated with the numerous 
examples of animal flight mechanisms. 

Flapping wing flight is complex and difficult from a research and design stand-point. As previously 
mentioned, very few of the efficient computational design tools used for large aircraft design can be used in 
the unsteady, low Reynolds number regime. Accurate simulation of flapping wings requiring costly unsteady 
numerical flow simulations that can accommodate large mesh motions and deformations and feature low 
numerical dissipation to accurately capture the complex, vortex dominated flow physics. A second difficulty 
is that there are many degrees of freedom in the parameterization of flapping wing motions when considering 
a flexible wing in a generalized periodic flapping motion. While much has been learned from observing the 
flight of birds and insects, it is still far from clear how to couple wing flexibility and flapping motion in an 
efficient or optimal way for a given flight performance metric. Additionally, it is not clear how to compactly 
parameterize flapping wing motion while still retaining the appropriate level of complexity of motion required 
for efficient flight. 

We propose to address the issues associated with selecting an appropriate parameterization and then 
identifying the parameter values that yield efficient performance by using a numerical optimization process. 
We have developed 2D and 3D viscous flow solvers that are specifically tailored to simulating a flapping 
wing. ‘These codes are coupled to a gradient-based numerical optimization algorithm. ‘The optimization 
process is then used in conjunction with parameterizations of varying complexity to both identify optimal 
flapping motions and also investigate the trade-off between the complexity of the motion and the attainable 
performance. In this paper we specifically address the optimization of propulsive efficiency in the 2D and 
3D case. In 2D we investigate the pitching and plunging airfoil case. In 3D we investigate a flapping and 
twisting wing that is hinged at the wing root and has varying degrees of complexity of span-wise twist. We 
compare the results of these optimizations both to understand the attainable limits of propulsive efficiency 
and the cost versus benefit of adding additional degrees of freedom to the wing motion. 











Background 


The study of flapping wings dates back to the beginnings of the study of aerodynamics itself , with attempts 
at constructing flapping wing vehicles that predate the first powered aircraft. The development of the 
theory of flapping wings through the twentieth century was somewhat of a footnote in the evolution of 
aerodynamic theory, which was largely driven by understanding transonic and supersonic fixed-wing flight. 
Early examples of flapping wing theory come from the work of Theodorsen! and Garrick,? who developed 
analytic models for airfoils undergoing small-amplitude oscillations in 2D potential flow, work that was 
largely motivated by attempts to understand aeroelastic flutter. Lighthill? and Weis-Fogh,* among others, 
performed pioneering research on the aerodynamics of biological fliers including the elucidation of some of the 
complex aerodynamic mechanisms used by these creatures. Interest in flapping flight increased significantly 
towards the end of the twentieth century as steady aerodynamic theory reached maturity and improvements 
in algorithms for computational fluid dynamics (CFD) and increasing computing power made higher-fidelity 
simulations of flapping wings possible. Early numerical simulations of flapping wings were carried out using 
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unsteady vortex-lattice and panel methods.**® Current computational capabilities allow for high-fidelity, 
3D, unsteady Navier-Stokes simulations of flapping wings. A number of detailed CFD studies have been 
completed, including work by Jones, et. al.,”° Shyy et. al.,'+ Persson et. al.,? and Ou et. al.'° 

There has also been long-standing research interest in optimization of flapping flight, based partially on 
the assumption that natural fliers operate in a manner that is optimal in some sense. R.T. Jones!” developed 
an expression for the optimal lift distribution along the wing during flapping motion by minimizing induced 
drag for a given wing bending moment in potential flow. Hall and Hall!’ compute the optimal span-wise 
circulation distribution on a thrusting and lifting wing using a 1D integral solution in the small amplitude case 
and a vortex-lattice code in the large amplitude case. Hamdaoui et. al'+ use a multi-objective evolutionary 
algorithm coupled with analytic flapping wing models to optimize various flight metrics. Ito!° couples a 
vortex-lattice model with a hybrid optimization method that combines a genetic algorithm with a sequential 
quadratic programming algorithm. Strang?!® utilizes a vortex-lattice code in conjunction with the gradient- 
based optimization to investigate the optimal flapping gait of a Pterosaur wing. Milano and Gharib!’ couple 
a genetic algorithm to an experimental apparatus with a two degree of freedom flapping rectangular plate to 
maximize average lift force. Tuncer and Kaya!® use gradient-based optimization coupled with a 2D overset- 
erid Navier-Stokes solver to maximize thrust and propulsive efficiency of a pitching and plunging airfoil. 
Willis et. al.!? uses a multi-fidelity approach to optimize flapping wing performance metrics. In their work, 
flapping motions are generated using optimal wake vorticity distributions generated by a wake only method, 
which are then further refined using a panel method and finally verified using a Discontinuous Galerkin-based 
3D Navier-Stokes solver. 

In this work we propose to couple a 3D Navier-Stokes solver with a gradient-based optimization in order 
create a framework for investigating various combinations of flapping motions and optimization objectives. 
The details of this framework and several optimization results are presented in the remained of this document. 











II. Optimization Framework 


Optimization Input File 
Alg O rit h mM Objective Motion Solve ' 
Function Deformation 
Value 


Time History of Forces 





Figure 1. Software block diagram 


A block diagram of the software framework for solving flapping-wing optimization problems is shown in 
figure 1. The two major components are the optimization algorithm and the flow solver, with a “wrapper” 
that mainly serves as a light-weight interface between these two larger pieces of software. Details of the flow 
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solver and optimization algorithm, as well as the formulation of the optimization problems are discussed in 
the remainder of this section. 


2D and 3D Navier-Stokes Solvers 


The 2D and 3D flow solvers are based on the low-dissipation kinetic energy preserving (KEP) finite volume 
scheme developed by Jameson?’:! and extended by Allaneau and Jameson.?* The kinetic energy preserving 
property of this scheme allows stability to be maintained with little or no artificial dissipation. This property 
is especially desirable for vortex dominated flows such as flapping flight since artificial dissipation tends to 
quickly and unnaturally dampen complex flow features. These codes have been specifically developed to 
simulate oscillating airfoils and flapping wings and to be integrated into an optimization framework. ‘The 
full details of this code can be found in the work of Allaneau et. al.?° 

A TVD Runge-Kutta second-order multistage time stepping scheme” is used for time integration. Struc- 
tured grids are used throughout, with 2D cases using a C-mesh and 3D cases using a C-H mesh. Both the 
2D and 3D solvers allow for mesh motion, with the 2D solver allowing rigid-body transformations of the 
mesh (enabling pitching and plunging motions), and the 3D solver allowing both rigid transformation and 
complex deformations of the mesh. The mesh deformations in 3D allow for a wide range of parameterized 
shape changes, such as wing bending and twisting as well as planform variations, among many other pos- 
sibilities. Deformations in 3D are accomplished in a computationally inexpensive manner using analytic 
transformations to a base, undeformed mesh via shearing, stretching and twisting operations. Examples of 
bending and twisting mesh deformations are shown in figure 2. 








Figure 2. Examples of 3D mesh deformation capabilities in the Navier-Stokes solver 


Optimization Problem 


We consider a constrained, non-linear optimization problem of the form 


minimize f (x) 
| - (1) 
subject to I< g(x) <u 
Ax 


where f is the optimization objective function, x are the optimization parameters, g(x) are non-linear 
constraint functions, A is a matrix of linear constraints on x, and 1 and u are the lower and upper constraint 
bounds. The optimization variables x parameterizes the motion and geometry of the flapping wing and f(x) 
and g(x) are functions of the time-averages of the force and power coefficients calculated by the flow solver. 
Details of the parameterization, the objective functions and the optimization algorithm for the 2D and 3D 
optimizations are given in the remainder of this section. 
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Parameterizations 


In the 2D case we consider single-mode sinusoidal pitching and plunging motions. In the general case these 
motions take the form 

a(t) = ag + acos(27 ft + ¢) (2) 

h(t) = hcos(2z ft) (3) 


The parameterization used in all cases considered herein consists of four optimization variables: the frequency 
f, the pitch and plunge amplitudes a and hf and the phase difference between pitch and plunge @. The base 
angle of attack ao is set to zero in all cases, yielding motions that are symmetric about the axis of free-stream 
flow. 

The 3D motion parameterization is intended to mimic typical flapping motions with combinations of 
varying dihedral angle at the wing root and twisting along the span. Smooth variation of the dihedral 
angle 0(s) and the twist angle a(s) (where s is the span-wise arc-length parameter) are generated by fitting 
piece-wise cubic splines through one or more control points along the span. Illustrations of the spline control 
points for dihedral angle and twist are show in figure 3. For the resulting surface deformation refer again to 
figure 2. 























Figure 3. Illustration of the 3D flapping parameterizations showing the dihedral angle (left) and the twist angle (right) 





The time-dependent flapping motion is parameterized by prescribing sinusoidal variation to a set of k 
dihedral control points and / twist control points spaced evenly along the span. Each control point maintains 
a parameter for the incremental dihedral angle 0; and the incremental twist a; along with associated phase 
angles ¢; for the flap angle and y for the twist angle. The equations of motion are of the form 


0),(t) = 04 cos(2n ft + bx) (4) 
a, (t) = a; cos(2r ft + yy) (5) 


The complete motion of the wing is parameterized by the frequency f, the dihedral and twist angles 6; and 
a,, and the dihedral and twist phase angles @, and 7, for k = 1...m and / = 1...n control points and 
resulting in 2m + 2n + 1 total optimization parameters. 


Components of the Objective Functions and Constraints 





Objective functions are based on time-averages of various integrated force and power coefficients calculated 
by the flow solver. These include the average lift C,, average thrust Cy = —Cp and the aerodynamic power 
averages P,, Ee, and P,. We also differentiate between the thrust-producing power P, = U,,Cr and the 
non-thrust-producing powers r and P,. In all cases these averages are computed by integrating over a single 
flapping cycle after a suitable number of periods have elapsed to allow the flow to reach a quasi- periodic 
state. 

Propulsive efficiency is a commonly used performance metric in propellers and flapping wing systems and 
the maximization of propulsive efficiency is an attractive objective function for the optimization of these 
systems. The propulsive efficiency is the ratio of thrust-producing power to the mechanical power required 
to flap the wing or to pitch and plunge the airfoil. However, as first observed by Jones and Platzer,° the 
propulsive efficiency is a discontinuous function. Furthermore, the discontinuity occurs in the vicinity of the 
maximum propulsive efficiency, making it a very difficult problem for many optimization algorithms. For 
this reason we use a “modified propulsive efficiency” n, given by 


a 
im = 
| |P2 + P2 + P? 
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This formulation removes the problem of a discontinuous objective function since, for all practical cases, 
es ee 


Optimization Algorithm 


A single objective function evaluation requires a time-accurate solution from the 2D or 3D flow solver. These 
solutions are computed on large clusters and typically use between 64 and 512 compute cores. However, even 
with these significant compute resources flow solutions take on the order of several hours. The significant 
computational cost of objective function evaluations motivates the use of a gradient-based optimization 
algorithm since these methods tend to require far fewer evaluations of the objective functions than gradient- 
free methods such as genetic algorithms so long as the objective function is locally smooth near the optimum. 
We have chosen to use the SNOPT package”? for all of the optimizations presented here. SNOPT is a widely 
used software library based on the sequential quadratic programming (SQP) method and is designed for 
use on constrained, non-linear optimization problems. Gradients are computed within SNOPT via finite- 
differences, which require on the order of n function evaluations per gradient calculation. The calculation of 
the gradients dominates the computational cost for all but the smallest problems, however since the function 
evaluations within each gradient calculations are independent the process can be parallelized. 








III. 2D Optimizations 


For the 2D pitching and plunging airfoil case we consider the maximization of the modified propulsive 
eficiency using a NACA0012 airfoil at a Reynolds number of 1850 based on the chord and a mach number 
of 0.2. The numerical solutions for all 2D cases use a 1024 x 128 C-mesh and are integrated over five periods 
to ensure that the force and power reach a periodic state. The force and power coefficients are integrated 
over the final oscillation cycle to obtain the averaged quantities for the objective function. The solver is run 
in parallel on a large cluster, typically using 64 compute cores per flow solution. Compute times are on the 
order of 2 hours per flow solution. Optimal solutions are validated using a 4096 x 512 mesh that is effectively 
DNS resolution for the given Reynolds number. 

The parameter values for the optimal cases are given in table 1. Figure 5 show the contours of vorticity 
magnitude at four points in the flapping cycle. Figure 4 contains the lift versus drag coefficient polars over 
one flapping cycle. 





Table 1. Optimal parameters for 2D maximization of modified propulsive efficiency 


Case Frequency ay h @ Nis 
Plunging 2.63 Hz - 0.252 - 12.1% 
Pitching/Plunging 4.61 Hz 19.78° 0.212 67.03° 31.4% 


The results from the optimizations show that an increase in modified propulsive efficiency from 12.1% to 
a maximum of 7,, = 31.4% is achieved by adding the pitching degree of freedom, and it is also interesting 
to note that the frequency nearly doubles for the pitching and plunging case. The lift versus drag polars in 
figure 4 give insight into the differences between these two cases. Non-lifting cases with 7, = 100% must all 
lie along the line C;, = 0 since P, and P, must be everywhere zero. The efficiency of a given flapping cycle 
can then generally be inferred by observing the aspect ratio of the polar. ‘The wider and the polar along C'p 
and the narrower along C’,, the greater the magnitude of 7,,,, and additionally, for 7,, to be positive the polar 
must show a net thrust. This is seen in figure 4 where the pitching and plunging case shows a significant 
increase in thrust production compared with the plunging case with only a relatively small increase in lift 
through the cycle. 

The vorticity contours in figure 5 show the formation of significant leading-edge vortices on the up and 
down stroke in the plunging case. The pitching and plunging case, however, shows an absence of leading 
edge vortex formation. Close examination of the pitching and plunging case shows the formation of a small 
separation bubble that appears near the leading edge of the airfoil at the beginning of the up and down 
strokes and moves aft as the stroke progresses, but does not develop into a vortex. The formation of this 
separation bubble indicates that the flow is likely on the verge of stalling, which would lead to the formation 
of a full leading-edge vortex. This indicates that a characteristic of maximum propulsive efficiency in the 2D 
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Figure 4. Cy versus C’p polars for optimal 2D cases. The vertical axis represent lift and the horizontal axis represents 
drag. The left plot shows the polar for the plunging case and the right plot shows the polar for the pitching and 
plunging case. 





Figure 5. Vorticity visualization of the flapping cycle for maximum propulsive efficiency. The top row depicts the 
plunging case and the bottom row depicts the pitching and plunging case. Note that the vorticity contour colormaps 
values are not quite the same between the top and bottom rows. 


case is that the motion drives the flow to the limit of separation without exceeding it, which would cause a 
drop in suction and dissipate of excessive energy into the flow by forming a vortex. 


IV. 3D Optimizations 


For the 3D flapping wing case we consider the maximization of propulsive efficiency with increasing 
numbers of span-wise control points. The wing configuration in all cases has a rectangular planform with 
an aspect ratio of 8 and a NACAO0012 airfoil section. The domain is discretized using an H-C mesh with 
256 x 64 x 64 cells and employs a symmetry plane. A mach number of 0.2 and a Reynolds number of 2000 
based on the chord length are used throughout and flow solutions are integrated over five flapping cycles, 
with the averaged force and power coefficients being integrated over the final flapping cycle. Flow solutions 
are run in parallel, typically using 256 cores, requiring approximately 3-4 hours of wall-time per simulation. 

We present results for four cases with a single dihedral control point (k = 1) and zero, one, two and 
four evenly spaced twist control points along the span (J = 0,1,2,4). The / = 0 case yields purely dihedral 
oscillation and is akin to the plunging airfoil case in 2D. The / = 1, 2&4 are akin to the pitching and plunging 
case in 2D with increasing complexity of spanwise twist distribution. In the / = 4 case the optimization 
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algorithm failed to improve on the results from the / = 2 case, and so further results from this case are not 
included. Table 2 contains a summary of the 3D optimizations. Figure 6 shows the wing geometry for each 
case with surface pressure contours at four points over one flapping cycle. Figure 7 shows the same plots 
with vorticity isosurfaces overlaid. Figure 8 shows the lift versus drag coefficient polars over one flapping 
cycle. 


Table 2. Summary of 3D maximization of propulsive efficiency for varying numbers of span-wise control points 


Control Points Frequency 6 Cumulative ‘Twist “pe 
0 3.626 Hz  6.53° — 9.60% 
1 0.641 Hz 46.9° 62.5° 46.7% 
2 0.755 Hz 47.1° 58.4° 49.5% 
4 — — — No Improvement 





Figure 6. Vorticity isosurface visualization of the flapping cycle for maximum propulsive efficiency. Color values are 
based on pressure. The top row depicts the case without twisting motion, the middle row depicts the case with one 
twist control point, and the bottom row depicts the case with two twist control points. 


The 3D results show that the addition of wing twist produces a significant improvement in the modified 
propulsive efficiency, increasing the propulsive efficiency from around 10% to almost 47% by allowing linear 
twist. The efficiency is further improved to nearly 50% by allowing for a more complex and non-linear twist 
distribution. There is also a significant decrease in frequency and increase in dihedral amplitude between 
the / = 0 and / = 1 cases, while there is a relatively minor change in the motion between the / = 1 and / = 2 
cases. All phase angles, which have been omitted for brevity, are within the range 90° + 10°. 

Figure 6 further illustrates the significant difference in the dihedral amplitude between the twist-free case 
and the twisting cases, while there is little discernible difference in the twist distribution with one versus two 
twist control points. Further analysis of the / = 2 case indicates that the twist angle reaches approximately 
78% of the final tip twist angle of 58.4° between the wing root and the mid-span control point, with relatively 
little additional twist occurring between mid-span and the wing tip. 
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Figure 7. Vorticity isosurface visualization of the flapping cycle for maximum propulsive efficiency. Color values are 
based on pressure. The top row depicts the case without twisting motion, the middle row depicts the case with one 
twist control point, and the bottom row depicts the case with two twist control points. 


The vorticity isosurface plots in figure 7 provide some insight into the flow physics of the different cases. 
The twist-free case shows evidence of significant leading edge separation during the up and down strokes 
towards the wing tips, while the plots for the twisting cases indicate much less leading edge separation. 
The twisting cases appear demonstrate some instability or separation bubble formation, with the | = 2 case 
appearing to reduce the severity of the instability over the / = 1 case. These findings are similar to the 
findings in the 2D cases in that the addition of pitching/twisting improves efficiency by apparently reducing 
or eliminating leading edge separation, and that high propulsive efficiencies are achieved by motions that 
are at the limit of leading edge separation. 

The polar plot in figure 8 also show trends that are similar to the 2D case. The polar for the twist-free 
case has a large C', range and a low-magnitude but largely negative (thrust producing) C'p range resulting in 
low efficiency. ‘The twisting cases show a reduced C’;, and increased thrust, both contributing to the increase 
in efficiency. Between the / = 1 and / = 2 cases the maximum thrust coefficient increases from approximately 
0.8 to 1.2 without a significant change in the C’, range. A noted difference between the 2D and 3D cases is 
that, while in the 2D case the Cy, range increases for the pitching and plunging case, in 3D the Cy, range 
decreases for the twisting cases. It is not clear if this is a result of the fundamental difference between 2D 
and 3D flows, or if it an indication of potential room for further optimization of the 2D results. 











V. Conclusion 


Our results show that significant improvements are made to propulsive efficiency of by introducing pitch- 
ing motion in 2D and twisting motion in 3D. The physical mechanism for this improvement in both cases 
appears to be the elimination of the inefficient leading edge vortex by providing instantaneous angle of attack 
relief with respect to the motion of the wing or airfoil. The airfoil and wing both appear to be on the verge of 
forming leading edge vortices as evidenced by the appearance of separation bubbles and instabilities in plots 
of vorticity. Overall, results show that pitching/twisting are important mechanisms for producing efficient 
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Figure 8. Cy versus C’'p polars for optimal 3D cases. The vertical axis represent lift and the horizontal axis represents 
drag. The left plot shows the polar for the non-twisting case, the center plot shows the polar for the single control 
point case and the right plot shows the polar for the 2 control point case. 


thrust in flapping flight. In the 2D parameterizations the addition of two parameters to represent the ampli- 
tude and phase of a simple sinusoidal pitching or linear twisting motion provides significant improvements 
in the maximum attainable propulsive efficiency over the cases without these motions. The 3D results show 
that further improvements are made by increasing the complexity of the twisting motion by the addition 
of a second control point at mid-span. The resulting twist distribution with two control points shows a 
relatively large change from root to mid span and a relatively small change from mid-span to the wing-tip. 
The resulting flapping motion appears to further reduce the presence of instabilities related to separation. 

The work here represents an investigation into a small subset of the potentially interesting parameteriza- 
tions and objective for flapping flight. Fortunately, the optimization framework we use allows for the study 
of a wide rage of motions and objectives with relatively little modification. Interesting future work for the 2D 
case includes increasing the complexity of the parameterization by representing the motion via a sinusoidal 
expansion or piece-wise spline, or to allow for deformation of the airfoil geometry. There are a number of 
additional parameterization possibilities in 3D as well, include parameterizing the wing planform, allowing 
more dihedral control points, altering the airfoil section, and allowing for more complex parameter trajec- 
tories. The work here also only considers the unconstrained maximization of propulsive efficiency. Finally, 
an objective perhaps more relative to efficient flight is the minimization of input power given constraints on 
lift and thrust. This case relates to steady, level flight where the net thrust and lift are zero. 











Acknowledgments 


This work is supported by the Department of the Army through the Army High Performance Computing 
Research Center, Cooperative Agreement No. W911NF-07-2-0027, and by the AFOSR under grant no. FA 
9550-07-1-0195 from the Computational Math Program under the direction of Dr. Fariba Fahroo. Yves 
Allaneau is supported by the Hugh H. Skilling Stanford Graduate Fellowship. 


References 


1'T. Theodorsen, General Theory of Aerodynamic Instability and the Mechanism of Flutter, T.R. No. 496, N.A.C.A., 1935 

21. E. Garrick, Propulsion of a Flapping and Oscillating Airfoil, T.R. No. 567, N.A.C.A., 1937. 

3M.J. Lighthill, Hydromechanics of Aquatic Animal Propulsion, Annu. Rev. of Fluid Mech., Vol. 1, p. 413-446, 1969. 

“T. Weis-Fogh, Quick Estimates of Flight Fitness in Hovering Animals, Including Novel Mechanisms for Lift Production, 
J. Exp. Biol., Vol. 59, p.169-230, 1973. 

©P. Liu, A Time-Domain Panel Method for Oscillating Propulsors with Both Chordwise and Spanwise Flexibility, Ph. D. 
Thesis, University of Newfoundland, 1996. 

°K. D. Jones and M.F.Platzer, Numerical Computation of Flapping-Wing Propulsion and Power Extraction, AIAA Paper 
97-0826, 1997. 

7K. D. Jones, C. M. Dohring and M. F. Platzer, Experimental and Computational Investigation of the Knoller-Betz Effect, 
AIAA Journal, Vol. 36, No. 7, July 1998. 

8k. D. Jones, T. C. Lund and M. F. Platzer, Experimental and Computational Investigation of Flapping Wing Propulsion 
for Micro Air Vehicles, Progress in Aeronautics and Astronautics, Vol. 195, p. 307-339, 2001. 


10 of 11 


American Institute of Aeronautics and Astronautics 


°P. -O. Persson, D. J. Willis and J. Peraire, The Numerical Simulation of Flapping Wings at Low Reynolds Numbers, 
AIAA Paper 2010-724 

10K. Ou, P. Castonguay and A. Jameson, 3D Wing Simulation with High-Order Spectral Difference Method on Deformable 
Mesh, AJAA Paper, Orlando, 2011. 

lw. Shyy, Y. Lian, J. Tang, H. Liu, P. Trizila, B. Stanford, L. Bernal, C. Cesnik, P. Friedmann, P. Ifju, Computational 
Aerodynamics of Low Reynolds Number Plunging, Pitching and Flexible Wings for MAV Applications, AIAA Paper 2008-528 

12R. T. Jones, Wing Flapping with Minimum Energy, NASA TMJ81174, 1980. 

13K. C. Hall and S. R. Hall, Minimum Induced Power requirements for Flapping Flight, J. Fluid. Mech., Vol 323, p. 285-315, 
1996. 

14. Hamdaoui, J.-B. Mouret, S. Doncieux and P. Sagaut, Optimization of Kinematics for Birds and UAVs using Evolu- 
tionary Algorithms, Proceedings of the World Academy of Science, Engineering and Technology, Vol. 30, July 2008. 

15K. Ito, Optimization of Flapping Wing Motion, ICAS 2002 Congress, 2002. 

16K. A. Strang, Efficient Flapping Flight of Pterosaurs, Ph. D. Thesis, Stanford University, 2009. 

!7M. Milano and M. Gharib, Uncovering the Physics of Flapping Flat Plates with Artificial Evolution, J. Fluid Mech., Vol. 
534, p. 403-409, 2005. 

181. H. Tuncer and M. Kaya, Optimization of Flapping Airfoils for Maximum Thrust and Propulsive Efficiency, Acta 
Polytechnica, Vol 44, No. 1, 2004. 

191. J. Willis, P. -O. Persson, E. R. Israeli, J. Peraire, S. M. Swartz, K. S. Breuer, Multifidelity Approaches for the 
Computational Analysis and Design of Effective Flapping Wing Vehicles, AITAA Paper 2008-518, 2008. 

20A. Jameson, The Construction of Discretely Conservative Finite Volume Schemes that Also Globally Conserve Energy 
or Entropy, Journal of Scientific Computing, Vol. 34-2, p. 152-187, 2008 

21A. Jameson, Formulation of Kinetic Energy Preserving Conservative Schemes for Gas Dynamics and Direct Numerical 
Simulation of One-Dimensional Viscous Compressible Flow in a Shock Tube Using Entropy and Kinetic Energy Preserving 
Schemes, Journal of Scientific Computing, Vol. 34-2, p. 188-208, 2008 

22Y, Allaneau and A. Jameson, Direct Numerical Simulations of Plunging Airfoils, A[AA Paper 2010-728, January 2010. 

23VY_ Allaneau, M. Culbreth and A. Jameson, A Computational Framework for Low Reynolds Number 3D Flapping Wings 
Simulations, AIAA Paper, Hawaii, 2011. 

24C. W. Shu, Total Variation Diminishing Time Discretizations, SIAM Journal on Scientific and Statistical Computing, 
Vol. 9, p. 1073-1084 

25P, EB. Gill, W. Murray, and M. A. Saunders, SNOPT: An SQP Algorithm for Large-Scale Constrained Optimization, 
SIAM Rev. Vol. 47-1, 99-131, 2005 





11 of 11 


American Institute of Aeronautics and Astronautics 


LETTER 


https://doi.org/10.1038/s41586-018-0533-0 











Glider soaring via reinforcement learning in the field 


Gautam Reddy!», Jerome Wong-Ng!”, Antonio Celani*, Terrence J. Sejnowski** & Massimo Vergassola!* 


Soaring birds often rely on ascending thermal plumes (thermals) 
in the atmosphere as they search for prey or migrate across large 
distances’ *. The landscape of convective currents is rugged and 
shifts on timescales of a few minutes as thermals constantly form, 
disintegrate or are transported away by the wind*®. How soaring 
birds find and navigate thermals within this complex landscape 
is unknown. Reinforcement learning’ provides an appropriate 
framework in which to identify an effective navigational strategy as a 
sequence of decisions made in response to environmental cues. Here 
we use reinforcement learning to train a glider in the field to navigate 
atmospheric thermals autonomously. We equipped a glider of two- 
metre wingspan with a flight controller that precisely controlled 
the bank angle and pitch, modulating these at intervals with the 
aim of gaining as much lift as possible. A navigational strategy was 
determined solely from the glider’s pooled experiences, collected 
over several days in the field. The strategy relies on on-board 
methods to accurately estimate the local vertical wind accelerations 
and the roll-wise torques on the glider, which serve as navigational 
cues. We establish the validity of our learned flight policy through 
field experiments, numerical simulations and estimates of the noise 
in measurements caused by atmospheric turbulence. Our results 
highlight the role of vertical wind accelerations and roll-wise torques 
as effective mechanosensory cues for soaring birds and provide a 
navigational strategy that is directly applicable to the development 
of autonomous soaring vehicles. 

In reinforcement learning, an animal maximizes its long-term reward 
by taking actions in response to its external environment and internal 
state. Learning occurs by reinforcing behaviour based on feedback from 
past experiences. Similar ideas have been used to develop intelligent 
agents that have achieved spectacular performance in strategic games 
such as backgammon® and Go’, visual-based video game play'® and 
robotics'’””. In the field, however, constraints imposed by variable 
and uncontrolled conditions prevent learning agents from using data- 
intensive learning algorithms and the optimization of model design 
needed for quicker learning. These are the conditions most often faced 
by living organisms. 

A striking example in nature is provided by thermal soaring. 
Atmospheric convection is not consistent across days and, even under 
suitable conditions, the locations, sizes, durations and strengths of 
nearby thermals are unpredictable. As a result, the distribution of sam- 
ples used to train the glider differs day-to-day. Gliders and birds oper- 
ate at spatial and temporal scales where fluctuations in wind velocities 
are due to turbulent eddies lasting a few seconds that may mask or 
falsely enhance a glider’s estimate of its mean climb rate. Further, the 
measurement of navigational cues using standard instrumentation 
may be consistently biased by aerodynamic effects that require pre- 
cise quantification. Here, we demonstrate that reinforcement learning 
can meet the challenge of learning to soar effectively in atmospheric 
turbulent environments. In past work, by contrast, the manoeuvring 
of an autonomous helicopter in ref. '’ is a control problem that is 
decoupled from environmental fluctuations and has little trial-to-trial 
variability. Past autonomous soaring algorithms have largely relied 
on locating the centroid of a drifting Gaussian thermal!*"!°, which 


is unrealistic, or have applied learning methods in highly simplified 
simulated settings'’~"” 

Using the reinforcement learning framework’, we may describe the 
behaviour of the glider as an agent traversing different states (s) by 
taking actions (a) while receiving a local reward (r). The goal is to 
find a behavioural policy that maximizes the ‘value’: that is, the mean 
sum of future rewards up to a specified horizon. We seek a model-free 
approach, which estimates the value of different actions at a particular 
state (called the Q function) solely through the agent's experiences during 
repeated instances of the task, thereby bypassing the modelling of 
complex atmospheric physics and aerodynamics (see Methods). The 
optimal policy is subsequently derived by taking actions with the 
highest Q value at each state, where the state includes sensorimotor 
cues and the glider’s aerodynamic state. 

To identify mechanosensory cues that could guide soaring, we 
recently combined the above ideas with simulations of virtual gliders 
in numerically generated turbulent flow”’. Two cues emerged from 
our screening: (1) the vertical wind acceleration (a,) along the glider’s 
path and (2) the spatial gradients in the vertical wind velocity across 
the wings of the glider (w). Intuitively, the two cues correspond to the 
gradient of the vertical wind velocity in the longitudinal and lateral 
directions of the glider, which locally orient it towards regions of higher 
lift. Simulations’? further showed that the glider’s bank angle is the cru- 
cial aerodynamic control variable; additional variables such as the angle 
of attack, or other mechanosensory cues such as temperature or vertical 
velocity, offer minor improvements when navigating within a thermal. 

To learn to soar in the field, a glider (wingspan, 2 m) was equipped 
with autonomous soaring capabilities (Fig. la, b). The glider is 
equipped with a flight controller, which uses a feedback control sys- 
tem to modulate the glider’s ailerons and elevator such that the bank 
angle and pitch take the values desired by the behavioural policy being 
used (we use two different behavioural policies during initial learning, 
and the gliders then implement a further policy—the final navigational 
strategy—after learning). Relevant measurements, such as the altitude, 
ground velocity (wu), airspeed, bank angle (ju) and pitch, are made con- 
tinuously at 10 Hz with standard instrumentation (see Methods). At 
fixed time intervals, the glider changes its heading by modulating its 
bank angle in accordance with the implemented behavioural policy. 

Noise and biases that affect learning in the field require the devel- 
opment of appropriate methods to extract environmental cues from 
measurements made by sensory devices. We found that estimates of 
a, from the derivative of the vertical ground velocity (u,) are biased 
by longitudinal motions of the glider about the pitch axis as the glider 
responds to an imbalance of forces and moments while turning. By 
modelling the glider’s longitudinal dynamics, we obtain an unbiased 
estimate of the local vertical wind velocity (w,), and a, as its derivative 
(see Methods). The estimation of the spatial gradients across the wings, 
Ww, poses a greater challenge, as it involves the difference between two 
noisy measurements at relatively close positions. The key observation 
that we used here is that the glider rolls because of contributions from 
vertical wind velocity gradients, the feedback control mechanism and 
various aerodynamic effects. The resulting roll-wise torque can be 
estimated from the small deviations of the true bank angle from 
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Fig. 1 | Soaring in the field by using turbulent navigational cues. 
a, A trajectory (orange line) of our glider soaring in Poway, California. 
b, A cartoon of the glider showing the available navigational cues— 
gradients in vertical wind velocities (indicated by the length of the blue 
arrows) along the trajectory and across its wings, which generate a vertical 
wind acceleration a, and a roll-wise torque w, respectively. c, A sample 
trace of the estimated vertical wind velocity w, and corresponding a, 
obtained in the field. d, The measured bank angle ju and the estimated 
w during the same trial as in c. w (solid green line) is estimated from the 
small deviations of the measured bank angle (solid blue line) from the 
expected bank angle (dashed orange line) after accounting for other effects 
(see Methods). The black arrows mark the enlarged bank angle trajectory 
shown in the inset in the left panel. 


the desired one, and a new dynamical model allows us to separate 
the w contribution due to velocity gradients from the other effects 
(see Methods). A sample trace of the resulting unbiased estimate of 
w is shown in Fig. lc, d, together with traces of w,, 44 and unbiased 
estimates of a,. 

Equipped with a proper procedure for estimating environmental 
cues, we next addressed the specifics of learning in the field. First, to 
constrain our state space, we discretized the range of values of a, and 
w into three states each: positive high (+), neutral (0) and negative 
high (—). Second, we found that learning is accelerated by choosing a, 
attained at the subsequent time step as the reward signal. The choice of 
az (rather than w,) is an instance of reward shaping that is justified in 
Supplementary Information, where we show that using a, as a reward 
still leads to a policy that optimizes the long-term gain in height. This 
property is a special case of our general result that a particular reward 
function or its time derivatives (of any order) yield the same optimal 
policy (Supplementary Information). Choosing w, as the reward fails 
to drive learning in the soaring problem, possibly because the velocities 
(and thus the rewards) are correlated across states and their temporal 
statistics strongly deviates from the Markovianity assumption in rein- 
forcement learning methods’. Velocity fluctuations in turbulent flow 
are long-correlated: that is, their correlation timescale is determined 
by the largest timescale of the flow (see, for instance, figure 9 of ref. *’), 
which is of the order of minutes in the atmosphere. Conversely, the 
correlation timescale of accelerations is controlled by the smallest 
timescale*!*? (the dissipation timescale in figure 7 of ref. *!). This is 
estimated to be only a fraction of a second, which is much smaller than 
the time interval between successive actions. Note that the previous 
experimental observations can be rationalized by the combination 
of the power-law spectrum of turbulent velocity fluctuations in the 
atmosphere and the extra factor of frequency squared in the spec- 
trum of acceleration versus velocity fluctuations*>. Finally, the glider’s 
experiences, represented as state—action-state-reward quadruplets, 
(St. At St415 14), Were cumulatively collected (over 15 days) into a set 
E using explorative behavioural policies. Learning is monitored by 
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Training time (minutes) 
Fig. 2 | Convergence of the learning algorithm and the learned strategy 
for navigating thermal plumes. a, The convergence of Q values during 
learning as measured by the standard deviation of the mean Q value versus 
training time in the field, obtained by bootstrapping from the experiences 
accumulated up to that point. b, The final learned policy. Each symbol 
corresponds to the best action (increasing or decreasing the bank angle 
jt by 15° or maintaining the same jz, as shown in the key on the right) to 
be taken when the glider observes a particular (a,, w) pair and is banked 
at 44. Combined symbols depict pairs of actions that are equally rewarding. 
A positive (negative) w corresponds to a higher vertical wind velocity on 
the left (right) wing of the glider and a positive (negative) 1 corresponds to 
turning right (left) with respect to the glider’s heading. 


bootstrapping the standard deviation of the Q values from E (Fig. 2a), 
calculated through value iteration methods (see Methods). 

The navigational strategy derived at the end of the training period 
is presented in Fig. 2b, which shows the actions deemed optimal for 
the 45 possible states. The rows corresponding to w=0 resemble the 
Reichmann rules**—a set of simple heuristics for soaring, which sug- 
gest a decrease (increase) in bank angle when the climb rate increases 
(decreases). Our strategy also gives a prescription for bank: for instance, 
when a, and w are both positive (top row in Fig. 2b)—that is, in a sit- 
uation when better lift is available diagonal to the glider’s heading—it 
is advantageous not to bank to the extreme but rather to maintain an 
intermediate value between —30° and —15°. Importantly, the learned 
leftward (rightward) bias in bank angle on encountering a posi- 
tive (negative) torque validates our estimation procedure for w. 

In Fig. 3a, we show a sample trajectory of a glider that used the navi- 
gational strategy in the field to remain aloft for about 12 min while spi- 
ralling to the height of low-lying clouds (see also Extended Data Fig. 1). 
On a day with strong atmospheric convection, the time spent aloft is 
limited only by visibility and the receiver's range as the glider soars 
higher or is constantly pushed away by the wind. A significant improve- 
ment in median climb rate of 0.35 m s~' was measured in the field by 
performing repeated 3-min trials over 5 days (Fig. 3b, Mann-Whitney 
U=429, neontrol = 37 Mstrategy = 49, P< 10° * two-sided). Notably, this 
value reflects a general improvement in performance averaged across 
widely variable conditions without controlling for the availability of 
nearby thermals. 

To examine possible advantages of larger gliders due to improved 
estimation of torque, we further analysed soaring performance for 
different wingspans (/). Although the naive expectation is that the 
signal-to-noise ratio in the estimate of w scales linearly with /, we show 
that the effects of atmospheric turbulence lead to a much weaker /!’° 
scaling (see Methods). Because testing our prediction would require 
a series of gliders with different wingspans, we turned to numerical 
simulations of the convective boundary layer, adapted to reflect our 
experimental set-up (Methods). Results shown in Fig. 3c, d are con- 
sistent with the predicted scaling. Intuitively, the weak 1/6 exponent 
arises because the improvement in estimation of the gradient is offset 
by the larger turbulent eddies, which only have a sweeping effect for 
smaller wingspans (that is, they do not rotate the glider but translate it, 
which does not affect the estimate of vertical velocity differences across 
its wings), and contribute to velocity differences across the wings as / 
increases. Our calculation yields an estimate of the signal-to-noise ratio 
of about 4 for typical experimental values; similar arguments for a, yield 
a signal-to-noise ratio of about 7. Experimental results, together with 
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Fig. 3 | Performance of the learned strategy and its dependence on the 
wingspan. a, A 12-min-long trajectory of the glider executing the learned 
strategy for navigating thermals in the field, coloured according to the 
vertical ground velocity at each instant. b, Experimentally measured climb 
rate for a control random policy (black dots) is compared against the 
learned strategy (red dots) over repeated 3-min trials in the field. Each dot 
represents the average climb rate in a single trial. To restrict the range of the 
axis, a few outliers are not shown. The limits on the y axis are from —2 ms 
to 1.5ms'. The orange line in the box plot shows the median, the extent 
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simulations and signal-to-noise ratio estimates, establish a, and w as 
robust navigational cues for thermal soaring. 

The real-world intricacies of soaring impose severe constraints on 
the complexity of the underlying models, reflecting a fundamental 
trade-off between learning speed and performance. Notably, the choice 
of a proper reward signal was crucial to make learning feasible with the 
limited samples available. Although reward shaping has received some 
attention in the machine learning community”, its relevance to animal 
behaviour remains poorly understood. We remark that our navigational 
strategy constitutes a set of general reactive rules, with no learning 
occurring during a particular thermal encounter. A soaring bird may 
use a model-based approach of constantly updating its estimate of the 
location of nearby thermals based on recent experience and visual cues. 
Still, the importance of vertical wind accelerations and torques for our 
policy suggests that they are likely to be useful for any other strategy; 
our methods of estimating them in a glider suggest that they should be 
accessible to birds as well. The hypothesis that birds use those mechan- 
ical cues while soaring can be tested in experiments. 

Finally, we note that single-thermal soaring is just one face of a mul- 
tifaceted question: how should a migrating bird or a cross-country 
glider fly among thermals over hundreds of kilometres for a quick, yet 
risk-averse, journey”°*®? This calls for the development of effective 
methods for identifying areas of strong updraft based on mechanical 
and visual cues. Such methods, coupled with our current work, would 
pave the way to a better understanding of how birds migrate and the 
development of autonomous vehicles that can fly for long distances and 
long periods with minimal energy cost. 


Online content 

Any methods, additional references, Nature Research reporting summaries, source 
data, statements of data availability and associated accession codes are available at 
https://doi.org/10.1038/s41586-018-0533-0. 
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METHODS 


Experimental set-up. A Parkzone Radian Pro fixed-wing plane of 2-m wing- 
span was equipped with an on-board Pixfalcon autonomous flight controller 
operating on custom-modified Arduplane firmware (http://www.ardupilot.org). 
The instrumentation available to the flight controller includes a GPS, compass, 
barometer, airspeed sensor and an inertial measurement unit. Measurements 
from multiple instruments are combined by an extended Kalman filter (EKF) to 
give an estimate of relevant quantities such as the altitude z, the sink rate with 
respect to the ground —u,, pitch @, bank angle yu and the airspeed V, at a rate of 
10 Hz (see Extended Data Fig. 2 for the definitions of the angles). Throughout the 
paper, we use jt > 0 when the glider is banked to the right and ¢ > 0 for the glider 
pitched with its nose above the horizontal plane. For a given desired pitch g and 
desired bank angle jug, the controller modulates the aileron and elevator control 
surfaces at 400 Hz by using a proportional-integral—derivative feedback control 
mechanism at a user-set timescale 7 (see Extended Data Table 1 for parameter 
values) such that: 


dp. 

rae b (1) 
du 

erie eo (2) 


The desired pitch is fixed during flight and can be used to indirectly modulate the 
angle of attack, a, which determines the airspeed and sink rate with respect to air 
of the glider (—v,). Actions of increasing, decreasing or keeping the same bank 
angle are taken in time steps of t, by changing jug such that ys increases linearly 
from 4; to jus in time interval t,: 


b+ T 





p(t) = w, + (Cu, — 4) (3) 


a 


Estimation of the vertical wind acceleration. The vertical wind acceleration is 
defined as: 


dw, 
a, = 
dt 


where uw and v are the velocities of the glider with respect to the ground and air 
respectively, and w is the wind velocity. Here, we have used the relation w=u — v. 
An estimate of u is obtained in a straightforward manner from the EKF, which 
combines the GPS and barometer readings to form the estimate. However, v; is 
confounded by various aerodynamic effects that affect it on timescales of a few 
seconds (Extended Data Fig. 3). Artificial accelerations introduced by these effects 
impair accurate estimation of the wind acceleration and thus alter the perceived 
state during decision-making and learning. Two effects strongly affect variations 
in vz: (1) sustained pitch oscillations with a period of a few seconds and varying 
amplitude, and (2) variations in angle of attack, which occur to compensate for 
the imbalance of lift and weight while rolling. In Supplementary Information, we 
present a detailed analysis of the longitudinal motions that affect the glider, sum- 
marized here for conciseness. Changes in v, can be approximated as: 





= <u, =») (4) 


Av, = — V(Aa— Ad) (5) 


where A denotes the deviation from their value during steady, level flight. We 
obtain A¢ directly from on-board measurements, whereas Aq can be approxi- 
mated for bank angle ju as: 





Ac (01) | ; | (6) 


COS [lL 


where a is the angle of attack at steady, level flight and a; is a parameter that 
depends on the geometry and the angle of incidence of the wing. The constant 
pre-factor (ao — a;) is inferred from experiments. Measurements of u, together 
with the estimate of Av, are now used to estimate the vertical wind velocity w, 
up to a constant term, which can be ignored as it does not affect a,. The vertical 
wind acceleration a, is then obtained by taking the derivative of w, and is 
further smoothed using an exponential smoothing kernel of timescale o, (Extended 
Data Fig. 4). 

Estimation of vertical wind velocity gradients across the wings. Spatial gradi- 
ents in the vertical wind velocity induce a roll-wise torque on the plane, which 
we estimate using the deviation of the measured bank angle from the expected 
bank angle. The total roll-wise torque on the plane has contributions from three 
sources: (1) the feedback control of the plane; (2) spatial gradients in the wind 
including turbulent fluctuations; and (3) roll-wise moments due to various 
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aerodynamic effects. Here, we follow an empirical approach: we note that the 
latter two contributions perturb the evolution of the bank angle from equation (2). 
We can then write an effective equation 





du My 7H 


+ W(t) + Wrero(t) (7) 
dt T 


where w(t) and Waero(t) are contributions to the roll-wise angular velocity due to 
the wind and aerodynamic effects, respectively. We empirically find four major 
contributions to Waero: (1) the dihedral effect, which is a stabilizing moment 
due to the effects of sideslip on a dihedral wing geometry; (2) the overbanking 
effect, which is a destabilizing moment that occurs during turns with small 
radii; (3) trim effects, which create a constant moment due to asymmetric lift 
on the two wings; and (4) a loss of rolling moment generated by the ailerons 
when rolling at low airspeeds. We quantify the contributions from the four 
effects and model their dependence on the bank angle (see Supplementary 
Information for more details on modelling and calibration). An estimate of 
w is then obtained as: 


rr dys 2 Hy 
dt Ve 





—Wrero (8) 


Finally, an exponential smoothing kernel is applied to obtain a smoothed w 
(Extended Data Fig. 5). 

Design of the learning module. The navigational component of the glider is mod- 
elled as a Markov decision process, closely following the implementation used in 
ref. 7°. The Markovian transitions are discretized in time into intervals of size t,. 
The state space consists of the possible values taken by a, w and js. To make the 
learning feasible within experimental constraints and to maintain interpretability, 
we use a simple tile coding scheme to discretize our state space: continuous values 
of a, and w are each discretized into three states (+, 0, —), partitioned by thresh- 
olds +K, and +K,, respectively. The thresholds are set at +0.8 times the standard 
deviation of a, and w. Because the width of the distributions of a, and w can vary 
across days, the data obtained on a particular day are normalized by the standard 
deviation calculated for that day. In effect, the filtration threshold to detect a signal 
against turbulent ‘noise’ is higher on days with more turbulence. The consequence 
is that the behaviour of the learned strategy could change across days, adapting to 
the recent statistics of the environment. The bank angle takes five possible values 
(0°, 15°, +30°), while the three possible actions allow for increasing, decreasing by 
15° or keeping the same bank angle. In summary, we have a total of 3 x 3 x 5=45 
states in the state space and three actions in the action space. 

We choose the local vertical wind acceleration a, obtained in the next time 

step as the reward function. The choice of a, as an appropriate reward signal is 
motivated by observations made in simulations from ref. *°. In Supplementary 
Information, we show that the obtained policy using a, as the reward function is 
equivalent to a policy that also maximizes the expected gain in height. 
Learning the strategy in the field. Data collected in the field are split into 
(s, a, s’, r) quadruplets containing the current state s, the current action a, the 
next state s’ and the obtained reward r, which are pooled to obtain the transition 
matrix T(s’ | s, a) and reward function R(s, a). Value iteration methods are used to 
estimate the Q values from T and R. The learning process is offline and off-policy; 
specifically, we begin training with a ‘random’ policy that takes the three possible 
actions with equal probability irrespective of the current state. This behavioural 
policy was used for 12 out of the 15 days of training. For the other days, a softmax 
policy’ with ‘temperature’ parameter set to 0.3 was used. For softmax training, the 
Q values were first estimated from the data obtained in the previous days and then 
normalized by the difference between the maximum and minimum Q values over 
the three possible actions at a particular state, as described in ref. 7°. 

Using a fixed, random policy as our behavioural policy has the disadvantage 
that it slows learning, as state—action pairs that rarely appear in the final policy are 
still sampled. On the other hand, it has the benefit that calibrating the parameters 
necessary for the unbiased measurement of a, and w (see Supplementary 
Information) is performed simultaneously with learning, which considerably 
reduces the number of days required in the field. Importantly, offline learning 
permits us to continuously monitor the variance of the estimated Q values by boot- 
strapping from the set E of accumulated (s, a, s’, r) quadruplets up to a particular 
point. Specifically, |E] samples are drawn with replacement from E, and Q values are 
obtained for each state—action pair by value iteration. The steps are repeated and 
the average of the bootstrapped standard deviations in Q over all the state-action 
pairs is used as a measure of learning progress, as shown in Fig. 2a. 

We expect certain symmetries in the transition matrix and the reward function, 
which we exploit to expedite our learning process. Particularly, we note that the 
Markov decision process is invariant to an inversion of sign in the bank angle 
jt — —p.. This transforms a state as (a,, w, LL) — (az, —w, — 1) and inverts the action 
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from that of increasing the bank angle to decreasing the bank angle and vice versa. 
We symmetrize T and R as 








+ = 
pym_ FP +T (9) 
2 
+ — 
pm Rk +R (10) 
p) 


where + and — denote the obtained values and those computed by applying the 
inverting transformation respectively. Finally, T°Y™ and R°”™ are used to obtain a 
symmetrized Q function, which results in a symmetric policy as shown in Fig. 2b. 
To conveniently obtain the policy that uses only a, (Fig. 3d), the above procedure 
is repeated with the threshold for w (K.,) set to infinity. 

Testing the performance of the learned policy in the field. To obtain the data 
shown in Fig. 3b, the glider is first sent autonomously to an arbitrary but fixed 
location 250 m above ground level. The learned policy for thermals is then turned 
on, and the mean climb rate (that is, the total height gained divided by the total 
time) is measured over a 3-min interval. To obtain the control data, the glider 
instead follows a random policy, which takes the three possible actions with equal 
probability. Trials in which we observe little to no atmospheric convection are 
filtered out by imposing a threshold on the standard deviation of the vertical wind 
velocity over the 3-min trial. In Extended Data Fig. 6, we show the distribution 
of the standard deviation in w, collected from about 240 3-min trials over 9 days. 
Trials below the threshold chosen as the 25th percentile mark (red dashed line) 
are not used for our analysis. 

Testing performance for different wingspans in simulations. Soaring perfor- 
mance is analysed in simulations similar to those developed in ref. 7° and adapted to 
reflect the constraints faced by our glider and the environments typically observed 
in the field. 

The atmospheric model consists of two components: (1) a kinematic model 
of turbulence that reproduces the statistics of wind velocity fluctuations in the 
convective atmospheric boundary layer; and (2) the positions, sizes and strengths 
of updrafts and downdrafts. The temporal and spatial statistics of the generated 
velocity field satisfy the Kolmogorov and Richardson laws”’ and the mean velocity 
profile in the convective boundary layer’, as described in the supplementary 
information of ref. 7°. Stationary updrafts and downdrafts of Gaussian shape are 
placed on a staggered lattice of spacing approximately 125 m on top of the fluctu- 
ating velocity field. Specifically, their contribution to the vertical wind velocity at 
position r is given by 

w= We Oe /eR) (11) 
where r’ is the location of the centre of the up(down)draft in the horizontal plane, 
W is its strength and R is its radius. W is drawn from a half-normal distribution of 
scale 1.5m s~', whereas the radius is drawn from a (positive) normal distribution 
of mean 40 m and deviation 10 m. Gaussian white noise of magnitude 0.2 ms“! is 
added as additional measurement noise. 

We assume that the glider is in mechanical equilibrium; the lift, drag and 
weight forces on the glider are balanced, except for centripetal forces while turn- 
ing. The parameters corresponding to the lift and drag curves and the (fixed) angle 
of attack are set such that the airspeed is V=8 ms! and the sink rate is 0.9 ms7! 
at zero bank angle, which match those measured for our glider in the field. Control 
over bank angle is similar to those imposed in the experiments: that is, the bank 
angle switches linearly between the angles 0°, +15°, +30° in a time interval t¢,, 
corresponding to the time step between actions. The glider’s trajectory and 
wind velocity readings are updated every 0.1 s. The vertical wind acceleration is 
derived assuming that the glider directly reads the local vertical wind velocity. The 
gradients in vertical wind velocity across the wings are estimated as the difference 


between the vertical wind velocities at the two ends of the wings. The readings 
are smoothed with exponential smoothing kernels; the smoothing parameters 
in experiments are chosen to coincide with those that yield the greatest height 
gain in simulations. 

Estimation of noise in gradient sensing due to atmospheric turbulence. The 
cues a, and w measure the gradients in the vertical wind velocity along and per- 
pendicular to the heading of the glider. Updrafts and downdrafts are relatively 
stable structures in a varying turbulent environment. Thermal detection through 
gradient sensing constitutes a discrimination problem of deciding whether a 
thermal is present or absent given the current a, and w. We estimate the magnitude 
of turbulent ‘noise’ that unavoidably accompanies gradient sensing. Intuitively, 
turbulent fluctuations in the atmospheric boundary layer (ABL) are made up of 
eddies of different length scales, with the largest being the size of the height of the 
ABL. Energy is transferred from larger, stronger eddies to smaller, weaker eddies, 
and eventually dissipates at the centimetre scale owing to viscosity in the bulk 
and owing to the boundary at the Earth's surface. In Supplementary Information, 
we present an explicit calculation of the signal-to-noise ratio for w estimation, 
taking into account the effect of turbulent eddies on the statistics of noise. Below, 
we give simple scaling arguments and refer to Supplementary Information for 
further details. 

A glider moving at an airspeed V and integrating over a timescale T averages a, 
over a length VT. For V much larger than the velocity scale of the eddies, which is 
typically the case, the decorrelation of wind velocities is due to the glider’s motion; 
the eddies themselves can be considered to be frozen in time. The magnitude of the 
spatial fluctuations across the eddy of this size scales according to the Richardson- 
Kolmogorov law” as (VT)'°. The mean gradient signal when going up the gradient 
scales as (VT); the resultant signal-to-noise ratio in a, scales as (Vie. 

Similar arguments are applicable for w measurements. In this case, the signal-to- 
noise ratio has an additional dependence on the wingspan /. The dominant 
contribution to the noise comes from eddies of size 1, whose strength scales as 1”. 
As the glider moves a distance VT, for 1<« VT, it traverses VT/I distinct eddies of 
size 1. Consequently, the noise is averaged out by a factor (VT/I)~ 7, corresponding 
to the VT/l independent measurements. Multiplying these two factors, the averaged 
noise is proportional to °°(VT)~ 7. As the mean gradient (that is, the signal) is 
approximately /, the signal-to-noise ratio is then proportional to ['/°(VT)”. 

From the above arguments and dimensional considerations, we get 
order-of-magnitude estimates of the signal-to-noise ratio (SNR) for a, and w 
estimation: 


wv2/3p2/371/3 
xX ———————————————— ee 





SNR, (12) 
. wR 
1/21 /271/671/3 
SNR, « i lls (13) 
wR 


where W is the strength of the thermal, R is its radius, w is the magnitude of turbu- 
lent vertical wind velocity fluctuations and L is the length scale of the ABL. For the 
signal-to-noise ratio estimates presented in the text, we use W=2 ms ', R=50m, 
1=2m, V=8ms_', T=3s, L=1 km. The values of Vand T correspond to the 
airspeed of the glider in experiments and the timescale between actions during 
learning respectively. 


Data availability 
The data that support the findings of this study are available from the correspond- 
ing author upon reasonable request. 


29. Frisch, U. Turbulence: The Legacy of A. N. Kolmogorov (Cambridge Univ. Press, 
Cambridge, 1995). 
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Extended Data Fig. 1 | Sample trajectories obtained in the field. The trajectories are coloured according to the instantaneous vertical 

The three-dimensional view and top view are shown of the glider’s ground velocity uz. The green (red) dot shows the start (end) point of the 
trajectory as it executes the learned strategy for thermals (labelled ‘s’) or trajectory. Trajectories sl, s2 and rl last for 3 min each, whereas s3 lasts for 
a random policy that takes actions with equal probability (labelled ‘r’). about 8 min. 
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Extended Data Fig. 2 | Force-body diagram of a glider. The forces on a 
glider and the definitions of the various angles that determine the glider’s 


motion. 
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Extended Data Fig. 3 | Modelling the longitudinal motion of the glider. 
a, Sample trajectory of a glider’s pitch and its vertical velocity with respect 
to ground (u,) in a case in which the feedback control over the pitch is 
reduced in order to exaggerate the pitch oscillations. The blue line shows 
the measured u,, and the orange line is u, obtained after subtracting the 
contributions from longitudinal motions of the glider (see Supplementary 
Information). b, The blue line shows the average change in u, when a 
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particular action is taken (labelled above each panel), averaged over n 3-s 
intervals. The 13 panels correspond to the 13 possible bank angle changes 
from the angles 0°, +15° and +30° by increasing, decreasing the bank 
angle by 15° or keeping the same angle. The green dashed line shows the 
prediction from the model whereas the orange line is the estimated w,. The 
axis on the right shows the averaged pitch (red dashed line). 
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Extended Data Fig. 4 | The estimated vertical wind acceleration with (blue line) and without (orange line) accounting for the glider’s 
is unbiased after accounting for the glider’s longitudinal motion. longitudinal motions. The axis on the right shows the airspeed (green 
a, The averaged vertical wind acceleration a, in units of its standard dashed line). b, Probability density functions (PDFs) of a, for the different 
deviation. a,, plotted as in Extended Data Fig. 3b, is shown in orange bank angle changes. The black dashed line shows the median. 
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Extended Data Fig. 5 | The estimated roll-wise torque is unbiased after 
accounting for the effects of feedback control and glider aerodynamics. 
a, The averaged evolution of the bank angle shown as in Extended Data 
Fig. 3b. The blue line shows the measured bank angle and the dashed 
orange line shows the best-fit line obtained from simultaneously fitting the 
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13 blue curves to the prediction (see Supplementary Information). 

b, PDFs of the roll-wise torque w (in units of its standard deviation) 
for the different bank angle changes. The black dashed line shows the 
median value. 
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Extended Data Fig. 6 | The distribution of the strength of vertical 
currents observed in the field. The root-mean-square vertical wind 
velocity measured in the field is pooled from about 240 3-min trials 
collected over 9 days. The dashed red line shows the threshold criterion 
imposed when measuring the performance of the strategy in the field 
(see Methods). 
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Extended Data Table 1 | Parameter values 


Label 


Pa 


Ao - Qj 


Description 


Wingspan of glider used in experiments 


Desired pitch 


Feedback control time scale 


Interval between actions (learning) 


Interval between actions (soaring) 


Net angle of attack (see eq. 6) 


Airspeed (typical) 


Dihedral effect timescale (typical) 


Overbanking effect timescale (typical) 


Trim bias (typical) 


Opposing roll timescale (typical) 


Thresholds for a, and w state estimation 


Exponential smoothing timescales for a, 


Exponential smoothing timescales for w 


Discount factor for RL implementation 
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Value 


2m 


a 


0.45s 


3S 


1.5s 


14° 


6 to 8 m/s 


14 to 30s 


< -20s 


-2 to +2°/s 


1.5to3s 


0.8 x std. dev 


8i,/3, 2t,/3 


t,, t,/4 


0.8 
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Abstract 


The flight of albatross (Diomedea exulans) takes advantage of the up-drift which is determined by 
the product of relative wind velocity and it’s gradient above the sea surface, to power its elegant 
(dynamic) flight over the ocean. Some of the complicated flight manoeuvres are determined by 
biological necessities. From its most basic flight manoeuvre a technical aerodynamically scheme 
can be derived which allows the design of a mechanical technical prototype of a wind generator. It 
is based on a rotational movement in combination with a skillful time dependent adjustment of 
the airfoil. Several technical possibilities are discussed and with one of these elaborated in some 
detail. The technology to be developed could be applied in highly asymmetric air streaming envi- 
ronment around high rise buildings, on mountain ridges and of course, also low above sea level 
and plains. Mathematical-technical conditions for power gain are discussed. The technology could, 
in principle, also be deployed to exploit velocity gradients in river water environment. The engi- 
neering challenges are significant and the presented work is just a blueprint for tasks to be ac- 
complished. 


Keywords 


Bionic, Biomimetic, Gradient Streaming, Wind (Water) Generating Power System (WGPS), 
Albatross-Type Technology, Kinetic Energy, Potential Energy 


1. Introduction 


Our present wind and water turbines are technically conceived to respond to a quite homogeneous streaming 
pattern. If the flow of air or water would be inhomogeneous, the resulting forces on airfoils or turbines would be 
inhomogeneous, resulting in non-ideal technical performance. In natural environments, however, inhomogene- 
ous streaming patterns are quite common. But our present aero- and hydro-mechanic technology has not yet de- 
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veloped strategies to exploit them efficiently. That this should be possible is convincingly demonstrated by a 
natural example: the seemingly effort-less flight of the albatross over the sea, where air up-drifts are essentially 
lacking. The flight manoeuvres of the albatross (Figure 1) have for a long time stimulated imagination of scien- 
tists and engineers. 

How is it possible to fly seemingly effortless without flapping wings over a sea surface which does not pro- 
vide wind up-drifts [1 |-[3]. It has been known for a long time that the albatross is taking advantage of mecha- 
nisms which also give rise to the Magnus effect, the same phenomenon which is giving a spinning tennis ball a 
characteristic deviation in its flight path [4]. The physics behind the albatross’ flight has well been analysed [5] 
and explains the power source for the up-drift force as the product between relative wind velocity V and the 
wind velocity gradient u' above the sea surface (vertical component z) according to the relation Figure 2: 


, oa V sin(y) (1) 

By entering the wind gradient the effect of a marked increase in airspeed appears which implies an increase in 
kinetic energy. The acceleration (dV/dt) results as product of the wind speed profile du/dz and the vertical air- 
speed component Vsin(y) which is the amount of increasing airspeed due to the finite wind step velocity, also 
known as climb rate (dz/df). 

When the albatross heads for an inclined path into the wind profile then he is able to accelerate in this direc- 
tion (acceleration = a). Such forward force depends, of course, on the albatross mass m (which can reach 10 kg) 
times the airspeed V derived with respect to the time. The forward impulse is correlated with the airspeed V 
(relative wind velocity registered by the flying bird), 





Figure 1. The Southern Royal Albatross (Diomedea epomophora) 
developed a highly specialized flight behaviour. He spends 95% of 
his life at sea. While circumnavigating the Antarctic continent on his 
wings, which exceed 3 m in width and are rarely used for flapping, he 
may travel 300 - 800 kilometres a day in an environment without air 
up-drifts (photos: H. Tributsch). 
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Figure 2. The wind velocity gradient above sea level is shown (left) 
together with vectorial diagrams describing the flight behaviour of the 
albatross (center and right). The up-drift correlates with the wind 


profile (adapted from |6}). 
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F=ma=m— (2) 


The pitching angle (y) and the heading angle (yw) depicted in Figure 2 determine the inclination of the alba- 
tross flight upwind and affect the magnitude and direction of the forward up-drift impulse, 


F =mu'Vsin(y) cos(7) cos(y) (3) 


Understanding now the physical basis of the force which the albatross is exploiting we have to turn to the 
complicated task of constructing a wind machine, able to extract energy from inhomogeneous air streams. 


2. Technical Challenges and the Results 


A biomimetic strategy aims at extracting technical information from biological systems to achieve as simple as 
possible a model of relevant properties. A biological system tends to solve different problems simultaneously [7]. 
Technology, in contrast, aims at well specialized technical solution. The most straight-forward strategy with re- 
spect to an understanding of the basic requirements for energy gain based on the albatross flight technology 
would be to look first at the most simple flight pattern used by the albatross Figure 3. 

When the albatross, after taking advantage of the special up-drift force which is activated, reaches the highest 
point of the periodical flight patterns, he can fly down into any direction while converting potential energy into 
kinetic energy and while harvesting additional energy from the wind gradient. 

However, while conserving a reasonably flight velocity with respect to the wind, he has then to turn against 
the direction of the wind gradient. This way he maximizes the product of relative wind velocity and wind veloc- 
ity gradient according to Equation (1). Now the up-drift force becomes active again and the albatross convert his 
kinetic energy into potential energy by gaining approximately 15 - 20 meters height. From here the albatross can 
again reorient the flight direction while taking advantage from the gain in potential energy and from the 
down-wind conditions. 

With respect to the most simple flight manoeuvre-prototype for technology the following approach would ap- 
pear to be the most promising 1) the simplest albatross flight pattern should be taken as a first model 2) a rota- 
tional movement should be achieved instead of a translational one 3) the power gain should be extracted 





Figure 3. Albatross flight manoeuvres of dynamic soaring into any overall direction. (a) 
cross-wind snake manoeuvre (b) upwind snake manoeuvre (c) circumnavigation 
manoeuvre (d) snaking hover manoeuvre. The flight patterns are indicated, besides of 
the wind gradients, with which the albatross is interacting. 
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periodically in the wind gradient during the rotating movement. Both, the gained potential energy and the ex- 
tracted energy from the downwind may contribute. In terms of biomimetics engineering this corresponds to the 
simplest abstraction of the biological engineering example. However, more details have to be learned about the 
albatross and his flight. 


2.1. How Did Evolution Approach the Technical Problems of Albatross Flight? 


It is well known that in ordinary flight technology via airfoil the velocity difference of air between the upper 
more curved foil surface and the lower less curved surface is a critical factor in producing (upward) lift [8]. The 
difference in air pressure generated is giving rise to the responsible force. Above a sea surface an air velocity 
gradient is already present and this is what the albatross is exploiting. Consequently, a much less curved wing 
profile compared to those of other large birds [9] should be expected for the albatross. This is actually observed 
as Figure 4 shows for a member of the albatross family. 

With such an only moderately bent wing profile the albatross extracts energy from the streaming air by en- 
gaging in its characteristic flight pattern. Additional to the four flight patterns (Figure 3) which were illustrated 
before, the simplest one is shown in Figure 5. 

As Figure 5 visualizes the albatross starts by entering the air velocity gradient with a sufficiently high relative 
wind velocity close to the ocean surface (A). He enters the air velocity gradient with a velocity of maybe 25 m/s 
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Figure 4. Wing profile characteristics of different birds (adapted from [8}). 
The albatross travels with the least curved wing profile. It explains its clum- 
siness during take-off and during flapping flight, but is a key to dynamic soar- 
ing. 
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Figure 5. Scheme of most basic circular dynamic soaring trajectory of the albatross. Shown are different projections of the 
flight manoeuvres (top right: seen from above; top bottom: seen parallel to the sea surface). 


G. Pfeifhofer, H. Tributsch 


and rises to a height of approximately 20 meters while reducing its speed (B-C). With the potential energy 
gained the albatross flies a loop downwards (D-E) and again turns into the wind gradient near to the wave crests 
to repeat the cycle. By converting potential energy and gaining additional energy from the downwind he in- 
creases again the kinetic energy for entering the wind velocity gradient (E). During the albatross flight the final 
position (E) is separated from the starting point (A) by a certain distance since the albatross migrates in the 
search of food. 

As explained in Figure 6, a technical realization on this manoeuvre could function more or less identical, 
representing however a circular albatross manoeuvre without the need for local displacement. The consequence 
would be a circular manoeuvre during which periodically gained potential energy as well as downwind energy is 
changed into kinetic energy. The kinetic energy is needed for generating the up-drift and in a technical system 
would be the source of energy gain out of the velocity gradient. Figure 6 also explains, how in a rotational wind 
energy harvesting system foil- and wind velocities add in different positions within the wind velocity gradient. 
The depicted manoeuvre appears to be the manoeuvre closest to the albatross example. But technically it is not 
the only possible one and the engineering options should therefore be further analysed. 


2.2. Technical Realization of the Albatross Flight Model 


As in part outlined in a master thesis [10] in some detail basically three different strategies are imaginable for a 
technical approach 1) a linear system 2) a rotational system and 3) an intermediate system with respect to the 
path of the airfoil. In all these cases energy gain from the wind gradient appears to be possible but the boundary 
conditions are somewhat different, as depicted in Figure 7. 

During the flight of the albatross the up-drift is proportional to the relative wind velocity V. This relative wind 
velocity adds the velocity of the air (wind) and the velocity of the albatross when flying into the wind gradient. 
In a technical system the albatross will be replaced by an artificial airfoil. This airfoil develops different velocity 
in different technical set ups. 

A foil which is periodically moving up and down (case a) in the wind velocity gradient contributes zero ve- 
locity to the relative wind velocity V (= Vying + Voi). For a technical system where a foil is rotating into the wind 
the foil velocity is periodically changing in dependence of the position within the wind gradient. In case b the 
relative wind velocity will be the sum of the wind velocity plus a varying foil velocity. In case c two linear 
movements are superposed to yield a more complex foil trajectory with a corresponding more complex foil 


Vwind 


ee tt 


Vrel = Vioil - Vwind U (du/dz) 


lift force = Vrel * uo 


Figure 6. Characterization of relative wind velocity u (or Vying) and the wind velocity gradient u' (du/dz) for understanding 
them with respect to a rotationally operating wind machine. The wind velocity uv within the wind velocity gradient is equally 
termed V,,;nq 1n order to relate it to Vio; for explaining the relative wind velocity V,.). 
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Figure 7. Three different strategies for a technical approach to albatross-type flight (a) linear (b) rotational (c) intermediate 
combined system. 


velocity which has to be added to the wind velocity within the velocity gradient. 

It should be pointed out that the flight velocity of the albatross into the wind gradient changes as well in in- 
dependence of the height above the sea surface. The albatross will fly with high velocity into the wind gradient 
close to the sea surface and then be pushed up while losing airspeed since kinetic energy is being converted into 
potential energy. High above the sea level where the full wind velocity is reached the albatross is even pushed 
forward by the wind before he is flying down towards the sea level while converting the potential energy of 
height into kinetic energy. 

1) Linear wind gradient power system (WGPS) 

In a linear WGPS Figure 7(a) an airfoil modeled according to the wing profile of the albatross is moving up 
and down changing its position within the air velocity gradient. It has been mentioned (Figure 4) that the alba- 
tross has an only weakly asymmetric wing profile compared to other large birds. But some asymmetry is still 
present with a consequence that an up-drift as known from airfoil technology and described by the lift coeffi- 
cient c; is present [11]. But there are two differences—one is that the wind velocity gradient adds velocity in 
such a way that the velocity of the upper foil surface will be increased compared to the lower surface. This will 
add to the up-drift force by in principle increasing the c; value. Also normal aerodynamic up-drift (based on the 
characteristic wing profile of the albatross (Figure 4)) will occur. 

The second difference will be that the airfoil when exposed to the maximum air velocity, within the gradient, 
will have been manipulated so that the potential energy can be harvested. This means that while changing its in- 
clination (pitching angle y) the air foil has to be returned to a lower position while providing energy which 
should be extractable in the form of electricity. 

A simple mechanism which is accomplishing this task is shown in Figure 8. The airfoil has to achieve this 
function by changing its orientation with respect of the streaming direction of air. This can be achieved with e.g. 
an electro-mechanical control unit and the consequence would be that the airfoil would move up and down (by a 
short circular reversal path at the bottom and the top). This periodic linear movement is able to provide energy 
which e.g. can be transformed into rotational movement or directly converted into electricity via a linear elec- 
tro-magnetic generator. 

2) Rotational wind gradient power system (WGPS) 

The rotational functional WGPS may be operated via a model structure as depicted in Figure 9. Two or more 
airfoils are mounted on a cantilever type arm which can be rotated. The power generated via rotating movements 
of the airfoils can be extracted either mechanically via a gear or electro-magnetically via an appropriate rotating 
induction generator. It is important to note that during rotation of the airfoils in the wind gradient the inclination 
of the wing profile has to be modified. Such a periodic adjustment of the wing inclination appears to be a major 
engineering challenge but several technologies are available. Also a more complicated change of the wing 
structure and profile (as observed in wing or fin structures of living animals) should be considered as aerody- 
namic strategy towards WGPS. 

The adjustment of airfoils (rotor blades) during flight is a well-established necessity for helicopter flight 
technology. This is achieved via a wobbling disc technology which has reached a high level of engineering 
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Figure 8. Linear WGP-System, schematic representation. Shown are the mechanical-electronic ele- 
ments (left) and the positioning of the air-foils (right). 
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Figure 9. Rotational WGP-System, schematic representation. The mechanical set up is shown to the 
left, the air foil position to the right. 


perfection. In principle such a technology could be adapted for WGP-Systems. But simpler alternatives maybe 
tested before in more basic models systems. Such alternatives have been considered and are explained in a 
sketch depicted in Figure 10. The rotating WGP-System operates via an energy yielding axis which can be used 
at the same time for steering the wing inclination in the desired wing positions during rotation. The rotating axis 
drives a bevel gear unit which powers a steering system. The steering system consists of two segmented 
rack-wheels which, while interacting periodically, turn the air foil into the desired position. 

3) Other two-dimensional movements of WGPS 

Among several other possibilities a two dimensional movement of the energy harvesting WGPS has been 


G. Pfeifhofer, H. Tributsch 


considered. It activates linear up and down movements in combination with a horizontal movement of the basis 
of the structure. As seen from picture Figure 7(c), a trajectory of the wing in a number eight-form is resulting. 
In this case c horizontal movement is basically adjusting the airfoil. Both, the most simple structure of such a set 
up as well as mechanical forces acting upon the wings in their extreme positions are visualized in Figure 11. 

More complicated two dimensional patterns of air foil trajectories are imaginable, considering that the alba- 
tross did evolve much more sophisticated flight patterns compared to that studied in this publication (compared 
to Figure 3). 

WGP-Systems based on periodical change of air foil structure are also imaginable. In order to achieve a con- 
version of periodically generated potential and kinetic energy a well-tailored periodical change of form and pro- 
file of the air-foil could also be an attractive alternative. In principle a change of the wing profile could be 


Detail Detail 


Segmented rack-wheels alternate in cantact 





Figure 10. Example for steering system for wing inclination in a rotationally working albatross-type 
wind machine. 
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Figure 11. Combinatorial movement WGP-System, schematic representation. The wind velocity 
gradient (left) as well as the mechanical set up for down and up-drift movements (right) are pre- 
sented. 
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induced via wireless triggered signals which periodically activate a mechanical mechanism reshaping the foil 
profile or the airfoil can be forced to change shape due to bending while rotating. A simple alternative has been 
studied according to Figure 12. 


2.3. Preliminary Experimental Tests 


It turned out to be a mayor challenge to generate a well-defined wind velocity gradient for testing of simple 
models of airfoils based on the albatross technology. To generate an adequate gradient-like air stream [12] [13] 
it had to be manipulated in a wind tunnel to change its profile from a homogeneous to an inhomogeneous (ex- 
ponential-like or gradient-like) one. This can be accomplished by an entrance region Figure 13 with a well-de- 
fined roughness of installed surface structure, combined with vortex generators to shorten the distance to the re- 
gion where the airstream is able to develop into a gradient form. 

Further studies of the wind tunnel are necessary to get a higher quality of the gradient-shaped streaming be- 
haviour. But first tests have shown that the rotating version of the albatross-type airfoil is indeed able to rotate. 
This is a first step towards a more profound investigation of albatross-type aerodynamic technology. Different 
airfoil models will have to be tested and energy recovery explored. Gradually more experience will be gained. A 
look at the development history of classical wind machines suggests that it will take some time until a reasona- 
bly high technological standard will be reached. 


3. Theory and Discussion (Conclusion) 


Our analysis up to now evidences that the albatross flight technology is based on quite complex physical-engi- 
neering boundary conditions. However the flight of albatross already looks back at 40 millions of years of evo- 
lution and the technology in biology has demonstrated a high standard. The albatross can fly for days without 
flapping wings. 

In order to understand the physical principle and in order to understand the engineering challenges mathe- 
matical formulas expressing the main parameters need to be developed. In a very inspiring publication J. P. 
Barnes [5] has attempted to derive the essential underlying flight physics of the albatross. During this attempt 
the interchange of kinetic and potential energy has been considered together with the up-drift generating wind 
velocity gradient and the relevance of drag. An Equation (4) has been derived which describes the time deriva- 
tive of the energy divided by the gravitational acceleration (unit mass) of the flying system (albatross): 
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Figure 12. Modified rotational WGP-System with changeable airfoil shape, schematic representa- 


tion. 
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Figure 13. Set up for the investigation of the behaviour of a rotational albatross-typ airfoil 
model within a wind gradient airstream. (a) wind tunnel: radial fan with near 1600 m°/h (b) 
model with expanded polystyrene airfoils (c) entrance region (left) with model (right): gen- 
eration of gradient-shaped wind profile by graded obstacles and vortex generators. 
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This Equation (4) describes the time derivative of the energy that means the change of energy with time for an 
airfoil with given inclination, pitching angle y, heading angle Y, gradient wu’, relative wind velocity V, lift L and 
drag D. The quantity n, describes the normal load factor (n, = lift/weight). The second term determines the 1m- 
portance of high lift-to-drag ratio under load. For the technical system we describe in our analysis these parame- 
ters will change during technical operation. For the rotating system the parameters will change in dependence of 
the rotational angle (a). In case of linear or combined rotational and linear systems they will also change in de- 
pendence of a displacement coordinate (x). For the most general case Equation (4) can accordingly be changed 
considering the variables a and x 


D(a, x) 


g L(a,x) 


Ae) (as) LE ny (as) onr{as)ooso(trs) =n) Jv(ea (5) 


For a technical energy generator to work it has to yield a positive energy output during a full technical circle. 
For this to happen an integral of Equation (5) has to be taken for all angles a (O --- 360°) and for all displace- 
ments x (0 -++ Xmax and back to zero). Such integral would have the following form and meaning: 

>0 positive energy output 


max 360 7 
| ia 0 functioning without energy output (6) 
0 O 


<0 not functioning 
If this integral yields zero (=0) the energy of the system would be conserved, no energy output would be 
available. If the integral would be smaller zero (<0) energy would be lost and the system would cease to func- 
tion. What engineering would have to accomplish is an integral which is yielding a positive value (>0) that is net 
energy output during one complete circle. 
As seen from Equation (5) several parameters are involved which contribute to energy harvesting (first term 
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Figure 14. Sites for wind- and water velocity gradients which can be considered for albatross-type technology. 


of Equation (4)) or energy loss (second term of Equation (4)). Of course the situation is quite complex and the 
outcome will significantly depend on the nature of the technical model system. 


4. Outlook 


Where can albatross-type aerodynamic technology be applied? There are numerous sites, apart from the first 15 
meters above the ocean water and above plains where wind velocities could be strongly inhomogeneous. Such 
conditions exist or can be created artificially, for example between high rise buildings Figure 14 in modern cit- 
ies. Here, the proposed technology could be implemented to provide energy for skyscrapers. The wind condi- 
tions, while being inhomogeneous, may be quite more stable and above all the wind speed is similar to that at 
wind power plants in such height. This could lead to effective solutions in wind energy systems, where a uni- 
form wind velocity cannot be provided. A further advantage is the already existing or to be constructed high-rise 
building profile, which is generating the wind velocity profile. By using the architectural patterns of high rise 
buildings, a cheaper, more practical and more efficient wind technology may be achieved. Wind power plants 
could be integrated into architectural design. 

Another challenge is energy harvesting via WGPS from wind streaming around mountain ridges Figure 14. 
Where high rise traditional wind turbines are not desired due to the visual impact on the landscape, wind energy 
could be generated via only 10 meter high, barely visible alternative installations based on the albatross principle. 
However, like with all new technical ideas, significant technical development work is still required. 

WGPS could be installed together with off-shore wind power plants to take advantage of the gradients of 
wind velocity at lower heights (0 - 15 m). While the rotor blades of the power plants cover the homogeneous 
wind speed at higher altitude, the wind energy systems based on gradient could recover the wind energy closer 
to the sea level. This combination will reduce the high amount of space required, compared to when such sys- 
tems are set up separately. This way it is possible to integrate such systems into already existing wind power 
parks. 

The aggregation state of a medium needs not automatically be gas (air), it can also be liquid (water) when the 
velocity gradient develops as a consequence of friction from neighboring surfaces. The albatross technology is 
expected to work also when implemented in rivers, Figure 14. The streaming of water in rivers is known to be 
highly inhomogeneous and it is to be expected that hydro technology may be developed, which can largely be 
hidden under water. 

Velocity gradients of air and water are quite common in nature and wind machines, which do not depend on 
homogeneous velocity conditions expected to have a technological future. 
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ABSTRACT 


Albatrosses fly long distances over the Southern Ocean, even around the world, almost without flapping 
their wings; this has raised interest in how they perform such a feat. On a cruise to the South Atlantic I 
observed albatrosses soaring in a characteristic swooping zigzag flight that appears to combine two soar- 
ing techniques to gain energy—wind-shear soaring (dynamic soaring) using the vertical gradient of wind 
velocity and wave-slope soaring using updrafts over waves. The observed characteristic swooping flight is 
shown in a new illustration and interpreted in terms of the two soaring techniques. The energy gain esti- 
mated for “typical conditions” in the Southern Ocean suggests that wind-shear soaring provides around 
80-90% of the total energy required for sustained soaring. A much smaller percentage is provided by 
wind shear in light winds and significant swell when wave-slope soaring dominates. A simple dynamical 
model of wind-shear soaring is proposed based on the concept of a bird flying across a sharp wind-shear 
layer as first described by Lord Rayleigh in 1883 and later developed with Pennycuick’s (2002) descrip- 
tion of albatrosses “gust soaring.” In gust soaring a bird exploits structures in the wind field, such as sep- 
arated boundary layers and eddies in the lee of wave crests, to obtain energy by climbing headed upwind 
and descending headed downwind across a thin wind-shear layer. Benefits of the model are that it is sim- 
ple to understand, it captures the essential dynamics of wind-shear soaring, and it provides reasonable 
estimates of the minimum wind shear required for travel velocity in different directions with respect to 
the wind. Travel velocities, given in a travel velocity polar diagram, can be combined with tacking to fly in 


an upwind direction faster than the wind speed located at the top of the wind-shear layer. 
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1. Introduction 


Albatrosses fly long distances over the Southern Ocean, even 
around the world, almost without flapping their wings (Croxall 
et al., 2005; Safina, 2002, 2007). How they soar for such long times 
has intrigued observers, who have often speculated as to how an 
albatross can extract energy from the wind. Two theories have 
been proposed to explain how the birds fly without flapping their 
wings. The first, which has gained prominence, proposes that an 
albatross uses wind shear, the increase in wind speed with height 
above the ocean surface, to gain energy (wind-shear soaring). The 
second theory proposes that an albatross uses updrafts caused by 
wind blowing over waves to gain energy (wave-slope soaring). 

Wind blowing over waves has both wind shear and vertical mo- 
tions, so the individual effects are not easily separated. Wind inter- 
acting with waves often contains structures such as gusts, lee 
eddies and rolls, which have updrafts and wind shear. Since wind 
generates waves, given sufficient time wind waves coexist with 
wind shear in the open ocean. In the Southern Ocean where most 
albatrosses soar there are strong winds and large waves, including 





* Tel.: +1 508 289 2546; fax: +1 508 457 2163. 
E-mail address: prichardson@whoi.edu 


0079-6611/$ - see front matter © 2010 Elsevier Ltd. All rights reserved. 
doi:10.1016/j.pocean.2010.08.001 


both locally-generated wind waves and swell waves generated 
elsewhere. All mathematical model studies of albatross flight in 
wind shear assume horizontal winds and no waves thus excluding 
the effect of wave-slope soaring and lee eddies which the birds ap- 
pear to use effectively. In addition, most models assume either a 
linear, logarithmic, or exponential vertical profile of average wind, 
which can be very different from the structure of the instantaneous 
wind field as it interacts with waves. Even with these simplifying 
assumptions, the resulting aerodynamical differential equations 
describing the accelerated twisting, turning, swooping flight of 
albatrosses in wind shear are very complex (Lissaman, 2005; Sachs, 
2005), which makes it difficult to understand the details and rele- 
vant dynamics of their flight. 

The approach here is to use the characteristics of an observed 
swoop to estimate the energy gained from each soaring technique 
and to develop a simple dynamical model of wind-shear soaring 
based on Rayleigh’s (1883) concept of a bird soaring across a sharp 
wind-shear layer, on Pennycuick’s (2002) observations and descrip- 
tion of albatrosses “gust soaring” across thin wind-shear layers in 
the lee of waves, and on the aerodynamical equations of motion 
(Lissaman, 2005). The modeled flight pattern is referred to here as 
the Rayleigh cycle because he was first to describe the concept of 
wind-shear soaring. The model provides a relatively easy way to 


Please cite this article in press as: Richardson, P.L. How do albatrosses fly around the world without flapping their wings?. Prog. Oceanogr. (2010), 


doi:10.1016/j.pocean.2010.08.001 





2 P.L. Richardson/ Progress in Oceanography xxx (2010) xxx-xxx 


understand the essential dynamics of wind-shear soaring and pro- 
vides predictions of soaring travel velocities, which agree well with 
observations of albatross flight and more complicated simulations 
(Lissaman, 2005; Sachs, 2005). The Rayleigh cycle, which uses two 
homogenous wind layers, is the most efficient way for a bird to gain 
energy from a wind profile and thus indicates the minimum amount 
of wind speed that can support sustained soaring or, more precisely, 
energy-neutral flight. 

When an albatross flies in wind, the bird’s airspeed is different 
from its ground speed. This should be kept in mind because air- 
speed, and not ground speed, is the quantity most relevant to fly- 
ing. Aerodynamic forces on a bird depend on its airspeed not 
ground speed. To understand this, imagine trying to fly horizon- 
tally in a downwind direction with a large ground speed but 
near-zero airspeed (due to wind shear). In this situation if ground 
speed were used to calculate kinetic energy, there would appear to 
be sufficient energy to support flight, but without any airspeed, 
down you would go in a Stall. The use of airspeed and ground speed 
leads to different conclusions about where kinetic energy is gained 
in wind-shear soaring. An increase of airspeed comes from cross- 
ing the wind-shear layer. Most increase of ground speed occurs 
as a bird turns from a direction headed upwind to downwind. This 
difference complicates interpretations of energy conservation in 
soaring and has led to seemingly contradictory conclusions. 

Gravity and drag relentlessly force a gliding albatross downward 
through the air. In balanced flight the bird’s sinking velocity through 
the air represents the bird’s rate of energy loss. In order to continu- 
ously soar, an albatross must extract sufficient energy from the 
atmosphere to balance the loss due to drag. No strong thermals exist 
in the high southern latitudes of the Southern Ocean where most 
albatrosses soar, so they extract energy from the wind using wind- 
shear soaring (also called dynamic soaring) and wave-slope soaring, 
which is somewhat similar to using the updrafts of wind blowing 
over a ridge. Albatrosses probably also exploit energy gained from 
gusts, updrafts and wind shear associated with turbulent winds. 
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2. Observations 


I observed albatrosses soaring during a research cruise to the 
South Atlantic off Cape Town, South Africa in September 1997. It 
was surprising and delightful to see them almost magically soar up- 
wind in wind speeds of 5-10 m/s (10-20 knots). The albatrosses 
flew in a characteristic and distinctive flight pattern consisting of a 
swooping motion where each swoop was tightly coupled to a wave 
crest (Fig. 1). Each swoop began with a fast flight parallel to and just 
above the windward side of a wave. This was followed by a turn into 
the wind and climb of around 10-15 m, followed by a downwind 
descent towards another wave and a turn parallel to the wave. The 
typical time to complete a swoop was around 10s. These observa- 
tions are largely in accord with previous studies (Alerstam et al., 
1993; Idrac, 1925, 1931; Pennycuick, 1982). The close coupling be- 
tween the swoops and waves suggests that wave-induced features 
of the wind field are important for sustained soaring, but these are 
often neglected in models of soaring flight. 

I was not able to find a good schematic showing this pattern of 
albatross soaring, so I have tried to develop one. Fig. 1 shows the 
zigzag swooping flight pattern in relationship to the waves. Each 
swoop is coupled to a wave as observed, although an Albatross 
can cross over intervening waves before pulling up over a wave. 
Wind shear and updrafts and downdrafts are indicated schemati- 
cally in Fig. 1, and under certain conditions the wind field can look 
somewhat like that shown. Often, however, the wind speed at a 
height of 10 m is only slightly faster than the wave phase speed. 
In that case wind vectors viewed in the frame moving with the 
waves reverse direction below a critical layer where the wind 
speed equals the wave speed. Structures such as lee eddies are ig- 
nored in Fig. 1 for simplicity but can be important as mentioned la- 
ter in describing details of albatross soaring. 

The goal of the paper is to interpret the observed flight pattern 
in terms of wind-shear soaring and wave-slope soaring as de- 
scribed below. 
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Fig. 1. Schematic summary of the zigzag swooping flight pattern of an albatross soaring over waves observed during a cruise to the South Atlantic. The swooping motion is 
shown relative to the waves, which are moving downwind. Each climb is upwind and each descent is downwind since the waves are going downwind, although the 
downwind component is difficult to show in the figure and looks almost parallel to the wave crest. The average direction of flight has an upwind component. Schematic waves 
are uniform for simplicity; real ocean waves are much more complicated. Regions of updraft and downdraft due to wind blowing over waves are indicated schematically. The 
wave phase speed was not subtracted from the wind speed in this diagram. Simplified vectors of typical average wind velocity over the ocean surface are indicated in the right 
part of the figure. Most of the vertical gradient of wind velocity (wind shear) is located in a thin boundary layer near the ocean surface. 
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Fig. 2. Vertical (logarithmic) profile of average wind speed over the ocean (after 
Sachs, 2005). An assumed reference velocity of 10 m/s is located at a height of 10 m. 
Note that most (72%) of the increase of wind speed within the lowest 10 m of the 
profile is located in a ~2m thick wind-shear boundary layer near the surface 
(shaded layer). 


3. Wind-shear soaring 


Average wind velocity generally increases with height from 
near zero velocity at the level of the ocean surface. The largest ver- 
tical gradient of wind velocity (largest wind shear) is located in a 
thin boundary layer located within around 2 m of the water surface 
(Fig. 2). Most of the increase of wind speed in an average wind pro- 
file is located in this thin wind-shear boundary layer near the sur- 
face. However, the structure of the wind field near the ocean 
surface is complicated by the presence of waves. Strong wind flow- 
ing over sharp-crested and breaking waves separates from the 
wave crest forming an area of weaker wind or a lee eddy just 
downwind of the wave crest (Fig. 3) as described by Pennycuick 
(2002). Located above this region of weak wind is a thin wind- 
shear region, a wind-shear boundary layer that has separated from 
the upwind wave crest, and above that a layer of stronger wind and 
reduced wind shear. 





3.1. Gust soaring and Rayleigh cycle 


Pennycuick (2002) proposed that albatrosses take advantage of 
the strong wind shear located between these two layers downwind 
of sharp-crested waves in order to gain energy from the wind in 
what he calls ‘gust soaring,” which is a special case of more general 
wind-shear soaring. A wind “gust” usually refers to a temporal var- 
iation of wind speed, but Pennycuick uses the term to mean the ra- 
pid increase of wind speed encountered by a bird as it climbs 
across the thin wind-shear layer located above a lee eddy. Gust 
soaring can be understood by using a two-layer approximation 
first described by Rayleigh (1883) in which a lower layer has zero 
wind speed and an upper layer has a uniform wind speed of 5 m/s 
(for example) (Fig. 4). An albatross flying at a typical airspeed of 
15 m/s in an upwind direction in the lower layer pulls up a short 
distance into the upper layer encountering a “gust” of 5 m/s, which 
increases the bird’s airspeed to 20 m/s and adds a pulse of kinetic 
energy. If the albatross now descends back into the lower layer 
again without changing direction, the bird’s airspeed would de- 
crease back to 15 m/s, and there would be no net gain in kinetic en- 
ergy to balance loss due to drag. 

Albatross flight reveals that the trick of wind-shear soaring is to 
climb headed upwind, to then turn downwind, and to descend 
headed downwind. After rising into the upper layer and increasing 
airspeed to 20 m/s, the bird banks and turns downwind to fly in 
the opposite direction. If we ignore drag, which is small for an alba- 
tross, then just before descending, the bird’s airspeed is 20 m/s ina 
downwind direction and the (tail) wind speed is 5 m/s also in a 
downwind direction. Thus, the bird’s speed over the lower layer 
(ground speed) is 25 m/s, and when the bird descends into the lower 
layer airspeed increases to 25 m/s, adding another pulse of airspeed 
and kinetic energy. In order to continue gaining energy the bird 
could bank and turn toward the wind direction and climb up into 
the upper layer again. It is apparent that the bird could maximize 
the rate of gain of airspeed and kinetic energy by increasing the fre- 
quency of swoops. Several things tend to limit the airspeed of a real 
albatross: increased sinking rate with faster airspeeds and steeply 
banked turns, the physical strength of the bird as the aerodynamic 
force on its wings increases, and smaller wind shear compared to 
the step-like increase used in the example. 

Some radio controlled (RC) glider pilots have recently used 
wind shear caused by strong winds blowing over ridges to fly 


Wind-Shear Layer 
(Separated 
Boundary Layer) 


— Lee Eddy 


Fig. 3. Schematic of an albatross “gust soaring” (after Pennycuick, 2002). Starting in a lee eddy (or separation bubble) located downwind of a sharp-crested wave a bird climbs 
up through a thin wind-shear layer (separated boundary layer) that has detached from the wave crest. On crossing the wind-shear layer, the bird’s airspeed abruptly 
increases, and the bird experiences a “gust.” The increase in airspeed can be used to climb up to heights of 10-15 m by trading airspeed (kinetic energy) for height (potential 
energy). A lee eddy is a region of closed streamlines with clockwise circulation in this figure. 
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Wind-Shear 
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No Wind 





20 m/s (Albatross airspeed) 


Fig. 4. Idealized example of the airspeeds of a dragless albatross gust soaring through a thin wind-shear layer, which is assumed to consist of an increase in wind speed from 
zero below the layer to 5 m/s above. It shows how an albatross could gust soar in the region downwind of a wave crest as indicated in Fig. 3. This schematic is based on the 
written description of Rayleigh (1883) who first suggested that a bird could continuously soar in nearly-circular flight on an inclined plane that crosses a thin wind-shear 
layer. Starting in the lower layer with an airspeed 15 m/s a bird climbs upwind a short distance vertically across the wind-shear layer, which increases airspeed to 20 m/s. The 
bird then turns and flies downwind with the same airspeed of 20 m/s. During the turn, ground speed increases to 25 m/s downwind and consists of the bird’s 20 m/s airspeed 
plus (tail) wind speed of 5 m/s. The bird descends downwind a short distance vertically across the wind-shear layer, which increases airspeed to 25 m/s. The bird turns 
upwind flying with an airspeed of 25 m/s. Thus one swoop through the wind-shear layer increases airspeed from 15 m/s to 25 m/s (two times the 5 m/s wind speed increase). 
By descending upwind (dashed line) the bird’s airspeed would have decreased from 20 m/s back to 15 m/s with no net gain in airspeed. 


gliders at surprisingly fast speeds. By repeatedly climbing upwind 
and descending downwind through the separated wind-shear 
layer located downwind of a ridge, RC gliders have achieved 
airspeeds over 150 m/s in gust soaring (http://www.youtube.com/ 
watch?v=ViOhrjqU151). The duration of a fast glider swoop is 
around 3s, which implies a lift force of around 30 times gravity 
(30g), much too large for an albatross. These exceedingly fast glider 
speeds clearly demonstrate how effective wind-shear gust soaring 
can be given the right circumstances. A Google search under “dy- 
namic soaring” provides many relevant websites that discuss this 
topic. Wind-shear soaring can also be performed with horizontal 
wind shear. 

Since an albatross crosses the shear layer twice during a typical 
10s swoop, the time scale of the 5 m/s velocity increase and the 
corresponding energy pulse due to crossing the shear layer once 
is 5s. This is a much shorter time than the ~40s required for an 
equivalent velocity increase estimated for wave-slope soaring. 

An albatross wind-shear soaring can and often does convert the 
increase in airspeed (kinetic energy) to a gain in height (potential 
energy) similar to the motion of a pendulum. The increase in wind 
speed (5 m/s) across the wind-shear layer used in the above exam- 
ple could provide around 9 m of altitude gain (assuming no drag). 
The bird could then trade height for airspeed on descending back 
down to the ocean surface. 

Temporal wind gusts, in contrast to the structure gusts encoun- 
tered in gust soaring, can be used to gain additional energy. A fas- 
ter-than-average wind-speed gust contains greater-than-average 
wind shear, through which a bird could swoop extracting a great- 
er-than-average amount of energy. The trick of temporal-gust soar- 
ing is to Maximize time in gusts and minimize time in lulls. An 
albatross undoubtedly knows how to identify gusts—rougher 
water surface, blowing spray—to use for additional energy gain. 


3.2. Minimum wind shear required for sustained soaring 


The Rayleigh cycle of wind-shear soaring as shown in Fig. 4 was 
used to model an albatross gust soaring in nearly-circular flight 
along a plane tilted slightly upward into the wind. The essential 
assumptions are that (1) the plane crosses the wind-shear layer 


at a small angle with respect to the horizon so that vertical motions 
can be ignored, (2) the mean airspeed and mean glide ratio can be 
used to represent flight in the circle, and most importantly, (3) con- 
servation of energy in each layer requires a balance between the 
sudden increase of airspeed (kinetic energy) due to crossing the 
shear layer and the gradual loss of airspeed due to drag over half 
a loop, resulting in energy-neutral flight. The motion during each 
half loop is somewhat similar to a landing flare when a glider 
maintains constant altitude and airspeed is slowly dissipated by 
drag. 

Conservation of energy indicates that the vertical increase of 
wind speed (AW) across the wind-shear layer required for en- 
ergy-neutral wind-shear soaring is given by AW =gt/2(V/V,), 
where g is gravity, t is the period (10 s) of the nearly-circular flight, 
and V/V, is the ratio of the bird’s airspeed (V) to sinking speed 
through the air (V,) (see Appendix A for details). The glide ratio, 
V/V, very closely equals lift/drag for values >>+1 typical of albatross 
flight. Relevant values of airspeed (V), sinking speed (V,), and glide 
ratio in the modeled Rayleigh cycle were calculated using the aero- 
dynamical equations of motion (Lissaman, 2005, 2007; Torenbeek 
and Wittenberg, 2009) and the maximum glide ratio V/V, = 21.2, 
and associated cruise airspeed, V. = 16.0 m/s, of a wandering alba- 
tross (Diomedea exulans) in straight flight (Pennycuick, 2008) (Ta- 
bles 1 and 2, Appendix A). The minimum AW for the Rayleigh 
cycle was found to be 3.55 m/s with an associated average airspeed 
V=16.0 m/s, average glide ratio V/V, = 13.8, and average bank an- 
gle of 45.7° in the circle. 

The 16.0 m/s average airspeed in the circle and the AW= 
3.55 m/s increase of airspeed encountered by a bird crossing the 
wind-shear layer indicate that the bird’s airspeeds before and 
after crossing the wind-shear layer are V; = 14.22 m/s and V>2= 
17.78 m/s. These values are greater than the airspeed of minimum 
sinking speed in the circle (14.12 m/s), and they fall within the 
range of the nearly constant values of V/V, (13.6-13.9) and AW 
(3.5-3.6 m/s) in Table 1, which justifies the use of averages of 
airspeed, glide ratio (V/V,), and AW in modeling the nearly-circular 
flight. 

The minimum wind shear AW calculated above is based on the 
10s observed swoop period. However, minimum AW is a function 
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Table 1 
Characteristics of circular flight and the minimum wind speed increase (AW) across the wind-shear layer required for energy-neutral soaring in a 
Rayleigh cycle (t= 10s). 


Airspeed in Bank angle @~ Sinking speed Glide ratio Wind speed increase Comments 

circle, V (m/s) V, (m/s) in circle (V/V,) AW (m/s) 

14.1 42.1 1.04 13.7 3.6 Minimum sink rate 
15.0 43.8 1.08 13.8 3.5 

16.0 45.7 1.15 13.9 3.5 Maximum glide ratio 
17.0 47.4 1.23 13.8 3.5 

18.0 49.1 132 13.6 3.6 

19.0 50.6 1.42 13.4 Ba 

20.0 52.0 1.53 13.0 3.8 

21.0 53.4 1.66 12.6 3.9 

22.0 54.6 1.80 12,2 4.0 

23.0 55.8 1.95 11.8 4.2 

24.0 57.0 212 11.3 4.3 

25.0 58.0 2.30 10.9 4.5 

26.0 59.0 2.50 10.4 4.7 

27.0 60.0 2/1 10.0 4.9 

28.0 60.9 2.93 9.6 5.1 

29.0 61.7 3.17 9.1 5.4 

30.0 62.5 3.43 8.7 5.6 


Values for circular flight were calculated using a quadratic drag law, the maximum glide ratio (21.2) and associated cruise airspeed (V,. = 16.0 m/s) 
of straight flight for a wandering albatross (Pennycuick, 2008) (see Appendix A). Minimum sink rate was calculated with V= 0.760V,/,/cos@. 


Table 2 
Characteristics of a wandering albatross (Diomedea exulans), the ultimate soaring bird. 


Mass: 10 kg 

Maximum wing span: 3.5 m (largest of all birds) 

Wing aspect ratio: 15 

Straight flight characteristics (Pennycuick, 2008): 

Maximum glide ratio: 21.2 at cruise airspeed of 16.0 m/s and sink rate of 0.755 m/s 

Minimum sink rate: 0.624 m/s at airspeed 11.5 m/s and glide ratio 18.4 

Range: ~10 million km. This range is based on a bird flying 2/3 of the time at an average travel velocity of 10 m/s for 50 years and is equivalent to approximately 400 
circumnavigations in the latitude band of the Southern Ocean 

Life span: ~50 years 

Food: Squid, fish, krill 

Nesting sites: Islands in the Southern Ocean 

Distribution at sea: Most albatross species, including wandering albatrosses, forage over the Southern Ocean between latitudes 30-60°S 

Albatross flight: As a result of their long, narrow, high aspect ratio (~12-15) wings, albatrosses have the largest glide ratios (horizontal velocity/vertical velocity) and are 
the greatest soaring birds. A shoulder lock system holds their wings in a horizontal position so that little energy is expended while soaring (Pennycuick, 1982). 
Soaring efficiency enables the heart rate of a soaring albatross to be close to the basal heart rate when resting. Most albatross species lack the muscles to undertake 
sustained flapping flight and thus are dependent on obtaining energy from the wind for sustained soaring. In contrast, most petrels, which are smaller than 
albatrosses, have smaller wing aspect ratios (<10) and tend to flap-glide (Pennycuick, 1982) 

Typical winds and waves in the Southern Ocean: Maximum average wind speeds, located near 50°S, are westerly at ~11 m/s. Maximum average significant wave heights 
near 50°S are ~5 m. Wind speeds and wave heights decrease from this latitude southward towards Antarctica and northward towards 30°S, where values of average 
wind speed are ~6-7 m/s and average significant wave heights are ~2 m. Thus, typical average wind speeds in the Southern Ocean are 6-11 m/s and typical average 
wave heights are 2-5 m (Young, 1999; Hanley et al., 2010). Based on these values, this paper used wind speeds of 5-10 m/s and a wave height of 3 m (period ~ 9- 
10s) as approximate “typical conditions” in order to crudely estimate the relative energy gains from wind-shear soaring and wave-slope soaring. An updraft of 
approximately 1 m/s is generated by the orbital motion of a 3 m wave (period ~9.5 s) and wind-wave interactions as described in the text 


of loop period. The optimum loop period for absolute minimum 3.3. Discussion of minimum wind shear (AW) 
AW in the Rayleigh cycle was found to be 7.25 s, and this coincides 


with the cruise airspeed of 16.0 m/s and bank angle of 54.7°. The Why is the observed period (10 s) of an albatross swoop greater 
absolute minimum AW is 3.36 m/s (Appendix A, Table 3). than the 7.2 s loop period associated with the absolute minimum 
Table 3 
Minimum increase of wind speed AW required for energy-neutral wind-shear soaring. 
Wind profile Max V/V, V. (m/s) t(s) AW (m/s) Travel velocity (m/s) Flight cycle Reference 
Rayleigh 21.2 16.0 10.0 3.55 10.2 Loop This paper 
Rayleigh Pe 16.0 725 3.36 10.2 Loop This paper 
Rayleigh 21.2 16.0 735 3.36 10.0 Loop Lissaman (pers. com.) 
Rayleigh 21.2 16.0 7.70 3.58 8.8 Circuit Lissaman (pers. com.) 
Exponential 25.0 15.0 - 3.74 ~ Circuit Lissaman (2005) 
Logarithmic 20.0 12.6 6.9 3.8 9.2 Snaking Sachs (2005) 


The Rayleigh wind profile has two layers with zero wind in the lower layer and a wind speed of AW in the upper layer. For the exponential and logarithmic profiles, the listed 
increase of wind speed (AW) is over the range of heights flown, 0-18 m (Lissaman, 2005), 1.5-20 m (Sachs (2005)). Maximum glide ratio (V/V,) and the associated cruise 
airspeed (V,) for straight flight define the glide polar (see Appendix A). Values consistent with a wandering albatross, V/V, = 21.2, V.= 16.0 (Pennycuick, 2008), were used in 
this table where possible. The period of a flight cycle is t, and the observed swoop period is 10s. The absolute minimum AW for a Rayleigh cycle loop occurs at a period of 
7.25 Ss. Travel velocities are perpendicular to the wind velocity and consist of averages over two half loops, assuming that the half loops could be connected in a snaking cycle. 
A “circuit” returns to the starting height, velocity, and ground position. A “loop” returns to the starting height and velocity but not ground position because of leeway. 
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AW in the Rayleigh cycle? An obvious answer is that the 10s ob- 
served swoop period was of birds soaring in wind shears (AW) lar- 
ger than the absolute minimum, and that given sufficiently large 
AW a bird flies with a larger loop period in order to reduce aerody- 
namic wing loading. It is possible that for a given AW greater than 
the absolute minimum a bird increases its swoop period to control 
energy gain in order to maintain an average airspeed of around 
16 m/s. An implication is that an albatross trying to wind-shear 
gust soar near the absolute minimum wind shear (3.4 m/s) needed 


for energy-neutral soaring must reduce the period of swoops from 
10s toward 7.2 s in order to continue soaring. 

The minimum amount of wind shear (AW = 3.4-3.5 m/s) across 
the wind-shear layer found above is small enough to suggest that 
the associated total wind speed might not be fast enough to gener- 
ate large waves with sharp crests required for gust soaring and that 
therefore gust soaring might not be an appropriate model for such 
low wind-shear values. However, in the presence of decreasing 
winds, which had generated large waves, or in the presence of 
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Fig. 5. Plan view, showing examples of snaking (zigzag) flight at an angle of 60° to the right of the wind similar to the flight shown in Fig. 1. (A) Rayleigh snaking cycle created 
by linking together semi-circular pieces of the circular Rayleigh cycle to simulate the observed albatross zigzag flight pattern (Fig. 1) and average travel velocity. (B) Semi- 
circular snaking cycle modified to cross the wind-shear layer parallel to the wind direction for maximum energy gain. (C) Snaking cycle modified so that the upwind climb is 
parallel to the wind and the descent is obliquely downwind and parallel to wave crests; this pattern closely resembles my observations of albatross soaring and those of Idrac 
(1925, 1931). (D) Snaking cycle further smoothed so that the climb is obliquely upwind and the descent is mainly across-wind (observed by Idrac, 1925, 1931). Flight patterns 


in panels C and D could be used to reduce energy gain in large wind shear (AW). 


Please cite this article in press as: Richardson, P.L. How do albatrosses fly around the world without flapping their wings?. Prog. Oceanogr. (2010), 


}doi:10.1016/j.pocean.2010.08.001 





P.L. Richardson / Progress in Oceanography xxx (2010) xxx-xxx 7 
Table 4 
Travel velocity and the increase of wind speed (AW) across the wind-shear layer required for Rayleigh snaking flight in different directions (0) relative to the wind direction. 


Across-wind 
velocity 


Increase of wind 
speed AW (m/s) 


Travel direction 
through the air 


Upwind velocity Travel velocity over the ocean (course made good) 


isve tain Leeway AW/2 = (10.2)cos(@) Sum of (10.2)sin(0) Speed (m/s) Direction . 
(m/s) (m/s) components (m/s) nS relative to wind 

0 10.6 —5.3 10.2 4.9 0.0 4.9 0 
10 8.5 —4,2 10.0 5.8 1.8 6.1 17 
20 6.8 —3.4 9.6 6.2 3.5 7.1 30 
30 5.7 —2.8 8.8 6.0 5.1 7.9 40 
40 4.9 —2.4 7.8 5.4 6.6 8.5 51 
50 4.3 —2.2 6.6 4.4 7.8 9.0 61 
60 4.0 —2.0 5.1 3.1 8.8 9.4 71 
70 3.7 —1.9 3.5 1.6 9.6 9.7 80 
80 3.6 —1.8 1.8 0.0 10.0 10.0 90 
90 3.5 —1.8 0.0 —1.8 10.2 10.3 100 
100 3.6 —1.8 —1.8 —3.6 10.0 10.7 110 
110 3.7 —1.9 —3.5 —5.4 9.6 11.0 119 
120 4.0 —2.0 —5.1 —7.1 8.8 11.3 129 
130 4.3 —2.2 —6.6 —8.7 7.8 11.7 138 
140 4.9 —2.4 —7.8 —10.2 6.6 2,2 147 
150 5.7 —2.8 —8.8 —11.7 5.1 1a 156 
160 6.8 —3.4 —9.6 —13.0 3.5 13.5 166 
170 8.5 —4,2 —10.0 —14.3 1.8 14.4 173 
180 10.6 —53 —10.2 —15.5 0.0 15.5 180 


Note: Mean airspeed in snaking flight is 16.0 m/s, and mean travel velocity through the air is 10.2 m/s. 


large swell propagating into an area from elsewhere, the waves 
might be sufficiently large and sharp enough with the addition of 
local wind waves to generate lee eddies, which can be used for gust 
soaring. In addition, lee eddies not associated with a separated 
boundary layer could be present and useful for gust soaring (more 
on this below). 

The value of absolute minimum AW determined with the sim- 
ple model described above closely agrees with results of a study by 
Lissaman (personal communication, 2010), who integrated the 
aerodynamic equations to determine the minimum increase of 
wind speed for energy-neutral wind-shear soaring in the Rayleigh 
two-layer case (Table 3). He found a minimum AW of 3.36 m/s 
(same as present study) for a loop with the end point located 
downwind of the starting point, similar to the simple model de- 
scribed above. The loop period associated with this minimum 
AW is 7.35. s, and the (constant) bank angle is 54°, very close to 
the values found in the present study. In another case, Lissaman 
(personal communication, 2010) forced the end point of a cycle 
to match the starting point in order to eliminate leeway in a circuit, 
and he found that the minimum AW increased slightly to 3.56 m/s 
and the loop period increased to 7.70 s (Table 3). 

Sachs (2005) modeled an albatross soaring in a logarithmic 
wind profile and estimated the minimum wind velocity required 
for energy-neutral wind-shear soaring to be around 9 m/s at a ref- 
erence height of 10 m. The increase of wind speed (AW) encoun- 
tered by the simulated bird was 3.8 m/s (Table 3) over the actual 
range of heights flown in the model swoop, 1.5-20 m above sea le- 
vel. Most of the wind shear in a logarithmic profile below 20 m 
height is located within 1.5 meters of the ocean surface (Fig. 2) 
and was missed because the simulated bird did not fly closer to 
the surface than this height due to the bird’s long wings. This sug- 
gests that Sachs’ (2005) minimum reference wind speed of 9 m/s 
overestimates the minimum wind speed required for gust soaring, 
which includes the large shear located in the lower part of the 
Shear layer. Lissaman (2005) included the lower part of an expo- 
nential wind profile and found a rather similar minimum AW of 
3.7 m/s over the heights flown, 0-18 m (Table 3). This estimate 
using the whole wind profile (0-18 m) appears to be a better mod- 
el of gust soaring since the simulated bird starts and ends at the 
bottom of the shear layer. The implication is that the minimum 
wind velocity at a height of 10 m required for sustained gust soar- 


ing in a smooth wind profile over the ocean is only around 3.4 m/s 
(Lissaman, 2005) not 9 m/s (Sachs, 2005). These results of the min- 
imum AW =3.7-3.8 m/s, based on smooth wind profiles, imply 
that the simple Rayleigh cycle (AW = 3.4-3.5 m/s) is also a fairly 
good approximation for a bird soaring in the reduced wind-shear 
region located higher above sea level. 

Important missing ingredients in these numerical simulations 
of wind-shear soaring are updrafts, separated boundary layers, 
and lee eddies that real albatrosses appear to exploit. An advantage 
of the gust-soaring technique is that a bird dives down into a lee 
eddy across the strongest part of the wind-shear layer and then 
climbs upward across it again (Figs. 3 and 4), thereby making good 
use of the available wind shear. In Sachs’ (2005) simulation of 
wind-shear soaring, the albatross missed most of the strong wind 
shear located close to the ocean surface. Another advantage of gust 
soaring is that a bird remains below the wind-shear layer for part 
of the loop, thereby minimizing leeway. This could be an advantage 
for a bird trying to soar upwind. 


3.4. Mean travel velocity 


The mean travel velocity of an albatross gust soaring was mod- 
eled by dividing the Rayleigh cycle into semi-circular pieces and 
connecting a series of them together in a snaking flight pattern 
to simulate the observed zigzag flight. For example, a clockwise 
semi-circle located above the shear layer was connected to a coun- 
ter-clockwise semi-circle located below the shear layer to simulate 
flight in a direction 60° to the right of the wind as illustrated in 
Fig. 5A. A bird was assumed to quickly switch banking directions 
during the climbs and descents. The 16.0 m/s mean airspeed in 
the series of connected semi-circles results in a mean travel veloc- 
ity through the air of 2V/z = 10.2 m/s. During the half loop located 
in the upper layer, a bird is carried downwind by the wind at a 
speed of AW so that the end point of a loop is displaced downwind 
of the starting point. This results in an average leeway over a loop 
equal to half of the wind speed in the two layers (AW/2), which 
equals 1.8 m/s for flight perpendicular to the wind. 

Travel velocity was calculated for mean flight directions ori- 
ented at various angles with respect to the wind direction (Table 
4). It was assumed that the average airspeed of 16.0 m/s and the 
maximum glide ratio in the flight remain constant, and that as 
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the mean flight direction through the air varies from being perpen- 
dicular to the wind, which is the most efficient course for gaining 
energy, an increase of wind shear AW is required to sustain soar- 
ing. The increase is needed because the bird would cross the 
wind-shear layer at an angle with respect to the wind direction, 
resulting in a smaller increase of airspeed for a given amount of 
wind shear. A complication is that real flown “semi-circles” could 
depart from the assumed semi-circles. Results are displayed as a 
travel velocity polar diagram in the shape of a valentine (Fig. 6), 
which gives travel velocity in any direction relative to the wind. 
Although this type of diagram is common for sailboats, Fig. 6 ap- 
pears to be the first one based on model simulations of an albatross 
Sailing through the wind. The valentine can be compared with the 
observations of Alerstam et al. (1993, their Fig. 11), plotted as a 
similar diagram. 

Results shown in Figs. 6 and 7 and Table 4 indicate that a mean 
travel velocity can be in any direction relative to the wind given 
sufficient wind shear (AW), including directly upwind, although 
this direction is not the fastest way to travel upwind. The fastest 
upwind travel velocity is 6.2 m/s (AW=6.8 m/s) in a direction 
~30° relative to the wind. This suggests that the fastest way to soar 
directly upwind is by tacking like a sailboat through angles of ~30° 
to the right and left of the wind direction. Tacking upwind with 
this angle can also be accomplished with less wind shear than by 
flying on a mean course directly upwind (AW = 10.6 m/s). Table 4 
also indicates that an albatross tacking at an angle of 50° relative 
to the wind can soar upwind at 5.4 m/s, faster than the wind speed 
(AW = 4.9 m/s) located at the top of the wind-shear layer. Tacking 
refers to the bird alternating the mean travel velocity to the left 
and right of the wind direction, not the 10s zigzag swoops along 
the mean travel velocity. 

The fastest model travel speed is directly downwind at 15.5 m/s 
(AW = 10.6 m/s), although it is almost as fast (~12-13 m/s) to tra- 
vel obliquely downwind at angles of 140-165° relative to the wind 
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Fig. 6. Travel velocity polar diagram in the shape of a valentine based on an average 
airspeed of 16.0m/s and the minimum wind shear (AW) values required for 
sustained soaring in different directions relative to the wind direction. Travel speed 
in a particular direction is given by the length of a line starting at the origin (x = 0, 
y = 0) and ending on the valentine. For example, the travel speed in a direction 30° 
to the right or left of the wind direction is 7.1 m/s, and the associated travel speed in 
an upwind direction is 6.2 m/s (Table 4). Values were calculated using the Rayleigh 
snaking cycle shown in Fig. 6A, a mean airspeed of 16.0 m/s, and leeway equal to 
the average wind speed of the two layers as given in Table 4. Two components of 
travel velocity were combined to create the valentine—the mean travel velocity 
through the air equal to 10.2 m/s and the downwind leeway equal to AW/2. 
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Fig. 7. Mean travel velocity over the ocean using an average airspeed of 16.0 m/s 
plotted versus the angle between travel direction and the wind as shown in Fig. 6. 
The associated minimum wind shear across the wind-shear layer (AW) required for 
energy-neutral soaring in the different directions is also shown. Values were 
calculated using the Rayleigh snaking cycle shown in Fig. 5A, a mean travel velocity 
of 10.2 m/s and leeway equal to the average wind speed of the two layers (AW)/2 as 
given in Table 4. The minimum wind speed increase across the sharp wind-shear 
layer required for energy-neutral soaring (t=10s) ranges from a minimum of 
AW = 3.5 m/s for across-wind flight to a maximum of AW=10.6 m/s for flight 
directly upwind and downwind. 


in AW = 4.3-6.8 m/s (Table 4, Fig. 7). Flying at these angles requires 
only around half the minimum wind shear for direct downwind 
flight (Table 4, Fig. 7) and would maximize distance over the ocean 
at these AWs for foraging and circumnavigations. The most effi- 
cient directions to fly, as defined by the travel airspeed divided 
by AW being greater than 2.5, are 80-150° over the ocean, or direc- 
tions extending from nearly perpendicular to the wind to obliquely 
downwind, with the most efficient direction at around 110° rela- 
tive to the wind. 

Travel velocities given above used a 16.0 m/s airspeed. If wind 
shear were greater than the minimum AW for soaring in a partic- 
ular direction (Table 4, Fig. 7), then in principle an albatross could 
gain additional energy during a swoop and use it to fly faster than 
16.0 m/s. Thus, the travel velocities in Table 4 and Figs. 6 and 7 
could underestimate real albatross travel velocities in larger wind 
shears. For example, the mean travel velocity perpendicular to the 
wind could be ~22 m/s in wind shear (AW) equal to 7 m/s, which 
is twice the minimum shear required for travel velocity perpendic- 
ular to the wind at 10.2 m/s. However, albatrosses do not appear to 
fly this fast (Wakefield et al., 2009) probably because of the large 
associated aerodynamic forces acting on the bird’s wings at fast 
airspeeds. For example, a travel velocity of 22 m/s corresponds to 
an airspeed of 35 m/s, bank angle of 66°, and lift force of 2.5 g, 
which is almost twice that encountered with an airspeed of 
16 m/s and bank angle of 45°. 

In summary, the model Rayleigh snaking cycle indicates that an 
albatross can fly in any direction including directly upwind when 
tacking is combined with travel velocities. Upwind flight velocity 
can be faster than the wind speed located at the top of the 
wind-shear layer when flying at angles of 40-60° relative to the 
wind (Table 4). Wind shears that are larger than the minimum 
wind shear for energy-neutral soaring could enable an albatross 
to fly faster the than travel velocities calculated for minimum wind 
shear, but albatrosses do not appear to do this. Instead, they appear 
to control airspeed in order to limit aerodynamic force on their 
wings. 
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3.5. Discussion of travel velocity 


The across-wind travel velocity calculated above (10.2 m/s) was 
based on a simplified Rayleigh cycle consisting of a series of linked 
semi-circles and average airspeed, bank angle, and glide ratio. Liss- 
aman (personal communication, 2010) numerically simulated the 
details of a Rayleigh cycle loop and circuit. The implied across- 
wind travel velocity of the loop is 10.0 m/s, close to that found 
above (Table 3). The implied travel velocity of the Rayleigh cycle 
circuit is somewhat smaller than this (8.8 m/s) because the bird 
was forced to return to the starting position. Sachs (2005), using 
a logarithmic wind profile, found an across-wind travel velocity 
of 9.2 m/s and a leeway of 9.0 m/s. This across-wind value is some- 
what slower than the Rayleigh cycle loop because Sachs used smal- 
ler values of glide ratio (20) and cruise velocity (12.6 m/s) (Table 
3). Sachs’ leeway value is much larger than that for the Rayleigh cy- 
cle because the simulated bird remains in the region of fast wind 
speeds above 1.5m in elevation. This could be advantageous for 
a bird trying to fly downwind but a disadvantage in trying to fly 
upwind. The resulting travel velocity over the ocean is 12.9 m/s 
in a direction 134° relative to the wind. 

Some of the albatrosses I observed soared upwind at the same 
speed (~6 m/s) as our ship steamed upwind, which matches the 
upwind travel velocity (6 m/s at 25-40°) of the Rayleigh cycle (Figs. 
6 and 7). Alerstam et al. (1993) report observed albatross travel 
velocities in a travel polar diagram very similar to those estimated 
with the Rayleigh cycle, including across-wind travel speed of 
10.2 m/s (in wind speeds of 8-13 m/s) compared to 10.2 m/s in 
the Rayleigh cycle (Fig. 6). The fastest travel velocity observed by 
Alerstam et al. was 22 m/s at a downwind angle of 140° and in fast 
wind speeds of 13-20 m/s, compared to a travel velocity of 12 m/s 
in the Rayleigh cycle. However, this fast observed speed no doubt 
includes a large downwind leeway velocity. 

Wakefield et al. (2009) found a strong linear relationship be- 
tween the ground speed of albatrosses and the wind-speed compo- 
nent in the direction of flight. For example, the travel velocity 
through the air of wandering albatrosses was found to equal an 
average 10 m/s plus 0.4 times the wind component (at 5 m height) 
in the direction of flight. Values for the Rayleigh cycle (Table 4) give 
a similar 10 m/s travel velocity through the air and a similar linear 
relationship with the wind-speed component (AW) in the direction 
of flight, indicating that the Rayleigh cycle is a good model for ob- 
served albatross flight speeds. The observed speeds suggest that 
albatrosses generally fly with a 10 m/s travel velocity through 
the air even in relatively fast winds (6-9 m/s) and large wind shear. 
The implication is that the birds control the amount of energy gain 
in wind-shear soaring in order to maintain a nearly-constant aver- 
age airspeed. 

In order to fly with a 10 m/s travel velocity in wind shear that is 
much larger than the minimum required for energy-neutral soar- 
ing at that airspeed (Table 4), an albatross must modify its flight 
to gain less energy than the maximum possible. To reduce energy 
gain a bird could increase the period of its swoops. It could climb 
less high through the wind-shear layer or could remain in the 
weak wind shear located higher up in the wind profile as modeled 
by Sachs (2005). A bird could also cross the wind-shear layer with 
a large horizontal angle relative to the wind direction. 

In principle, an albatross could fly on a nearly-straight course 
perpendicular to the wind and also in other directions but not di- 
rectly upwind or downwind by using the net energy gained from 
climbing and descending across the wind-shear layer. A hypothet- 
ical flight pattern might consist of the following: A bird flying east- 
ward (for example) in the lower layer below a north wind of 8.7 m/ 
S rises into the upper layer. As the bird encounters the wind veloc- 
ity, airspeed increases by AV = 2.3 m/s, and the relative wind shifts 
~30° in an upwind direction. The bird quickly turns (yaws) left and 


heads into the relative wind to avoid side slipping. It then banks 
slightly (~10°) to the right and turns eastward by the end of the 
5s half loop. During the 5s turn, airspeed decreases by 2.3 m/s 
due to drag. The bird then descends into the lower layer and 
encounters another increase of airspeed (AV = 2.3 m/s) and a shift 
of the relative wind ~30° in a downwind direction. The bird 
quickly turns (yaws) right to head into the relative wind and then 
banks slightly (~10°) to the left and turns eastward again, etc. The 
airspeed increase (AV = 2.3 m/s) for nearly-straight flight is smaller 
than the minimum (AV = 3.5 m/s) for the Rayleigh cycle because of 
the smaller bank angle and larger glide ratio in the nearly-straight 
flight. The resulting travel velocity over the ocean would be 
15.5 m/s at a direction of 105° relative to the wind. 

This hypothetical flight trajectory suggests that the snaking 
Rayleigh cycle is 2.5 times more efficient at increasing energy 
(smaller required AW) than the nearly-straight across-wind flight. 
More importantly, it indicates that the general rule of wind-shear 
soaring—climbing headed upwind and descending headed down- 
wind—can be relaxed given sufficiently-large wind shear. Further- 
more, it suggests that an albatross could control energy gain and 
airspeed by reducing the curvature (smoothing) of the Rayleigh 
semi-circular snaking flight pattern to make it straighter as illus- 
trated in Fig. 6C and D and as observed by Idrac (1925, 1931). Along 
with straighter flight come a smaller bank angle and smaller aero- 
dynamic force, which would be less stressful for a bird. 


4. Wave-slope soaring 


A common perception is that updrafts over a wave are caused 
mainly by wind flowing up the windward face of a wave (see 
Pennycuick, 1982; Wilson, 1975). However, the causes and struc- 
tures of updrafts are considerably more complicated than this 
and include air displaced upward by the orbital velocity of the 
wave surface and vertical velocities from wind-wave interactions. 
These can occur simultaneously, their effects adding and subtract- 
ing from each other in complicated ways. Soaring using any 
updraft caused by wind interacting with waves is referred here 
to be “wave-slope soaring,” realizing that this term is a simplifica- 
tion. A bird flying horizontally in an updraft over waves could gain 
altitude (potential energy) from the wind. This energy gain could 
be used to balance the bird’s loss of energy due to drag in 
energy-neutral flight. 

The following summarizes observations of albatross soaring and 
model simulations of wind over waves with the intent to infer 
methods of wave-slope soaring. First, when the air is still and the 
seas calm, albatrosses are observed to usually sit on the ocean 
surface or, infrequently, fly by alternating flapping and gliding 
(Pennycuick, 1982; Alerstam et al., 1993). This indicates that 
thermals are not effective for albatross soaring or that albatrosses 
have not learned how to soar in thermals, which seems unlikely. 

Reports of albatrosses soaring over swell waves in zero wind 
(Alerstam et al., 1993; Froude, 1888; Pennycuick, 1982) illustrate 
that the orbital velocity of the wave surface forces a substantial up- 
draft over the slope of a wave facing its direction of propagation. 
Froude estimated an updraft velocity of around 1 m/s above a wave 
of wavelength 150m, amplitude 1.5m, period 10s, and phase 
speed 15 m/s. Since the minimum sinking rate of a wandering alba- 
tross is around 0.6 m/s (Pennycuick, 2008), this bird could easily 
Soar over these waves in zero wind. The trick in wave-slope soaring 
is to maximize the time in the strongest updrafts and minimize 
time outside this region especially in downdrafts. By flying in the 
strongest updrafts an albatross could gradually increase its air- 
speed over the airspeed of minimum sink and use the excess either 
to climb or to fly in an across-wave direction through a downdraft. 

When a swell wave propagates in an upwind direction then the 
updraft created by wind flowing up the windward wave face is 
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Fig. 8. Example illustrating that wind blowing over a wave forms a lee eddy, which is a region of closed streamlines shaped like a cat’s eye (after Sullivan et al. (2000), their 
Fig. 16b). In this example, wind speed at a height of one wavelength is equal to around four times the wave phase speed c, and c/u* = 3.9, where u* is the friction velocity. The 
lee eddy is centered just upwind of the wave trough and deflects streamlines upwards above the eddy. Updraft wind velocity (shaded) is centered over the leeward (right) 
side of the wave, over the windward (left) side of the eddy, and extends upward over the lee eddy. The dotted line indicates the critical layer, where the wind changes 
direction, as viewed moving with the wave speed, from a downwind direction (to the right) above the critical layer to the opposite direction (to the left) below as caused by 


friction and surface boundary conditions. 


added to the updraft due to the orbital motion of the wave surface. 
This would provide an ideal situation for sustained wave-slope 
soaring that could be accomplished with smaller waves than those 
required to soar in zero wind. A similar situation of enhanced up- 
draft occurs when a swell wave propagates downwind faster than 
the wind speed. Sullivan et al. (2008) describe model simulations 
and observations that include illustrations of updrafts over swell 
waves. 

A fast wind flowing over a relatively slow wave can cause an up- 
draft over the windward wave face, but the updraft is countered 
somewhat by a downward orbital velocity there. Leeward of the 
wave crest and centered just upwind of the wave trough a lee eddy 
can form (Fig. 8), which is a region of closed streamlines centered 
about the critical layer and synchronous with the wave (Sullivan 
et al., 2000). The updraft region located over the leeward wave face 
is forced mainly by upward orbital velocity of the wave surface. 
Sometimes the lee eddy is known as a cat’s eye for its distinctive 
pattern as shown in Fig. 8. The region of closed streamlines in 
the lee eddy deflects the outer mean streamlines away from the 
wave surface creating a region of updraft over the eddy (Fig. 8). 
Sullivan et al. (2000) show streamlines for three examples corre- 
sponding to the increasing ratio of wave speed to wind speed, 
c/u* = 3.9, 7.8 and 11.5, where c is the wave phase speed and u* 
is the friction velocity. As the ratio increases, the cat’s-eye pattern 
thickens, its center moves upward and upwind toward the wave 
crest, and the region of updraft shifts upwind to extend vertically 
over the wave crest. Hristov et al. (2003) also show observations 
and model calculations of wave-induced lee eddies, which include 
an updraft over the lee side of a wave below the critical layer, 
forced mainly by the orbital velocity, and extending over the wave 
crest above the critical layer. Lee-eddy structures and the associ- 
ated pressure perturbations are thought to be important in gener- 
ating wind waves. 

The waves modeled by Sullivan et al. (2000) are sinusoidal. 
Wind waves tend to have sharper crests than this and can break 
in sufficient wind speed. When swift wind blows over a sharp- 
crested wave or a breaking wave, streamlines can separate from 
the wave as described by Pennycuick (2002) and shown in Fig. 3. 
The resulting lee eddy or separation bubble contains closed clock- 
wise streamlines (illustrated very schematically in Fig. 3) and thus 
a region of updraft along the leeward face of the wave due to the 
orbital velocity and upslope wind. Other examples of lee eddies 
are shown by Hsu et al. (1981, Fig. 15), Gent and Taylor, (1977, 
Fig. 2, 3, 7 and 8), Reul et al. (1999, Fig. 1). Vertical velocities of 
1.0 to 1.2 m/s have been measured in lee eddies downwind of 
sharp-crested and breaking waves associated with a free-stream 
wind velocity of 6 m/s (Kawai, 1982; Reul et al., 1999). The updraft 


associated with lee eddies might explain the observations by Idrac 
(1925, 1931) of some albatrosses soaring there as part of low-level 
flight in a Swoop. 

Three main points of this discussion add to previous descrip- 
tions of albatross wave-slope soaring as follows: (1) Lee eddies 
with updrafts can form downwind of a wave crest with or without 
a separated boundary layer (Figs. 3 and 8). (2) An updraft region is 
often located over the leeward slope of a wave and over a lee eddy, 
not just over the windward slope of a wave. The regions of updrafts 
over the leeward and windward wave slopes and the wave crest 
merge and extend upwards above a wave crest (Fig. 8). (3) Starting 
in the lower part of a lee eddy, an albatross can climb upwind over 
a wave crest and descend downwind back into the lee eddy (wind- 
Shear gust soaring) and remain in a region of updraft during the 
whole swoop. Combining both wind-shear soaring and wave-slope 
soaring in this maneuver would maximize total energy gain in a 
swoop. This would be very useful for soaring in low wind speeds. 


5. Relative energy gain from the two soaring techniques 


The gain of energy in wave-slope soaring during a typical swoop 
in a typical updraft was estimated crudely by assuming that an 
albatross spends around half of each swoop in a 1 m/s updraft over 
waves (as discussed above), resulting in an average vertical veloc- 
ity of 0.5 m/s (over a swoop). During the 5s of a half swoop this 
vertical velocity would result in a height gain of 2.5 m, assuming 
horizontal flight through the air. In the same 5s of a half swoop 
the bird could gain 9 m from wind-shear gust soaring as estimated 
above for an increase in wind speed of 5 m/s across the wind-shear 
layer or could gain 20 m for an increase of wind speed of 10 m/s 
across the wind-shear layer. These values indicate that wind- 
shear soaring during a swoop in these typical conditions in the 
Southern Ocean (Table 2) provides around 4-8 times more energy 
than wave-slope soaring or 80-90% of the total. Of course, in zero 
wind 100% of the energy for soaring would have to come from 
wave-slope soaring. 

The energy gained from wave-slope soaring during a swoop 
could be critical to soaring in smaller wind shears. For example, 
if wind shear fell below a certain threshold (AW ~ 3.4 m/s) below 
which energy-neutral soaring could not exist by itself, wave-slope 
soaring could provide the additional energy to make soaring possi- 
ble. In such a situation the bird would need to combine the two 
techniques as observed by Alerstam et al. (1993), Pennycuick 
(1982), and this present study. During the lower part of a swoop 
albatrosses often fly very close to the surface of a wave with a 
wingtip just grazing the water surface. This is interpreted to be 
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how a bird maximizes the effect of wave-slope soaring by flying 
both in the region of maximum updraft to gain energy and in 
ground effect to reduce energy loss from wingtip vortices and 
downwash. 


6. Summary 


The general rules of albatross soaring are: (1) No wind, no 
waves, no soaring; (2) Wave-slope soaring can be accomplished 
in swell without wind; (3) Wind-shear soaring can be accom- 
plished in wind without waves. (4) Wave-slope soaring and 
wind-shear soaring are usually combined when wind and waves 
coexist. In “typical conditions” in the Southern Ocean (Table 2), 
consisting of an increase of wind speed ~5-10m/s across the 
wind-shear layer and an updraft velocity ~1 m/s, wind-shear 
soaring provides around 80-90% of the energy for soaring. 

The albatrosses I observed appeared to use both wind-shear 
and wave-slope soaring techniques. The birds periodically flew 
very close to the ocean surface along the windward face of a wave, 
which is interpreted to be wave-slope soaring. They then turned 
Sharply upwind and pulled up just downwind of another wave, 
climbed above the windward face of that wave, banked steeply 
to turn downwind then descended toward the windward side of 
that wave; this is interpreted to be wind-shear gust soaring. 
Wave-slope soaring might also have been used during the climb 
and descent. Each swoop crossed twice through the wind-shear 
layer, once by climbing into the wind and once by descending 
downwind (or obliquely downwind), so that energy was gained 
on both crossings. The lower across-wind part of the swoop ap- 
peared to coincide with the windward face of a wave, where fast 
airspeeds could lead to large sinking rates. Flying in the updraft re- 
gion of a wave during this part of the swoop provides a gain in en- 
ergy where none is possible from wind shear. Winds interacting 
with waves can generate eddy structures, which can be used by 
an albatross to wave-slope soar over the leeward face and over 
the crest of waves in combination with gust soaring. 

A simple dynamical model based on the Rayleigh (1883) cycle 
of wind-shear soaring and Pennycuick’s (2002) concept of gust 
Soaring Over waves was developed to simulate the observed zigzag 
flight pattern. The Rayleigh cycle indicates that albatrosses can 
Soar in any direction including directly upwind, although across- 
wind and down-wind flight is faster. Simulated travel velocities 
in different directions were shown in a travel velocity polar dia- 
gram (Fig. 6), which is somewhat similar to ones generated for sail- 
boats but a first for albatrosses. Mean travel velocity perpendicular 
to the wind is around 10.2 m/s. Maximum upwind velocity of 
6.2 m/s can be achieved by a bird tacking through angles of 30° rel- 
ative to the wind. The minimum increase of wind speed across the 
thin wind-shear layer necessary for energy-neutral soaring at 
16.0 m/s airspeed was estimated to be around 3.5 m/s for across- 
wind flight with a 10s period (Table 4, Fig. 7). This result agrees 
closely with some detailed numerical simulations of albatross 
flight by Lissaman (personal communication, 2010) (Table 3). In 
large wind shear an albatross modifies its flight pattern in order 
to limit energy gain, airspeed, and aerodynamical force on its 
wings. 

Real albatross flight patterns appear to combine both wind- 
shear gust soaring and wave-slope soaring, which suggests that 
models that exclude waves and wind-wave interactions do not 
simulate real albatross flight. The simple Rayleigh cycle modeled 
here, which includes the effect of waves by simulating gust soaring 
through a detached wind-shear boundary layer, captures the 
essential dynamics of wind-shear soaring and appears to repro- 
duce quite well the observed features of albatross soaring, includ- 
ing realistic travel speeds in all directions, even upwind velocity 
faster than the wind speed at the top of the wind-shear layer. 


The model Rayleigh cycle balances the sudden gain of airspeed (ki- 
netic energy) due to crossing the wind-shear layer in gust soaring 
with the gradual loss of airspeed due to drag during each half loop 
and simply illustrates how an albatross can use wind shear to soar 
in different directions. 


7. Conclusions 


My conclusion about the relative importance of the two soaring 
techniques has evolved since I first watched albatrosses soaring 
over the South Atlantic Ocean and thought they mainly used up- 
drafts over waves. I now believe they mainly use wind-shear in 
gust soaring, except in light winds and in the presence of signifi- 
cant swell. Convincing evidence of the overall importance of the 
wind-shear soaring technique is given by modeling studies of 
wind-shear soaring (Lissaman, 2005; Sachs, 2005) including the 
simple Rayleigh cycle described above, the gust-soaring concept 
developed by Pennycuick (2002), the exceedingly fast RC glider 
speeds obtained by wind-shear gust soaring downwind of ridges, 
and the large relative energy gain from wind shear estimated here 
for a swoop in “typical conditions” in the Southern Ocean. 
Albatrosses appear to combine both soaring techniques in their 
swooping flight, with wave-slope soaring providing additional 
energy gain on the lower across-wind flight path where no energy 
gain is possible from wind-shear soaring and possibly additional 
energy gain in updrafts during the upper part of a swoop over a 
wave crest. 

To further investigate the soaring techniques of albatrosses, it 
would be helpful to instrument them to measure in high resolution 
their positions, orientations, velocities over the ground and 
through the air as well as information about wind and wave fields. 
If these kinds of data were obtained they might also provide new 
information about the interactions of wind and waves and wave 
generation. Models of albatross soaring would be improved by 
including waves and the interactions of wind and waves, which 
could help provide information about gust soaring and the wave- 
slope soaring part of a swoop. Interpreting such model simulations 
could be difficult since even model simulations of albatross soaring 
that exclude waves have been a challenge to interpret. 
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Appendix A. 
A.1. Modeled Rayleigh cycle 


In the modeled Rayleigh cycle (Fig. 4) the loss of energy over a 
half loop (t/2=5s) is given by mg(t/2)V,, where m is mass, g is 
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Table Al 

Summary of flight characteristics of a wandering albatross. 
Flight characteristic t (s) @ (°) 
Straight flight 
Minimum sink rate - 0 
Maximum V/V, - 0 
Circular flight 
Minimum sink at t= 10s 10.0 42.1 
Minimum height loss 9.3 45.0 
Rayleigh cycle loop 
Minimum AW at t=10s 10.0 45.6 
Absolute minimum AW 72 54.7 


V (m/s) V, (m/s) VIV, AW (m/s) 
122 0.66 18.4 - 

16.0 0.76 21.2 - 

14.1 1.04 13.7 - 

14.5 1.11 13.0 - 

16.0 1.14 13.8 3.55 

16.0 1.51 10.6 3.36 


Note that t is the period of a loop (observed t = 10s), @ is the bank angle, V is the airspeed, V; is the sinking speed, V/V, is the glide ratio, and AW is the minimum increase of 
wind speed across the wind-shear layer needed for energy-neutral flight in a Rayleigh cycle loop. Values were calculated using a quadratic drag law, the aerodynamic 
equations of motion (Lissaman, 2005), the maximum glide ratio (21.1) and the associated cruise airspeed (16.0 m/s) of a wandering albatross in straight flight (Pennycuick, 
2008) (see Appendix A). A Rayleigh cycle is based on a constant height loop (zero sink); listed values of V, represent the equivalent sinking speeds due to drag, assuming that 


dV/dt = 0. 


gravity, t is the period of a loop, and V, is the bird’s sinking speed 
through the air due to drag. Conservation of energy for energy- 
neutral soaring requires that this energy loss must equal the sud- 
den gain in kinetic energy (airspeed) from crossing the wind-shear 
layer, which is given by m(V5 — V{)/2, where V; is the airspeed be- 
fore crossing the wind-shear layer, and V2 is the airspeed after 
crossing the layer. In this latter term, V5 — V7 = (V2 — Vi)(V2 + V1). 
V,+V, is assumed to equal twice the average airspeed (2V) in 
the nearly-circular flight, and V2 — V; is the increase of airspeed, 
AV, of a bird crossing the wind-shear layer, which is assumed to 
equal the vertical increase of wind speed (AW) across the layer. 
Conservation of energy and the approximations given above indi- 
cate that 


gt 
AV WV Vs)’ (A1) 
where V/V, is the glide ratio averaged over 5 s of a half loop and over 
AV. 

The decrease in airspeed at constant height during a half loop 
was obtained by balancing the rate of change of airspeed (kinetic 
energy) with dissipation due to drag. This balance indicates that 
dV/dt = g/(V/V-). Since V/V, is virtually constant in the relevant air- 
speed range AV centered on the cruise airspeed of 16.0 m/s (Table 
1), airspeed decreases linearly in time. Therefore, the total decrease 
of airspeed, AV, in a half loop (t/2) is equal to gt/2(V/V,) as derived 
above (Eq. (A1)). 

Values of V/V, for circular flight were calculated using a qua- 
dratic drag law (drag proportional to lift squared), the aerodynamic 
equations of motion (Lissaman, 2005; Torenbeek and Wittenberg, 
2009), the maximum glide ratio (V/V,)max = 21.2, and the associ- 
ated cruise airspeed V.=16.0 m/s of a wandering albatross in 
Straight flight (Pennycuick, 2008). Specifically, values of V/V, were 
calculated using 


2(V/V-,) max 


me (V/V)? + (V/V cos py?’ 


(A2) 


where ¢ is the bank angle. For balanced circular flight, cos@ is given 
by 


1 


COS 0 = SS 
i (2nV/gt)? +1 


(A3) 

The airspeed at minimum sink rate in straight flight was found 
by setting the derivative dV,/dV (Eq. (A2)) equal to zero and solving 
for V, which is given by V=0.760V, = 12.2 m/s. The minimum sink 
rate (V,=0.66 m/s) at this airspeed is achieved with a large lift 
coefficient. The airspeed of minimum sink rate in a balanced circle 


was Calculated using this same lift coefficient and by assuming that 
the added lift required to balance centrifugal force in the turn is 
provided by an increase of airspeed. This results in the airspeed 
of minimum sink rate (often called minimum power) in a 10s cir- 
cle being given by V = 0.760V,/,/cos@ = 14.1 m/s (Tables 1 and A1). 
The minimum sink rate at this airspeed is V, = 1.04 m/s. These val- 
ues are close to those corresponding to the minimum height loss in 
a circle, which occurs at a bank angle @ = 45.0°. Using @ = 45.0° and 
V = 0.760V,/,/cos@, we find that values for minimum height loss in 
a circle are t=9.26s, V=14.5 m/s, and V, = 1.11 m/s (Table A1). 

The minimum AV (and AW) for an energy-neutral Rayleigh loop 
was determined by first calculating a AV using the V/V, at 16.0 m/s 
cruise airspeed in the loop (Table 1) and then calculating average 
values for that range in airspeeds (AV) centered on an airspeed 
of 16.0 m/s, etc. The minimum vertical increase of wind speed 
across the wind-shear layer for a 10s loop was found to be 
AW = 3.55 m/s; this value corresponds to an average V/V, = 13.8, 
average equivalent V,=1.14m/s, and average m=45.6° (Table 
A1). The Rayleigh cycle is based on a constant height loop (zero 
sink); the value for V, given above is the equivalent sink rate that 
would balance drag if dV/dt = 0. 

The minimum airspeed loss AV (and AW) calculated above for 
energy-neutral soaring used the observed 10s loop period. How- 
ever, minimum AV is a function of the loop period, and there is 
an absolute minimum AV, which occurs at the cruise airspeed 
V.= 16.0 m/s and at an optimum loop period top; given by 


TV V2 





=7.25s. (A4) 


Cont — 
Eq. (A4) was derived by setting the derivative d(AV)/dt (Eq. (A1)) 
equal to zero and solving for t. At top, and V. = 16.0 m/s, the glide 
ratio in the loop is just one half of the maximum glide ratio in 
Straight flight, and the equivalent sink rate (1.51 m/s) in the loop 
is equal to twice the sink rate (0.755 m/s) at 16.0 m/s in straight 
flight. The small optimum loop period (7.25s) results in more 
frequent crossings of the shear layer and a larger rate of energy gain 
than that in the 10s loop. 

Using top and V.= 16.0 m/s, the absolute minimum AV (and 
AW) was found to be 


TV .V2 


—— (V/V,) max 


=3.35m/s. (A5) 
Eq. (A5) can be simplified to AV= 7V,,/2, where V; is the sinking 
speed (0.755 m/s) corresponding to (V/V,)max. Values for the abso- 
lute minimum AV in a Rayleigh cycle loop were calculated as aver- 
ages within the AV range in airspeeds (3.35 m/s) centered on 
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16.0 m/s. The absolute minimum AV (and AW) = 3.36 m/s; this 
value corresponds to an average V/V,= 10.6, average equivalent 
V,= 1.51 m/s, and average ~ = 54.7° (Table A1). 
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Abstract 


In the domain of UAVs, endurance and range are key utility factors. However, 
small-sized UAVs are faced with serious limitations regarding energy storage options. A 
way to address this challenge is to seek for energy from the surrounding environment. 
One flight technique, called dynamic soaring, has been perfected by large seabirds like 
the albatross, which enables them to wander effortlessly in southern oceans. This thesis 
investigates the feasibility to find inspiration from the biological world in order to 


address the issue of limited endurance. 


First of all, an extensive literature background sums-up a range of technical 
aspects that can be learnt out of the flight of albatrosses. It reviews their morphology, 
flight performance and sensitivity to wind strength, their flight characteristics and 


energy expenditure management. 


Then, a methodology to simulate dynamic soaring flight is built-up by focusing 
first on adequate models for the vehicle and for the environment. It details the way 
those models are described quantitatively and qualitatively. As for the vehicle, a point 
mass model is chosen and applied to fixed-wing gliders of several scales, as well as to an 
albatross of generic dimensions. The environment is first modelled by classical 
boundary layer theory on a rather flat surface and then refined by taking into account 
specificity about the ocean boundary layer, such as varying roughness length and 


surface waves. 


Equations of motion are detailed for both points of views, earth-relative and air 
relative. This yields two different sets of equations of motion, eventually representing 
equivalent physics. An optimization problem is then set in order to determine, for the 
vehicle, how to extract energy from its environment. Variations in objective function 
and in constraints are described before presenting the numerical integration scheme 
which converts the optimization problem into that of finite-dimension. The solving 
tools and their specificity are presented, followed by a validation of the overall 


methodology with a particular study case from the literature. 


Basic principles of dynamic soaring flight are explicated by using a specific 
closed-loop study case. Energy-harvesting mechanisms are disclosed locally and next 


integrated over the whole flight path. A further illustration of dynamic soaring is 


provided by relaxing some periodicity constraints and opening the trajectory. The 
specificity of the ocean boundary layer environment is finally implemented and a 


refined energy-harvesting strategy is presented. 


Air relative equations of motion are dimensionless so as to highlight specific 
dynamic soaring behaviours, in the case of a simplified linear wind profile and 
eventually by finding an appropriate non-dimensionalization for a logarithmic wind 
profile. Conditions of similarities between dimensionless solutions are described and 


some basic DS characteristics are outlined. 


Finally, various dynamic soaring performance study case are computed. 
Optimized trajectories are implemented for the selected vehicles and compared on a 
required wind strength basis. The sensitivity of the required wind strength to the net 
flight heading as well as to the ground clearance and to the surface roughness length is 
determined by drawing performance charts. In order to enlarge the scope of favourable 
dynamic soaring conditions, thrust-augmented trajectories are considered. The range 
improvements offered by dynamic soaring are compared to the straight line case, for 


different wind strength and different net flight headings. 
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Résumé de la thése en francais 


Dans le domaine des drones, l'endurance et l’autonomie sont reconnues comme 
étant des facteurs d'utilité clés. Ces véhicules sans pilotes permettent d’élargir le spectre 
des missions accessibles aux aéronefs, notamment, pour certaines applications, en 
réduisant leur taille. Cependant, les drones de faibles dimensions sont confrontés a des 
limitations importantes concernant les options de stockage d'énergie. Des solutions 
alternatives de propulsion ont été approchées dans la littérature, comme le recours aux 
piles 4 combustible qui permet de tirer parti de l’importante densité énergétique du 
dihydrogéne. Ou encore la modification de la chaine énergétique a bord, en déportant la 
source au sol et en transmettant au véhicule l’énergie via un laser. Chacune de ces 
options a ses limites a lheure actuelle. Une autre fagon de répondre a cet enjeu est 
d’extraire de l'énergie a l'environnement ambiant au vehicule. Cela peut se décliner sous 
plusieurs formes et des solutions ont été abordées par le passé, comme le vol de pente, 
le vol de thermique ou encore l’exploitation photovoltaique de l’énergie solaire. Ce sujet 
se rallie a la méme démarche générale d’exploitation de |’énergie ambiante, mais se base 
sur des principes d’extraction différents. Par ailleurs, ces problématiques s’inscrivent 
dans un contexte de bio-mimétisme, ou la nature est considérée comme une source 
d’inspiration, d’autant plus alors que les drones peuvent atteindre des dimensions 


comparables a celles des oiseaux. 


Une technique de vol a été perfectionnée par de grands oiseaux de mer comme 
l'albatros, ce qui leur permet de parcourir les océans de |’hémispheére sud, sur des 
milliers de kilométres, sans effort. En effet, les travaux de biologistes ont permis 
d’identifier certaines caractéristiques du vol de l’albatros. Parmi les plus significatives 
figurent leur vol non battu et les distances considérables qu ‘ils parcourent, ce qui sous- 
entend une gestion énergétique intéressante. Afin de bénéficier d'une source 
d’informations aussi large que possible concernant le vol de l’albatros, une vaste revue 
bibliographique est faite des contributions biologistes offrant un éclairage sur les 
aspects physiques de l’albatros et de son vol. A ce titre, cette bibliographie s’attache a se 
distancer des aspects comportementaux, ou du moins a les distinguer, pour se 


concentrer sur les informations physiques du vol. 


Ainsi, on peut apprendre que des spécimens équipés de balises GPS ont 
parcouru des distances de voyages considérables, allant méme jusqu’a des 


circonvolutions dans les latitudes sud. Ces performances sont associées a des vitesses de 


lil 


voyage pouvant dépasser 900 kilometres par jour. Cette vaste couverture des mers 
s explique, pour l’aspect comportemental, par un besoin de l’oiseau de maximiser la 
probabilité de rencontrer des proies qui ont une distribution trés sporadique en surface. 
Ces performances soulévent d’autant plus d’interrogations que la morphologie des 
albatros démontre une incapacité a battre durablement des ailes. En outre, toutes les 
observations font de fait état d'un vol strictement plané, pour lequel l’albatros est en 
revanche particuliérement adapté. En effet, ses ailes ont un grand allongement et 
peuvent étre maintenue déployées sans que loiseau force, par l’intermédiaire d’un 
tendon. Des la fin du XXeme siecle, il a été conjecturé que ces oiseaux tiraient parti du 
vent. De fait, la répartition de leurs colonies sur les iles isolées des mers du sud 
correspond effectivement aux zones qui voient les vents les plus fréquents et les plus 
puissants. Cette dépendance au vent est méme marquée par des oiseaux bloqués sur 
l'eau lorsque le vent vient a tomber, alors qu’ils passent la majorité de leur temps en 
lair. D’autres informations sont utiles pour enrichir la compréhension de ce rapport au 
vent. Leur direction de vol par rapport au vent est orientée dans le méme sens que le 
vent, mais avec une composante de travers fortement prononcée, de telle sorte que 
l’oiseau vole aux alentours de 45° par rapport au sens du vent. Enfin, a plus faible 
échelle, leur vol fait apparaitre des trajectoires sinueuses et cycliques, avec des virages 
successifs en montée et en descente, entre la surface et des altitudes aux alentours de 
15-20 metres maximum. II apparait que les albatros tirent profit des gradients de vents 


qui se forment a la surface, dues aux interactions de couches limites atmosphériques. 


L’ensemble de ces considérations, associées au fait qu'un drone de faibles 
dimensions peut, a priori, reproduire les dimensions et performances planés de 
lalbatros, font de cette technique un sujet de recherche prometteur dans le domaine des 
drones. En effet, la source d’énergie requiert uniquement la présence de vent et est 
uniformément présente sur de vastes zones géographiques, au contraire des thermiques 
par exemples, qui sont limitées a des zones réduites et clairsemées. La technique de vol 
démontrée par les albatros est quelquefois appelée vol de gradient en francais, mais on 
lui préferera l’expression anglo-saxonne dynamic soaring (DS), plus couramment usitée. 
Elle traduit en effet la dynamique nécessaire a l’extraction d’énergie, la ot d'autres 
techniques, comme le vol de thermique ou de pente, voit l’extraction se faire sous forme 
constante, via un régime de vol statique. Lors du vol par DS, au contraire, les échanges 
d’énergie sont périodiques, avec des gains et des pertes, au travers de manoeuvres 
dynamiques de vol. La question principale de ce travail de recherche consiste a savoir 
s il est faisable, et dans quelle mesure, d’exploiter le DS afin de répondre aux enjeux de 


vol longue distance pour des drones. Ce travail de recherche a une dimension 
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clairement multidisciplinaire et on conceptualisera trois domaines de recherche afin de 
structurer notre raisonnement. Il s agit du véhicule, de l’environnement et des 
trajectoires. Différents objectifs de recherche sont déterminés, relatifs 4a chacun de ces 


concepts et a la facon dont ils s’influencent. 


Afin de circonscrire le sujet et ses développements, une bibliographie de la 
littérature est faite, parcourant les différentes approches d’extraction d’énergie au vent 
qui ont pu étre étudiées. Cela comprend le gust soaring, ou vol de rafale, qui exploite les 
variations stochastiques des composantes verticales de la masse d’air, au sein de la 
couche limite atmosphérique turbulente. Les échelles de temps considérées sont faibles, 
comparés au DS, qui exploite la répétabilité des conditions rencontrées au cours de 
cycles identiques de trajectoires. Le dynamic soaring nécessite la présence de gradient 
de vent ainsi que leur connaissance a priori et peut donc étre envisagée sous différentes 
formes. La revue bibliographique parcoure ainsi les études du DS qui ont été faites non 
seulement sur une surface pseudo-plane, mais aussi en aval d'une colline, dans le 
courant d/altitude jet stream ou encore au sein dune tempéte avec un champ 
rotationnel. Cette revue permet de fixer les bases de notre étude et de confirmer ses 
objectifs, en révélant la nécessité de réunir les différents points de vue existants, 


notamment concernant les principes d’extraction d énergie. 


Notre étude s'est tout d’abord attachée a modéliser environnement et le 
véhicule, avant de faire interagir les deux. Le chapitre III expose cette approche, en 
détaillant tout d’abord le choix des modeles physiques représentatifs de 
l'environnement et du véhicule avant de s’attarder sur leurs quantifications. Concernant 
le véhicule, un modeéle de masse ponctuelle est choisi, qui concentre tous les efforts au 
centre de gravité. La simplicité de ce modele est attractive pour l’application qui est 
recherchée et se justifie par le fait que l’échelle des gradients de vent permet d’utiliser 
une masse ponctuelle pour représenter la dynamique du vol. En revanche, certaines 
contraintes opérationnelles, liées a l’évolution du véhicule, comme la limitation du taux 
de roulis, doivent alors étre déterminées intuitivement car leur évolution n’est régie par 
aucune équation. Une fois ce choix fait, le premier véhicule considéré reprend la 
morphologie de Valbatros, établie par les biologistes. Puis, afin de s’affranchir des 
disparités liées aux dimorphismes entre spécimens et aux incertitudes des mesures 
anatomiques, trois plateformes a voilure fixe, basées sur des architectures de moto- 
planeurs, sont considérées. Elles correspondent a trois échelles de véhicule différentes, 
choisies pour représenter le compromis entre taille et performances de vol plané et 
permettre ainsi d’explorer la faisabilité du vol par dynamic soaring pour différentes 


dimensions de drone. 


En ce qui concerne l'environnement, dans un premier temps, une surface 
pseudo-plane est considérée et un profil logarithmique, représentatif d’un profil de 
vitesse moyen de couche limite atmosphérique, est choisi. En raison des irrégularités de 
surface, la loi logarithmique fait intervenir la longueur de rugosité, qui est en directe 
correspondance avec l'état de surface considéré. Elle fait aussi intervenir, plus 
classiquement, la vitesse de friction qui est une représentation directe de la contrainte 
pariétale. Ainsi, ces deux variables permettent de définir un profil de vitesse moyen, qui 
prend statistiquement en compte les irrégularités de surface. Or il apparait que les 
longueurs de rugosité utilisées dans la littérature sur le DS sont d’un ordre de grandeur 
trop important pour correspondre a celles rencontrées en mer. Une correction est 
proposée avec la prise en compte d’un modele de rugosité qui lie directement la 
longueur de rugosité au vent rencontré, suggérant ainsi une corrélation implicite entre 
contrainte pariétale et état de surface. Le profil de vitesse n’est alors plus figé mais 
dépendant des conditions environnementales. En outre, il apparait dans la littérature 
que les vagues jouent probablement un réle dans la technique de vol des albatros. Si 
interaction vent-vague est un tres vaste sujet, il a été décidé de se concentrer sur 
linfluence d’un vague sinusoidale, corrélée aux conditions de vent via une 
méthodologie simple, reprenant des mesures faites en mer. Cette vague, peu creusée, est 
représentative d’une houle et engendre des déplacements d’air a son voisinage, due au 
mouvement ondulatoire de la vague. Ainsi, la face montante de la vague voit 
l’apparition d’une composante verticale montante de la vitesse air a son voisinage, alors 


qu elle est pourtant sous le vent. 


Une fois que les modéles environnement et véhicule sont déterminés, les 
trajectoires permettant d’exploiter le DS doivent étre établies. L’unique facon de 
prouver la faisabilité de cette technique de vol est en effet de montrer l’existence d’une 
trajectoire de vol y correspondant. A cet effet, il est dans un premier lieu nécessaire de 
mettre en place les équations qui régissent l’évolution de véhicule au sein de son 
environnement. Le point de vue choisi pour l’observation de cette évolution est 
primordial pour l’écriture des équations. Deux options sont possibles, se baser dans un 
repere galiléen lié a la terre ou alors se déplacer avec la masse d’air que le véhicule 
traverse. Les échanges d’énergie entre véhicule et masse dair ont aussi des 
interprétations différentes suivant le référentiel, dans la mesure ow la définition méme 
de l’énergie dépend du point de vue. Le point de vue terrestre est choisi et les six 
€quations de la dynamique, gouvernant l’évolution de six variables d’état liées au 
vehicule, sont écrites. Celles-ci dépendent de |’état méme de ces variables ainsi que des 


variables de contrdéle, en l’occurrence le coefficient de portance et langle de roulis. Si 
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ces parametres ne sont en réalité pas contrélables directement lors du pilotage réaliste 
du véhicule, ce sont les variables de plus bas niveau disponibles avec le modeéle de masse 
° ° 37 ° ° 9 oe 
ponctuelle. Pour garantir toutefois que l’évolution de ces variables n’est pas fantaisiste, 
il est nécessaire de borner leurs taux d’évolution ainsi que leurs dérivées premiere et 
seconde, ceci afin d’obtenir des fonctions doublement continue, physiquement 


cohérentes. 


La singularité du probleme vient du fait de la dépendance aux variables de 
controle, qui pilotent la trajectoire et doivent étre, d'une certaine maniere, guidées dans 
leur évolution afin de trouver la trajectoire permettant d’exploiter le dynamic soaring. 
La méthodologie employée consiste a trouver la meilleure trajectoire, en mettant en 
place et en résolvant un probleme d’optimisation. La formulation générale du probleme 
est assez générique, avec une fonction exprimant l’objectif et des contraintes associées. 
L’objectif est de minimiser la vitesse du vent nécessaire a |’établissement d'une 
trajectoire énergétiquement neutre, c’est-a-dire qui voit les pertes du véhicules 
compensées exactement pas ses gains. Un critére important est la périodicité de la 
trajectoire. En effet, il est aisé pour un véhicule d’extraire durant une certaine période 
de l’énergie au vent. En revanche, rendre ce processus reproductible est le défi du vol 
par DS. Les contraintes du probleme d’optimisation incluent ainsi une périodicité des 
variables, a l'exception, pour certains cas, ou un déplacement horizontal est permis 
entre situation initiale et finale. Il est a noter que la reproductibilité des cycles de 


trajectoires se base ainsi sur l’hypothése d’une homogénéité de la loi de vitesse de vent. 


Une étape plus complexe est la conversion de ce probleme purement 
mathématique en un probleme de dimension finie, qui peut étre ensuite abordé par des 
techniques numériques d optimisation. La premiére tache consiste a diviser l’intervalle 
de temps en sous-intervalles et en exprimant les équations de la dynamique sous forme 
de contraintes, grace a un schéma d intégration numérique sur chaque intervalle. Sans 
rentrer dans les détails, la fonction dérivée, pour chaque équation, est interpolée, sur 
chaque intervalle, par un polynéme quadratique en trois points de collocation, tandis 
que les variables d’états sont interpolés par un polynéme de Hermite. De cette maniére 
lintégration des équations est implicitement effectuée au cours de |l’optimisation. 
D’autres contraintes opérationnelles sont ajoutées, en plus de celles de périodicité, 
comme par exemple le non-décrochage ou la garde au sol. Le probleme engendré est un 
probleme d’optimisation contrainte non linéaire, que l’on résout avec le solveur SNOPT. 
Un cas de validation est effectué en reproduisant des conditions déja étudiées dans la 


littérature et la correspondance entre les deux valide la méthodologie. 
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Une fois la méthodologie établie, le chapitre V s’attache a l’interprétation des 
résultats, avec un accent particulier porté sur la compréhension des mécanismes 
d’extraction d’énergie. Les phases caractéristiques des trajectoires par DS sont décrites 
en se basant sur un cas de trajectoires fermées, pour laquelle le véhicule revient 
exactement a sa position de départ, avec la méme énergie. Cela démontre au passage la 
faisabilité de se maintenir en position, virtuellement indéfiniment, indépendamment de 
la direction du vent. La trajectoire n’est alors pas une boucle simple mais constituée de 
deux boucles internes, enchainées pour former l’aspect d’un 8. Quatre phases de vol 
forment invariablement les trajectoires par dynamic soaring : une montée face au vent, 
un virage dans le vent en haut de boucle, une descente vent arriére et un virage contre 
le vent proche de la surface. Si ces quatre phases se retrouvent invariablement, leurs 
amplitudes dépendent de la nature de la trajectoire (ouverte ou fermée) et du cap net de 
voyage souhaité. On montre que l’extraction d’énergie se fait lors de la montée face au 
vent, du virage haut et de la descente et que le virage contre le vent concentre les 
principales pertes. Il est montré que ces échanges d’énergie sont les résultats du travail 
des forces non conservatives qui s appliquent sur le systéme, en l’occurrence les forces 
aérodynamiques. Des graphiques présentant les triangles des vitesses et projections des 
forces permettent de montrer comment la portance, orientée par la _ vitesse 
aérodynamique, est déviée dans le sens du mouvement lors des trois phases citées et de 
maniére opposée lors virage bas. En outre, l’expression de la puissance des forces se 
développe et fait apparaitre que la trainée peut, elle aussi, avoir une composante qui 
joue un réle moteur, tandis que la seule composante invariablement négative provient 
de la composante de la trainée opposée a la vitesse air. La propension du veéhicule a 
exposer sa face inferieure orthogonalement a la direction du vent lorsque l’altitude 
maximale est atteinte est similaire a ce qui est observé chez les albatros et correspond a 
la configuration qui maximise la puissance extraite au vent. Il est montré comment le 
gradient de vent permet au véhicule de ne pas perdre trop d’énergie lors du virage bas, 
mais qu’en aucun cas [énergie est extraite du gradient de vent. A ce titre, le virage 
contre le vent est une phase particuli¢érement critique pour le vol par DS, dans la 
mesure ou un changement mineur d’altitude a un impact direct sur la vitesse du vent 
rencontrée lors du virage, en raison du fort gradient présent a faible hauteur. La 
contrainte de garde au sol minimum représente des conditions favorables au vol par DS 


lorsqu elle autorise des faibles séparations entre surface et véhicule. 


Apres s étre intéressés au fondamentaux du vol par dynamic soaring, un cas 
d’étude simulant le vol de l’albatros de la maniére la plus réaliste possible est mis en 


place. Dans un premier temps, l’impact d’une longueur de rugosité est évalué. Il est 
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montré que la prise en compte d’une longueur de rugosité représentative des conditions 
océaniques engendre une augmentation de la vitesse du vent nécessaire pour voler par 
DS (Il est montré plus tard que la vitesse de friction n’est en revanche pas impacteée). 
Cela suggeére que les albatros, dont le vol est observé a des vitesses de vent plus faibles, 
utilisent des mécanismes de vol non pris en compte par la simulation sur une surface 
pseudo-plane. Celle-ci est donc enrichie par la présence de vagues, et il est montré que 
le véhicule se déporte du cété de la face montante de la vague pour y effectuer son 
virage bas, la ot: la vague produit une composante verticale montante. II est ainsi établi 
que le véhicule peut bénéficier de la présence de vagues, afin d’abaisser la vitesse de 
vent requise au vol par DS. Toutefois, des écarts significatifs persistent entre la vitesse 
de vent théorique obtenue et celles relevées lors des vols des albatros. Cela souligne des 
raffinements utilisés par les albatros que la simulation proposée ne couvre pas. On peut 
conjecturer sur l’exploitation des rafales au sein d’un environnement en vent non pas 
stationnaire mais turbulent, l’exploitation de vagues plus formées du spectre qui font 
apparaitre des phénomenes de nature différente, comme le décollement de la couche 
limite, ainsi qu'une rugosité de surface plus prononcée que dans notre modele, 


correspondant a une mer plus jeune. 


La these, sintéresse, au cours du chapitre VI, a une approche plus 
mathématique du DS, afin de comprendre comment les variables du probleme peuvent 
étre liées entre elles. En effet, un inconvenient des simulations faites dans le chapitre 
précédent est qu elles sont propres a un véhicule, a des contraintes opérationnelles 
précises et a des conditions de vent particuliéres. Afin de rechercher comment les 
variables liées a ces paramétres interagissent entre elles, indépendamment des 
métriques associ€es, une analyse adimensionnelle des équations est effectuée. Les 
équations de la dynamique, écrites dans le référentiel lié a la masse d’air, se prétent bien 
a cette approche en limitant les angles de projection. Elles sont donc normalisées, via un 
choix de variables de références pour chaque unité. Dans un premier temps, un profil de 
vent linéaire est choisi (avec un gradient constant) et l’adimension se fait aisément, en 
faisant apparaitre un parametre de similitude qui implique notamment la charge alaire 
et le gradient de vent. Ainsi |’ étude fait apparaitre, en théorie, des familles de problémes 
dont les solutions adimensionnées sont identiques entre elles. Il est donc possible, a 
partir d’une solution, d’extrapoler celles qui ont le méme paramétre de similitude. Cela 
est vérifié via des simulations, validant au passage la méthodologie. En revanche, une 
limitation du profil de vent linéaire est qu’il ne fait pas apparaitre l’augmentation du 
gradient de vitesse en proche surface, qui semble pourtant primordial dans le vol par 


DS, au vu des observations faites des albatros et des simulations établies plus tot. Ainsi, 
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cette méme approche est reprise pour un profil de vitesse qui suit la loi logarithmique 
définie précédemment. Une normalisation est trouvée et validée par des simulations, 
mais les solutions identiques entres elles impliquent des contraintes particuliéres entres 
altitude minimale et la vitesse de friction, ce qui, en pratique, semble peu utile. En 
revanche, cette approche illustre quelques résultats intéressants, notamment en faisant 
apparaitre la charge alaire comme parametre de similitude et en montrant que la vitesse 


de friction nécessaire ne varie pas quand la longueur de rugosité change. 


Enfin, une fois quelques propriétés remarquables fournies par [analyse 
adimensionnelle, la these s’attache, lors du dernier chapitre, a simuler les variations de 
performances qui peuvent étre obtenues lors du vol par DS, en changeant le véhicule, 
les conditions opérationnelles ou encore les conditions environnementales. Ainsi, les 
trois véhicules sont testés, et il apparait que le compromis entre faible envergure et 
performances de plané tourne a l’avantage de ces derniéres. En effet, le véhicule de plus 
grande envergure parvient a voler par DS avec le minimum de vent requis. Il apparait 
ainsi que le DS s’adapte moins bien, dans les conditions simulées, aux véhicules de 
faibles dimensions, mais que les perspectives pour des drones de plus grandes tailles 
sont intéressantes. Ensuite, pour un véhicule donné, les conditions opérationnelles et de 
surface sont changeéees. Il apparait qu'une forte rugosité permet de réduire la vitesse du 
vent nécessaire (méme si la vitesse de friction est constante) et qu’elle permet aussi de 
maximiser l’étendue des angles de voyage accessible par DS. D’autre part, une faible 
garde au sol a aussi les deux avantages cités précédemment. Enfin, dans la mesure ot du 
vol purement plané exige des conditions particuliéres en vent, le vol motorisé est 
approche pour complémenter le DS lorsque le vent est trop faible pour permettre du vol 
purement plane. L’autonomie permise alors par le DS est alors étudiée en fonction du 
cap net de voyage choisi par rapport au vent. Puis, le vent est progressivement diminué. 
Il apparait que le DS permet d’améliorer significativement l’autonomie du véhicule en 
conditions de vent, mais qu’au fur et 4 mesure que le vent diminue, l’avantage procuré 


par le DS faiblit et se rétrécit a une zone de direction de vol de plus en plus faible. 


Ainsi il apparait que le DS, tel que simulé sur une plaque rugueuse pseudo- 
plane, permet au véhicule de se maintenir en l’air et de se déplacer gratuitement. La 
diminution du vent proche de la surface dispense la perte de trop d’énergie lors du 
virage contre le vent et permet ainsi de limiter la puissance du vent nécessaire au vol 
par DS. En revanche, il semble peu probable que les albatros parviennent a leurs 
performances en se contentant de ces seuls phénomenes. En effet, méme en ajoutant un 
modele de vague aux simulations, les vents théoriques obtenus sont encore trop élevés 


pour correspondre aux conditions mesurées en mer. Si les albatros tirent aussi parti 


d’interactions de surface plus complexes que celles modélisées, il apparait difficile pour 
un drone de piloter son vol en proche surface, au sein d'un environnement hautement 
turbulent. En revanche, une option peut étre d’exploiter justement ces turbulences, a la 
maniére du gust soaring, le long des trajectoires de dynamic soaring. Les simulations 
destinées a reproduire ces conditions devront probablement approcher un modele de 
véhicule plus complexe que celui de masse ponctuelle, au vu de la faible échelle des 
gradients de vent turbulents. D’autre part, une voie de développement nécessaire 
concerne le pilotage du dynamic soaring. Deux approches peuvent étre suivies, en 
mémorisant des dizaines de cycles de trajectoires, pour différentes conditions, puis en 
sélectionnant celles qui correspondent aux conditions souhaitées. Ou alors en mettant 
en place des lois de contréle qui se basent sur la compréhension physique du dynamic 
soaring. Quel que soit le choix du mode de pilotage, la faisabilité du vol par DS reposera 


sur la capacité du véhicule a voler en proche surface, en présence de vent. 
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Souvent, pour s'amuser, les hommes d'équipage 
Prennent des albatros, vastes oiseaux des mers, 
Qui suivent, indolents compagnons de voyage, 
Le navire glissant sur les gouffres amers. 


A peine les ont-ils déposés sur les planches, 

Que ces rois de l'azur, maladroits et honteux, 
Laissent piteusement leurs grandes ailes blanches 
Comme des avirons trainer a cété d'eux. 


Ce voyageur ailé, comme il est gauche et veule ! 
Lui, naguere si beau, qu'il est comique et laid ! 
L'un agace son bec avec un brile-gueule, 
L'autre mime, en boitant, l'infirme qui volait ! 


Le Poéte est semblable au prince des nuées 
Qui hante la tempéte et se rit de l'archer ; 
Exilé sur le sol au milieu des huées, 


Ses ailes de géant l'empéchent de marcher. 


Charles Baudelaire, L'Albatros, in Les Fleurs du Mal, 1861 
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(CHAPTER [| 


FINDING (,UIDANCE FROM ALBATROSS FLIGHT 


For many years, in the early infancy of aeronautics, human beings have been inspired by birds 
and have tried to mimic some of their features in order to take their first steps into the air. 
However, their theoretical background was sometimes lacking and the empirical field of 
knowledge was built through many hurdles. Since then, aircraft have explored a vast domain of 
flight, pushing boundaries of size, altitude and speed to levels respectively bigger, higher and 
faster than any living bird. Yet, scientific advances in the aeronautical field have not turned 
engineers away from other potential guidance that could be found by looking at nature [1]. In 
the recent-developing field of small Unmanned Aerial Vehicles (UAV), vehicles have been 
reduced in size to dimensions where birds and even insects compare on a 1:1 scale ratio, 
opening new perspectives for the potential of bio-mimicry. Festo’s SmartBird was among the 
first vehicles of that size to master flapping flight with kinematics inspired from birds and a 
design based on the herring gull [2]. Aerovironment’s Nano Hummingbird mimics the high- 
frequency flapping of the bird it is inspired from so as to enable hovering flight [3]. Georgia 
Institute of Technology's Dragonfly is based on the eponymous insect with similar dimensions 
[4]. A general introduction to the area of small UAVs will first be presented, which will lead to 
developments on the issue of long endurance flight, and then some of the explored solutions 
will be presented. An alternative that addresses this issue will be emphasized by shedding light 
on a peculiar biological system represented by the albatross genus. Finally the main features of 
their flight technique, called dynamic soaring, will be provided. Dynamic soaring will be 


referred to as DS in the remaining of this thesis 
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I.1 The UAV Context 


UAVs date from the early hours of aviation and have been developed alongside their manned 
siblings in order to address three specific aspects of flight: “dirty”, “dull” and “dangerous”. By 
replacing the human being on board, UAVs can be used in “dirty” environments, such as 
contaminated areas or high altitude, where preserving human health is a challenge with 
significant associated engineering constraints. Or it can be used in “dangerous” environments, 
such as military conflicts, where the integrity of the vehicle and the life of a potential pilot are 
deliberately put at risk to perform a mission. On top of that, “dull” missions, such as 
surveillance and monitoring, where the presence of a human being is not required, can be 
performed by UAVs. Often, a UAV is developed to combine several of those aspects. For 
instance, the Northrop X-47B, which recently achieved the first autonomous landing on an 
aircraft carrier [5], is a large UAV of several tons, with a fighter-type profile. It prefigures a 
generation of UAVs that will be exposed to obvious dangerous environments while taking 
advantage of the absence of a human being on board to achieve higher levels of stealth, 
aerodynamics and situation awareness. Another example is the development of high-altitude 
solar-powered Titan UAVs [6], which will stay aloft for up to five years to provide 
communication coverage and hence perform a satellite-like mission anywhere around the 
globe, whilst having the flexibility and cost of an aircraft. Not only is a human presence not 
needed for the job, but it would jeopardize any chance of mission feasibility by significantly 
increasing the mass of the ultra-lightweight airframe and by requiring regular stops to avoid 
pilot exhaustion. To that respect, it is somehow ironic that for the challenge set by Solar 
Impulse [7] of flying around the world on solar power, many technical hurdles and performance 
limitations are entailed by the presence of a human on board, which is more symbolic than 


necessary. 


The domain of small-sized UAVs, which are designed for much lower loads than 
manned-airframes, has seen a significant boost in recent years thanks to the miniaturization of 
technology, the development of lightweight materials and the progressive development of 
artificial intelligence. Once the pilot is removed, all size constraints related to the presence of a 
human being on board can be withdrawn and UAVs can be downsized to a scale which best 
suits their requirements. They address a potentially vast scope of mission scenarii. Advantages 
are numerous, such as providing a constrained volume of flight for indoor applications, 
reducing overall costs, or lowering the environmental impact. However, in the domain of 


small-sized UAVs, the limited mass and volume significantly challenge energy storage options. 
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Besides, those vehicles usually fly at rather low Reynolds numbers which are characteristic of a 
degraded aerodynamic efficiency. This leads to significant endurance limitations while, in fact, 
long-endurance is acknowledged to be a key factor of UAV utility [8]. In other words, without 
any pilot on board, endurance is only limited by the amount of available energy, which is part 


of the appeal for UAVs, but small vehicles are still faced with low endurance constraints. 


Innovative approaches are therefore sought in order to address that conundrum. 





Figure 1.1: Jon Tiger, a fuel-cell-powered, 5.15 metres- Figure 1.2: Photovoltaic panels installed as a 
wingspan, which sustained flight for 48 hours in April laser receiver unit under the right wing of 
2013 [9] using liquid hydrogen. the 3 metres-wingspan Lockheed Stalker. 


One plan of attack concerns the on board electric energy supply chain and looks at 
new technical solutions as for energy storage that departs from the traditional battery power 
source. Research on fuel-cell propulsion for long endurance UAVs is particularly active since 
their high energy density has the potential to dramatically extend endurance [9]. Fuel-cells take 
advantage of the high specific energy of hydrogen fuel, which can be stored under high 
pressure in a specific vessel or in its liquid form to benefit from an even higher energy per unit 
volume [10]. Recently, the Jon Tiger, pictured in Fig. I.1, performed a flight of 48 hours using 
only 500 grams of liquefied hydrogen to feed its fuel-cell [11]. However, although fuel-cells 
provide high specific energy together with high power, their size and volume can only be 
constrained to a limited level. On Jon Tiger, the fuel-cell system only (fuel cell, fuel tank, 
regulator, cooling) weights 5.5 kg without either fuel or propulsion system [9]. Moreover, the 
need for heat transfer area increases the size of the fuselage, leading to increasing parasitic 
drag. In order to achieve a maximum lift to drag ratio of 17, Ion Tiger has a wingspan of up to 
5.15 metres, for a practical payload of 2.3 kg and an overall mass of 16 kg. Further advances are 


required to benefit from the high efficiency of fuel-cell propulsion on smaller scale vehicles. 


Another approach to cope with energy storage constraints is to leave the power 


supply on the ground and to establish a transmitting power link between the ground and the 
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station. That has been achieved recently with the Lockheed Stalker, seen in Fig. 1.2, by using a 
laser as an illuminating power source beamed to photovoltaic cells on the UAV [12]. As long as 
the vehicle remains in line of sight of the ground platform, it can benefit from a charge in mid- 
air to cope with power requirement in-flight [13]. On top of enabling a virtually unlimited 
endurance, the high energy density of the laser can enlarge the performance envelope of the 
vehicle, such as increased payload or higher airspeed, by allowing higher power requirements. 
It has been underlined that this technology could multiply the power received by an otherwise 
similar solar-powered UAV by a factor ranging from 2 to 10, depending on the technology used 
for laser and photovoltaic cells, with more consistent charge availability [14]. However, the 
efficiency of the power transmission is rather low and suffers from dependence on cloud 
coverage. The theoretical beamed-power is also limited by the heating of the illuminated 
surface of the receiver on the UAV, due to the high energy density of the laser. Besides, the 
ground station must be equipped with accurate fully-automated tracking to target only the 
laser receiver area of the UAV. Moreover, the laser-powered UAV is limited to areas where it is 
at range of a ground station, which itself must have abundant power resources, even though 
the transmission can be intermittent. Hence, the virtually unlimited endurance does not 
translate much into improved vehicle range but rather enables to keep the UAV aloft without 


the need for landing and take-off. 


One different option for improving endurance is to seek energy from the surrounding 
environment. Studies about long endurance vehicles have mainly focused on using solar power 
to maintain a vehicle aloft. Prototypes began in the seventies with flights durations under 30 
minutes [15]. With developments in lightweight materials and of photovoltaic cells, vehicles 
have started to benefit from solar power in excess of their minimum requirements, therefore 
unlocking the potential to overcome periods of sun shortage of increasing lengths. This trend 
finally opened the way for solar-powered flights all around the clock. The shortage of solar 
exposition during the night is balanced by the excess of power received throughout daylight 
that is stored into potential energy and battery charge [16]. The Qinetiq Zephyr, a 30 kg, 18- 
metres-wingspan-solar-powered UAV, displayed in Fig. I.3, paved the way by performing a 
two-week-long flight in 2010, setting the absolute record for flight duration by a winged vehicle 
heavier than air [17]. Up-scaled ongoing projects, such as Titan Solara or Boeing Solar Eagle aim 
at demonstrating and exploiting solar-powered perpetual flights, with the latest claiming a five- 
year-duration objective. However, those projects are among the largest UAVs and cannot be 
quite representative of the small UAV class. Indeed, those vehicles have low wing loadings and 


are designed for altitudes above 50,000 ft. Smaller UAVs, destined to fly in the lower 
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atmosphere, do not abide by the same constraints. Still, Aerovironment derived a solar-powered 
version of its Puma UAV, see Fig. 1.4, which sustained flight for 9 hours in August 2013 [18]. 
This range of performance, for such small-typed UAV, is however highly dependent on the 


daily sun coverage. 





Figure 1.3: Qinetig Zephyr, a solar-powered, 18 Figure 1.4: Aerovironment Puma AE, a solar- 
metres-wingspan-UAV, which sustained flight for | powered, 2.8-metres-wingspan, which sustained 
two weeks in July 2010. flight for up to 9 hours in August 2013. 


Solar array is not the only exploitable energy source in the direct environment of 
UAVs. Indeed, aerology mechanisms in the lower atmosphere, induced by local or global 
difference in temperature, entail movements of air of different scale and represent a vast source 
of energy. This alternative largely remains to be explored in the domain of UAVs, all the more 
as their eventual smaller mass may allow them to benefit from atmospheric phenomena of 
smaller scale compared with manned-gliders. For instance, some of these convective 
mechanisms are called thermals and occur when pockets of air near the ground become less 
dense, due to either heating or humidity change [19]. The challenge consists in finding a way to 
exploit it without compromising the mission scenario, since convective mechanisms directly 
affect the kinematics of any vehicle, all the more if those are of little mass. Some birds of prey 
take advantage of thermals by flying into those pockets of rising air and increasing their 
altitude, hence increasing their total energy, without flapping their wings [20]. Their 
subsequent elevated position, combined with a sharp sight, enables them to spot their prey and 
to gain the necessary speed to catch them off-guard. This technique of flight is called thermal 
soaring and has been investigated [21] and demonstrated to be practically feasible for a UAV by 
Allen in 2007 [19]. In this study, a 4.30 metres-wingspan, 6.8 kilograms-weight, Cloud Swift 
UAV is modified in order to detect and exploit thermals. Flight tests have attested that 
autonomous soaring was possible and multiple thermals were used to achieve a flight time of 
one hour. The wind can also be deflected upwards on the windward side of a ridge or a slope, 


where it would be possible to perform slope soaring by flying inside the area of rising air. 
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Those above techniques can be qualified by a broader term, as static soaring, which 
expresses the fact that a static regime of flight enables a steady increase in energy. The bird or 
the vehicle can therefore, in theory, continuously extract energy by flying steadily in the 


appropriate zone of rising air. 


However, another type of flight, more subtle, has yet again emerged by looking at 
nature. It is the main focus of the present research work and is inspired by the flight of 


albatrosses. 
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I.2 The Albatross Legacy 


Albatrosses have been at the origin of the discovery and understanding of DS flight technique. 
To this day, they remain the only ones to sustainably exploit DS flight over open ocean 
surfaces. Besides, the lack of experimental data on DS makes their study all the more critical. 
Albatross genuses form a family of fascinating birds, with peculiar behaviours, which have led 
to wide biological research over past decades. In the framework of this research, an extensive 
review of these contributions is achieved in order to retrieve potential inputs to DS knowledge 


within the biological scope of albatross flight. 





Figure I.5: Wandering Albatross (Diomedea Exulans) in flight close to the surface. Copyrights Kimball 
Chen, www.keaphotography.org 


1.2.1 Morphology Characteristics 


Albatrosses are among the biggest birds on the planet. One gender in particular, the 
Wandering Albatross (Diomedea Exulans), see Fig. 1.5 and Fig. I.6, has been recorded with a 
weight of 12.7 kg and the maximum wingspan at 3.7 metres [22]. Most individuals would weigh 
from 7 kg to 10 kg [23, 24], with mean weights estimated, at Crozet Islands, around 7.84 kg for 
the female and at 9.44 kg for the male [24]. As for the wingspan, the sexual dimorphism is less 
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pronounced with mean values of 2.99 metres for the female and 3.11 metres for the male [24]. 


Table I-1 sums up other mean values, adapted from Schaffer et al. [24]. 


Table I-1: Mean characteristics of Wandering Albatross, measured on Crozet Islands in 
1999, adapted from Shaffer et al. [24]. 


Parameter Female (mean) Male (mean) 
mass (kg) 7.84 9.44 
span (m) 2.99 3.11 
Wing area (m’) 0.586 0.626 
Wing loading (N/m’) 132 148 
Mean wing chord (m) 0.196 0.201 
Aspect Ratio 15.3 15.5 
Wandering Albatross 





Figure I.6: Planform of Wandering Albatross, compared on the same scale with that of a common pigeon. 
Adapted from Pennycuick [23]. 


One of the first striking features of albatross morphology is their long and slender 
wings, with the highest aspect ratio of any other living bird [23, 25]. For the sake of 
comparison, vultures are other remarkable soaring birds, but have an estimated aspect ratio 
around 6.9 [26], less than half of those of albatrosses. Another peculiar aspect of the anatomy of 


albatrosses is their subsequent high wing loading, see Table I-1, which is also significantly 
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higher than those of vultures, with a mean wing loading estimated at 76.5 N/m’ [26]. Such a 
heavy-loaded, high-aspect-ratio-wing trivially entails a high bending moment at the base of the 
wing, which requires significant force in order to maintain wings level or even to flap. This 
concurs with all observations that albatrosses do not sustain flapping flight, apart from rare 
take-off and landings [23, 27, 28, 29]. Besides, Pennycuick [23] observed that when fully 
extended, wings of albatrosses have a lock that prevents it to raise above the horizontal and 
level position. This materialized following dissections by the presence of a tendon which can 
maintain tension indefinitely at no metabolic cost [25, 23]. Hence, it allows albatrosses to 
sustain wings level at no energy expenditure, for any load factor within their structural limits. 
It can be concluded that the morphology of albatrosses is particularly adapted to gliding flight 
with a tolerance to high load factors, enabling them to cope with soaring flight within a windy 
and turbulent environment. Besides, their wing loadings and wing aspect ratio suggest high 
flying airspeeds and superior lift-to-drag ratio. Pennycuick calculates for the Wandering 


Albatross, using its software Flight [25], a best glide ratio of 21.2 at an airspeed of 16 ms. 


[1.2.2 Travelling Performances & Sensitivity to Wind 


Nothing much was known about their behaviour at sea until a breakthrough study 
was published in 1990 [30], following the first satellite-tracking campaign of wandering 
albatrosses. It shows that foraging specimens performed trips of several thousand kilometres 
out of the breeding site in Crozet Islands, the maximum standing out at 15 000 kilometres. It 
also highlights travelling speeds up to 80 km.h’ and daily speeds up to 936 km/day for the 
Wandering Albatross. This study revealed travelling performances that were far greater than the 
highest estimates at the time. Those flight performances were later confirmed by Prince et al. 
[31] from South Georgia and by Weimerskirch et al. [32, 33]. More recent tracking studies 
conducted by Croxall et al. [34] were focused on year-round behaviour, outside of the breeding 
phase of albatrosses. It shows that some specimens achieved a complete circumnavigation of 
the southern ocean, the fastest in just 46 days. By analysing travelling performances over three 
sub-segments of those circumnavigations, where averages up to 950 kilometres per day were 
achieved for periods up to 13 days, authors suggest that the round-the-world journey could be 
completed in 30 days. Subsequent data are summed up in Table I-2. Those indicate that 
albatrosses are able to sustain net travelling speeds, that is to say ground speeds averaged over 
large-scale movements, around 55 km.h” over distances up to 800 kilometres. Those travelling 
speeds could reach up to 85 km.h" over shorter distances while maximum ground speeds have 


been recorded at 135 km.h”, during a specific high-frequency GPS-tracking study [33]. 
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Table I-2: Summary of findings from satellite tracking studies of foraging wandering 
albatrosses, adapted from [30, 31, 32, 33, 34]. 








; Average Mean net Max 
Mean trip Mean trip dij 
istance ground ground Average 
duration length d d d 
(days) (km) per day spee spee GPS rate 
Reference (km) (km/h) (km/h) 
Jouventin et al., 11.8 GPS 
1990, [30] 24.4 8045 313 54.6 81.2 locations 
per day 
Prince et al., 15 GPS 
1992, [31] 6.9 4646 675 ie 88.1 locations 
per day 
Weimerskirch et ; Every 90 
al., 1993, [32] 11.6 6091 925 21.8 N.A. P 
Weimerskirch et 
al., 2002, [33] N.A. N.A. N.A. 94.5 135 Every 1s 
Croxall et al. Twice a 
. if 
2005, [34]** 9.9 8718 872 36.3 N.A. day 


“Mean net ground speeds were simply averaged over the whole foraging trip, hence taking into 
account periods when birds are sitting on the water. Besides, foraging trips were closed loops, 
so those speeds are representative of travelling performances over a closed circuit. Other net 
speeds are calculated between two specific landings, along an open segment of travel, 
regardless of the direction of travel. 


“Tagged specimens were gray-headed albatrosses and trips were open-loops. 


‘Speed averaged over the whole mean duration. 


Another common conclusion between those studies is the correlation between 
albatross travelling performances and wind speed. It was estimated that such travelling 
performances were energetically impossible for albatrosses to reach in the absence of wind [29, 
35]. Besides, it was measured that albatrosses usually spent most of their time aloft, with only 
short stops on the water [30], but that high-pressure weather systems, with their associated 
absence of winds, were grounding albatross on the water for much longer periods until 
stronger winds reappeared [30]. On a global scale, both Fig. I.7 and Fig. I.8 highlight the explicit 
correlation between albatross and wind distributions. Albatross species are endemic to remote 
islands such as Crozet, Kerguelen, South Georgia....Their wide distribution, exposed in Fig I.7, is 
therefore only the result of singular travel abilities, as underlined before, which carries them 
over all longitudes. However their presence is strictly out of tropical latitudes, with the only 


exception being the Chatam Albatross seen in orange on Fig. I.7. Albatross species are reported 
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in every zone of average wind velocity above 9 m.s’, but north Atlantic, and reciprocally very 


little albatross presence is seen outside of those areas. 
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Figure 1.7: Satellite tracking locations of albatrosses and petrels, reproduced with permission from 
Birdlife International [36]. All coloured-locations represent albatross species, the Wandering Albatross is 
pictured in bright red. 
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Figure 1.8: Estimate of wind velocity (m.s) at 50m, average over a 10-year period [37]. These data were 
obtained from the NASA Langley Research Center Atmospheric Science Data Center Surface 
meteorological and Solar Energy (SSE) web portal supported by the NASA LaRC POWER Project. 


Zones of strong winds are very much delimited by certain latitudes, pictured in Fig. 1.8, 
where prevailing winds are west-to-east currents, called the westerlies. Those are the strongest 
in the southern hemisphere, within the -40°,-50°,-60° latitudes [38], referred to as the Roaring 


Forties, Furious Fifties, Screaming Sixties. 
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Although the wind strength cannot explain in itself the distribution of albatrosses, it 
reinforces the assertion that albatrosses use a wind-related phenomenon to propel themselves 
effortlessly around the globe. More precisely, it was measured that albatrosses rarely sustain 
flight for winds under 8 m.s’ [39], that small albatrosses species are never observed gliding 
below 5 m.s’ [28] and that the distribution of wandering albatrosses is limited to areas where 
the average annual wind speed exceeds 7.5 m.s” [38], which is very much consistent with what 
can be observed from Fig. I.7 and Fig. I.8. Weimerskirch et al. [38] even describe how a shift in 
wind patterns over the last 20 years affected the distribution and the average mass of 
wandering albatrosses. Moreover, traveling performances of albatrosses have been measured to 
be intrinsically correlated with wind strength [29, 38]. Weimerskirch et al. [38] show indeed a 
correlation between the evolution of average travelling speeds of albatrosses and mean wind 
strength at Crozet Islands over the past 20 years, seen in Fig. I.9. Although data do not permit 
to establish a one-to-one correspondence, the year 2010 appears to confirm a dependence of 


travelling performances of albatrosses on wind strength. 
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Figure I.9: Evolution of mean wind strength at Crozet Islands over years 1990-2010 (top) and the 
associate evolution of travelling performances of female wandering albatrosses (bottom). Adapted from 
Weimerskirch et al. [38], reprinted with permission from AAAS. 


Further refinements about the flight technique of albatrosses can be learnt by getting a 
closer look at the way birds and wind currents interact. To start with, albatrosses have almost 
never been recorded to fly routes directly against the wind [30, 38, 35, 29, 32]. In particular, out 


of the 15 round-the-world journeys tracked by Croxall et al. [34], all were performed eastward, 
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in the direction of prevailing winds. This result seems somehow intuitive as the mass of air 
would have an overall tendency to carry the albatross throughout its displacement. Yet, travel 
directions of albatrosses show a predominant tendency to have a sidewind component [30, 29, 
35, 38]. That is to say that albatrosses rarely fly leeward either, directly with the wind, but 


rather show a travel direction tilted with respect to the wind direction, as seen in Fig. I.11. 
| Wind 
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Figure I.10 (above): Definition of an 
azimuth angle w between the albatross 
travel direction and the direction from 
which the wind is coming. 


Figure I.11 (right): Speed and direction of 
travel of albatrosses, with the wind 
direction, obtained by radar tracking. 
Adapted from Alerstam et al. [29]. 





The azimuth angle between the albatross travel direction and wind origin is defined in Fig. 1.10. 
Jouventin et al. [30] estimates that 40 % of the time, albatrosses fly at an angle w between 112.5° 
and 157.5°during the outward journey and that 90 % of the time, w is between 67.5°and 180°. 
Quite identically, Weimerskirch et al. [35] estimates that 35 % of the time, yw is between 120° 
and 150° and that the ratio goes up to 80 % between 90° and 180°. Results from Alertsam et al. 
[29] are mapped on Fig. I.11 and plot the y-distribution and the travelling speed related to the 
mean wind strength. It shows that the stronger the wind is, the higher both the average W and 
the travelling speed are. It hence highlights the tendency of the mean travel direction to shift 
leeward when the wind increases and the way travelling performances improve with the wind 
strength. Furthermore, although the sample size is rather reduced and precise values of wind 


strength are unknown, the speed of travel appears to be maximum for values of w close to 140°. 


It can be observed that for winds above 13 m.s’, no albatross was seen to progress against the 
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wind, which partly explains why circumnavigations recorded by Croxall et al. [34] were all 
achieved in the direction of prevailing winds. The travel direction should have a symmetrical 
distribution with respect to the wind direction. Indeed, albatrosses should behave indifferently 
whether the wind is coming from their left or their right. In most of studies, values of w are 
spread between 0° and 180°, without even mentioning if the wind comes from port or starboard 
with respect to the bird. The unsymmetrical distribution found by Alertsam et al. in Fig. I.11 
could be explained by a behavioural preference for certain routes in the area where the study 


was conducted. 


Those observations however question how those birds then manage to perform closed 
loops from their colonies. They manage to come back to their breeding site after thousand- 
kilometres—long-foraging trips, within an area where west-to-east winds significantly prevail 
[38], while it has been evidenced that their ability to go against the wind is very much limited. 
The Figure 1.12 has more about their overall flight strategies: two albatrosses were tracked 
leaving Crozet Islands at the same time, on the same day. The simultaneity of tracks is 
particularly interesting as it reveals the flight strategies of two specimens exposed to identical 


wind conditions. 
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Figure 1.12: Simultaneous trips of two males from Crozet Islands. Numbers indicates the successive 
counts of midnight locations. Reproduced with permission from Weimerskirch et al. [32]. 


It can be observed that although the two birds are most of the time over 50 kilometres apart, 
their overall trajectories follow very similar patterns over time. Moreover, changes in travel 


headings are correlated with changes in wind direction. First, this confirms that albatross 
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movements are intrinsically correlated with wind conditions. Then, it also concurs that 
albatross fly with the wind coming from their back and on their side the majority of the time. 
The azimuth angle defined in Fig. I.10 can be estimated to vary between 90° and 150° with the 
wind coming from either side of the birds. Still, for periods up to several days, birds were flying 
with the wind coming from only one side, which could explain asymmetries in measurements 
made by Alerstam et al. [29] on Fig. 1.11. Furthermore, most of the time, the angle between 
wind and travel heading is close to 120°. This is especially explicit on the second leg and the 
fourth leg of the trip and quite consistent too, as large scale movements are very much straight 
lines with a constant heading with respect to the wind. Moreover, it shows that birds are able 
to perform a closed loop while the zonal wind, ie the component of the wind along the west- 
east direction, is always directed from east to west, as it is typically the case within those 
latitudes [38]. The flight strategy takes advantage of changes in the north-south component of 
the wind, also called the meridional wind. Indeed, those shifts enable birds to change their 
travel headings and to progress westwards, even though the zonal wind is constantly directed 


eastwards. 


Up to 90 % of the time, the northern loop from Crozet Islands has been recorded to be 
anti-clockwise [35], just like the two examples pictured in Fig. 1.12. Similarly, the southern loop 
is performed clockwise 90 % of the time. This reveals that the significant travel performances of 
albatrosses are possible thanks to predictable overall wind patterns over latitudes around the 


colony, which is confirmed by Fig. 1.13. 
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Figure I.13: Distribution of the Wandering Albatross with respect to mean wind strength and direction. 
From Weimerskirch et al. [38], reprinted with permission from AAAS. 


The distribution of albatrosses, depicted in Fig. 1.13, depicts that the maximum is trivially 


located on the colony, but local peaks can be observed at the very same latitudes which see 
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changes in mean wind directions, for both zonal and meridional components. One is 
particularly marked, for females, north of the colony, around 38° south. This latitude 
corresponds to change in mean zonal wind from eastward to westward and in meridional wind 
from southward to northward. Such changes have been demonstrated to be necessary for 
albatrosses to alter their travel headings. The local peak in albatross presence at this latitude 
can therefore be interpreted as the average upper edge of the northern loop they perform, 
where they would change heading. This is consistent with loops displayed in Fig. 1.12. This area 
is by definition more prone to uncertainties regarding the wind direction, as mean values are 
close to zero, which would locally affect travel performances and explain the increase in 
albatross presence. Again, Fig. I.12 is very much consistent with this interpretation as birds 
spent two successive nights, number 4 and 5, in this area before starting their journey back to 
the colony. It can be assumed that the duration of foraging trips, which is directly correlated to 


the speed of travel, varies depending whether mean wind predictions are met or not. 


1.2.3 Flight Behaviour and Energetics 


Figure 1.13 contains another valuable piece of information. It depicts that the 
distribution is quite uneven regarding males and females and a closer look show that females 
tend to prefer areas of lower wind strength, while males seem to favour higher winds. This is 
very much relative, as females travel areas where the mean wind speed is over 7 m.s’. One 
explanation to it could be the sexual dimorphism observed for the Wandering Albatross [24]. 
Indeed, as summed up in Table I-1, males have a wing loading that is on average 12 % higher 
than that of females. It results in an aerodynamic polar shifted towards higher airspeeds, which 
translates into a best-glide efficiency at a higher airspeed and a faster minimal sink rate [40]. It 
means that males could favour higher winds, where they could extract the higher energy they 
require to sustain aloft while also optimizing gliding flight at higher airspeeds which are 
characteristics to windier conditions [24, 35]. Inversely, females could take advantage of lighter 


wind conditions. 


This trend can been generalized for different albatross species, where wing loadings 
can vary up to 60 % [39]. Suryan et al. [39] specifically focus on specific albatross subspecies, 
including two with peculiar wing loadings with respect to their body size. The Waved Albatross 
is the only albatross located in the tropics, on Galapagos Islands and has a rather low wing 
loading of around 90 N/m’. The Short-tailed Albatross is the largest albatross species in the 
north hemisphere and has a wing loading of around 130 N/m’. It was measured that the Short- 


tailed Albatross, was the only species to travel long distances under windier conditions 
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compared to their short-range movements. The lighter Waved Albatross did not appear affected 
by such a disparity and was the species which sustained flight for the lowest wind strength, 
around 6 ms . Besides, it was also the only albatross to apparently fly in any heading 
disregarding of the wind direction. On the contrary, the heavier Waved Albatross were most of 
the time flying in the general direction of the wind. Hence the flight technique used by 
albatrosses seems to associate a higher wing loading to a greater dependence on the required 


wind strength as well as to a stronger bound between wind and flight direction. 


In order to fully understand the energetics involved in the flight strategy of 
albatrosses, it is important to somehow measure their energy expenditure in flight. It is 
assumed to be rather low since albatrosses mainly glide without flapping of their wings and do 
not exert power to maintain their wings level thanks to a specific tendon. Besides, the eating 
frequency of albatrosses is rather low, as they feed from rare floating preys only, with on 
average one prey every 100 kilometres [32]. Pennycuick [23] assumed that albatrosses in flight 
consumed twice their basal metabolic power, and estimated that they could fly 936 kilometres 
while consuming fat equivalent to 1 % of their body mass. As a matter of comparison, a small 
petrel would stand at 38.8 kilometres. Weimerskirch et al. [35] explores further the problematic 
of in-flight energy expenditure, in particular in relation to wind conditions, by measuring the 


heart rate of tracked albatrosses. Average results are presented in Fig. 1.14. 
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Figure I.14: Average heart rates of albatrosses for each activity. Adapted from Weimerskirch et al. [35]. 


The metabolic basal rate is obtained when birds are on their nest; the average is slightly above 


60 b.p.m (beats per minute). The biggest power requirement comes during take-off with an 
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averaged heart rate that peaks close to 130 b.p.m.. During this phase, which typically occurs 15 
times per day [35], albatrosses not only have to flap their wings but also to gain speed, quite 
similarly to an aircraft, in order to generate sufficient lift. It reinforces the assertion that 
albatrosses are not adapted to frequent landings and take-offs. The heart rate decreases 
progressively for long periods up to several hours after take-off to reach an average value 
slightly above 95 b.p.m., which is only 50 % higher than the basal metabolic rate, showing that 
the assumption from Pennycuick of twice the basal rate was rather conservative. In-flight 
power requirements are actually significantly lower than when walking on the ground. This 
emphasizes the peculiarity of their flight technique energy-wise. The longer albatrosses are 
airborne, the wider is the range of measured heart rates, as seen in Fig. 1.14. We understand it 
as an increasing dependence on flight conditions, such as the travel heading with respect to the 
wind direction, which have a higher probability to change for extended flights. This is 
somehow corroborated by the analysis of the evolution of average heart rate with relative 


travel heading, as displayed in Fig. I.15. 


Tail Head 
wind wind 


Heart rate (b.p.m.) 





180 150 120 90 60 30 () 
Wner (deg.) 


Figure I.15: Heart rates varied according to the angle between the wind direction and the flight direction. 
Adapted from Weimerskirch et al. [35]. 


Hearts rates and therefore power requirements are the highest when birds are facing 
the wind, which happens less than 5 % of the time. Then, heart rates decrease regularly when 
the flight heading shift windward. The overall average of 90 b.p.m. indicates that most of the 
time, birds are within 90° to 180° with respect to the wind incoming direction. However, given 
the assumption that albatrosses do not sustain flapping flight, whatever the relative flight 
heading is, it is a priori rather difficult to interpret the slight but clear evolution of power 


consumption in the 90° to 180° range. This concludes on the fact that their flight technique is 
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optimized for a delimited range of travelling angles with respect of the wind. Outside of that 


range, not only flight speeds decrease, but power requirements surge. 


Further contributions to the knowledge of albatross flight technique is based on 
observations or on high-frequency, small scale tracking, which in their vast majority are only 
two dimensional in the horizontal plane. Still, it could be both recorded and observed that on a 
small scale, albatross movements are rather intricate, with turns, descents and pull-ups [32, 23, 
29]. One metric which is representative to this sinuous path is the straightness ratio, which is 
obtained by dividing the straight-line distance over large scale movements by the actual 
distance covered. Data however usually only take horizontal distance covered. For albatrosses, 
straightness ratio is probably dependent on wind conditions, and estimates are between 0.7 [23] 
and 0.9 [29]. A 0.7 value for the straightness ratio means that albatrosses fly an actual distance 
50 % higher than the straight-line distance they travel. In terms of variation in height, 
albatrosses have never been observed higher than 30 metres [23, 41, 42, 43, 44], where the 
average maximum height should be between 10 metres and 15 metres. A rather recent high- 
frequency acquisition campaigned by Sachs et al. [44] could obtain accurate measurements of 
the Wandering Albatross in flight. A typical trajectory is pictured on Fig. 1.16, using sampling 
rates of 10 Hertz. 





Figure 1.16: Small-scale movements recorded by Sachs et al. on a Wandering Albatross gliding under 
windy conditions. Wind is 11.2 m.s at 10 metres, the direction is shown by the arrow. From Sachs et al. 
[44], reproduced with permission from Journal of Experimental Biology. 


The albatross trajectory appears to combine repetitive cycles of similar aspects, which 


themselves are composed of pull-ups, turns and descents. The low straightness ratio of the 
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albatross flight path mentioned in earlier observations is confirmed and well pictured in Fig 
I.16. Those small-scale movements are summed into medium-scale movements which indicate a 
clear flight heading. It appears, from measurements by Sachs et al. [44] , that the more birds are 
progressing against the wind, the more intricate the flight path becomes and the lower the 


straightness ratio is. 


Albatrosses indeed seem to take advantage of various flight techniques. Some 
observations and measurements witness cases where albatrosses were seen gliding in zero 
wind, taking advantage of propagating swell waves [23]. Authors suggest various theories to 
explain the flight of albatrosses. A majority refer to dynamic soaring, which exploits the 
vertical variation in horizontal winds, due to the boundary layer effect entailed by the air-sea 
interface. This technique was unknown before the flight of albatrosses was investigated. Others 
suggest wave soaring or slope soaring on waves, where waves play a significant role in 
disturbing the wind field close to the surface. Pennycuick has long held the view that 
albatrosses used gust soaring, where birds would exploit the separation of boundary layer 
leeward of waves to benefit from sharp local increases in wind strength, referred as gusts. 
Finally, Richardson actually supports that those theories are not mutually exclusive and that 


albatrosses probably take advantages of a combination of all those techniques. 


1.2.4 Summary of Inputs from the Biological World 


The general anatomy of albatrosses outlines an aspect ratio around 15, which is 
exceptionally high for a bird and makes them resemble small fixed-wing gliders. Their 
morphology suggests they are unsuitable to flapping flight, which is confirmed by all 
observations, but rather optimized for fast gliding flight. 


It is indeed established that they regularly travel thousands of kilometres with 
travelling speeds that can top 900 kilometres per day. Those performances are intrinsically 
correlated to wind strength, as albatrosses would not even be able to sustain flight in the 
absence of it. It explains their geographical distribution within south latitudes, where the 
strongest winds on earth blow predominantly from west to east at average speeds that can top 


10 ms’. 


Their travel heading can be observed to be very much dependent on wind direction, 
with an angle between the two such that they usually perform best with the wind coming from 
their back with a side component. Their ability to go against the wind is very limited and rarely 


used. It has been evidenced that it comes at a greater energy expense and that it results in 
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drastically reduced speeds of travel. They still manage to come back to their departure point 
while sustaining rather high speeds. To achieve that, it has been demonstrated that they have 
developed a large-scale strategy which takes advantage of predictable wind patterns. Changes 
in zonal wind direction (north-south component) are sufficient to enable them to alter their 


travel heading and to close the loop back to the colony. 


There are strong indications that there is an interrelation between their wing loadings 
and the wind strength they exploit. To that respect, albatross species of lower wing loadings 
are seen in lighter wind areas and it is supported that the sexual dimorphism in wandering 


albatrosses explains the different distribution between males and female. 


Their own in-flight energy expenditure was measured to be very low, not much higher 
than their basal rate. On the other hand, their performances of travel are theoretically high- 
demanding energy-wise, since albatrosses, as any flying vehicle, are subject to energy losses 
due to drag. This contrast emphasizes an inherent energy-extraction process and raises 


questions regarding the underlying technique they have developed. 


On a small scale, intricate trajectories, far from being straight, have been observed. 
The altitude rarely exceeds 20 metres and albatrosses are most of the time in the nearest 
vicinity of the surface. Although there still lacks a general agreement regarding their energy- 
harvesting strategy, there is a concord on the fact that albatross exploit surface-induced wind 
phenomena. The predominant theory focuses on the most generic of those phenomena, which 
is the boundary layer effect that forms due to air-sea interactions. This technique is called 


dynamic soaring. 


1.2.5 Promising Perspectives 


Years of evolution have created a very adapted biological system which relies on two 
different energy resources: surface food distribution and wind. The practical response is a 
foraging flying technique which renders those two resources compatible. It consists in 
exploiting wind energy in order to fly long distances and therefore statistically increase the 


probability to harvest from scarcely distributed preys. 


No fundamental deadlocks appear to prevent the approach of transposing the flight 
technique of albatrosses to the UAV world. Although albatrosses are a class of their own 
among birds, nothing stands out in terms of their general architecture compared to a fixed 


wing glider planform. Their estimated gliding performances are well within the reach of 
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vehicles of similar size and nothing indicates that it should be otherwise in terms of 
manoeuvrability. The wind resource is evenly distributed over large areas, which suppresses 
the constraint of exploring for a sparse phenomenon of unknown location, as it is the case for 
thermals. As for the large-scale flight strategy, a vehicle would actually hold a strong advantage 
over albatrosses, as it could adapt its route by taking into account weather forecast data. In 
terms of energetics, apart from potential mission requirements, most of the expenditure comes 
from propulsion on a small UAV. Endurance would then only be limited by on board 
energetics. The feasibility for a small UAV to extract energy from the wind, in the same fashion 


as albatrosses, is therefore a promising topic, which requires further investigations. 


I.3 Main features of Dynamic Soaring Flight 


The technique of DS consists in taking advantage of vertical variations in horizontal 
wind, also called vertical wind shear gradient. One way to define DS is trivially by opposing it 


to static soaring, such as thermal soaring or slope soaring. 





Figure 1.17: Schematic 2D display of a dynamic soaring path within the wind shear gradient. 


The latter techniques manage to extract energy at a positive and continuous rate 


through steady flight kinematics. In such cases, the variation in total energy of the vehicle 
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would depend whether it can maintain its air relative sinking speed lower than the rising speed 
of the surrounding mass of air. However, the mechanisms of energy-extraction would remain 
effective, whatever the kinematics of the vehicle is. Practically, it means that a vehicle, no 
matter how it is flown, would benefit from the uplift of the surrounding air as long as the 
vertical motion of it remains upwards. In static soaring, energy-extraction mechanisms are 
hence not correlated to the vehicle kinematics. On the opposite, DS is characterized by an 
energy-extraction which is not steady and that varies to form repetitive cycles composed of 
energy losses and gains. Losses are necessary in order to sustain respective gains and the 


periodicity of the process is ensured if gains and losses balance out along the cycle. 





Figure I.18: Overall aspect of a dynamic soaring trajectory. 


Practically, this translates into periodic dynamic manoeuvers within a zone where 
wind gradients form, as displayed in Figs. 1.17 and 1.18. More specifically, in dynamic soaring, 
the bird or vehicle would extract energy from the layer of strong wind and would be able to 
repeat the process by flying back in the zone of weaker wind. This concept can be apprehended 
by considering a ball bouncing on a wall which is moving at a fixed horizontal speed. Each time 
the ball bounces, it gains momentum from the wall, and the same principle applies to the bird 
flying in the zone of strong winds. It means that gains in total energy come through variations 


in inertial speed, as opposed to static soaring, where those gains relate to an increase in height. 
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Besides, variations in inertial speed themselves alter the dynamics of the vehicle during the 


manoeuvre. That is different again different from static soaring. 


To put in a nutshell, some particular trajectories are required through dynamic 
manoeuvres with respect to the wind. Those entail a transfer of energy from the wind to the 
vehicle, which in turn alters the dynamics of the vehicle and influences its trajectory. So the 
mechanisms of energy-harvesting are inherently bound up with the dynamics of the vehicle, 
which is specific to DS. For this reason, the mechanisms that lead to the energy extraction are 
challenging to understand and to model as they are related with other variations of kinetic and 
potential energy along the flight path. It means that flight dynamics and energy-harvesting 
strategy have to be investigated together. 


I.4 Research Question & Objectives 


1.4.1 Research Question 


The context outlined that the research is focused on the use of sustainable energy 
extracted from the wind applied to the flight of small autonomous vehicle. Therefore, one 


wants to know: 


Is it feasible for UAVs to exploit dynamic soaring in the purpose of long duration, or long 


distance autonomous flight? 


1.4.2 Research Areas Breakdown 


In order to fully understand the way the main question would breakdown into 
underlying research questions and then the way research objectives would be structured, some 
specified self-contained research areas can be derived quite straightforwardly out of the 


context. 


What would fly? The potential vehicle would have to be compliant with the 
requirements of DS flight. That is the Vehicle Model. 

Where would that vehicle fly? That is the first 30 metres over land or waves, the lower 
atmospheric boundary layer over earth oceans. A model of the wind field at the 
interaction air/sea or air/land would have to be developed. That is the Environment 


Model. 
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How would the vehicle interact with that environment? In other words, the way it is 
going to fly within that boundary layer. The flight path and dynamic parameters would 
have to be investigated. It is the core of that research work and corresponds to the 


Flight Model. 


1.4.3. Underlying Research Questions 


Answering the main research question consists in trying to build up a solution to the 
problem, which would ultimately be the only way to prove the feasibility. However, during the 
demonstration, or the construction of the solution, some deadlocks might arise and an 


alternative answer would be to bring out which variables govern the feasibility to exploit DS. 


The main research question breakdowns into three underlying research questions, 


which concerns each of the three models introduced above: 


e = What trajectories would enable energy-harvesting in the framework of DS flight? 

e What is the environment, and the corresponding wind field, required for DS energy- 
extraction and what is the likelihood of favourable conditions? 

e Which vehicle design would be suitable for DS flight and how do general design variables 


govern performances? 


1.4.4 Research Objectives 


The research question is there to support different aims. Their numbering is arbitrary. 


Objective 1_ “Understand”: Get a complete understanding about the energy- 
harvesting mechanisms involved in the DS, applicable to the flight of albatrosses. This objective 
might see underlying challenges arise as it emerges that albatrosses seem to combine different 


processes to extract energy from the wind. 


Objective 2_ “Simulate”: Set up a methodology to simulate the physics of DS flight, 
which would give a practical answer, in terms of trajectories, to a particular mathematical 
problem, given some vehicle and environment models adequately formulated to fit the problem. 
Use those derived optimized trajectories to support the physical understanding assumed for DS. 
Get an understanding of mathematical tools used when writing down the algebraic formulation 


of the problem and of how they influence the output. 
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Objective 3_ “Model”: Understand and model the environment where the vehicle 
would perform; in particular aerodynamics of sea/wind or earth/wind interfaces. Also provide 
models of potential flying vehicles and understand how general design variables influence DS 


performances. 


Objective 4_ “Assess”: Assess potential DS performances for different key variables, 


which may influence the feasibility to exploit DS under realistic conditions. 


1.4.5 Thesis Outline 


In order to address the research question and support the aforementioned objectives, 


the following outline has been selected. 


First of all, the literature background on the topic will be reviewed to draw a sum-up 
of the current state of the art. This will be the objective of Chapter II. Then the understanding 
of DS energy-harvesting strategy would only be approached after modelling the environment 
and the vehicle in order to get to a simulation of DS. Hence Chapter III focuses on building up a 
vehicle model that applies to different UAV designs, and also on building up models of their 
environment, by focusing on a refined earth/wind interaction such as waves. The physics of the 
evolution of the vehicle within its environment is the point of Chapter IV, which then also 
presents a methodology to pilot and optimize this evolution. With simulations of DS flight 
available, Chapter V forms the core of the present work as it focuses on the characteristics of 
DS flight and on understanding the principles of energy-harvesting. Chapter VI goes further by 
conducting a mathematical analysis of EoM and the associated optimization problem in order 
to highlight some key variables about DS flight. Finally, various case scenarios are investigated 
in Chapter VII, by varying key parameters in order to assess how those govern the energy- 
harvesting strategy. It includes the refined case of a powered vehicle, where DS makes sense by 


enabling range improvements. 
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I.5 Summary of Chapter I 





The advance of small scale UAVs opens up for a wide range plenty of potential 
applications, however those may be hampered by range and endurance limitations. Several 
options are being investigated to cope with this conundrum and the state of the art is 
presented. A particular scope is given to techniques which take advantage of the surrounding 
energy available to the vehicle. It bears the advantage of keeping the logistics of flying simple 
and the overall technical solutions rather unchanged. The breakthrough comes from the flight 
management approach, which is adapted to benefit from atmospheric convective inputs. 


Among those techniques, one is inspired by the flight of albatrosses. 


It appears that those big birds are actually quite close, in their morphology, to a small 
gliding UAV. Indeed, their wings have the highest aspect ratio of the living world and are 
locked in deployed position most of the flight. Yet, their flight performances are impressive, 
both in terms of speed of travel and range covered. They take advantage of the wind in a way 
that is not completely understood yet but that enables them to achieve round trips of several 
thousand kilometres out and back at their remote colonies. This peculiar flight, called dynamic 


soaring, associated with promising perspectives, makes it a consistent research subject. 


The main question associated to it concerns the feasibility for a UAV to exploit 
dynamic soaring for the purpose of long endurance, long range flight. Hence this research 
work focuses on understanding the energetics of DS, simulating this type of flight and 


assessing the constraints it implies for the vehicle and its flight. 
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(CHAPTER [| 


BACKGROUND OF WIND ENERGY EXTRACTION 


This Chapter aims at providing a review of the literature of wind energy extraction for 
UAV, with a specific scope on DS contributions. It will enable this present work to benefit from 
previous studies and will help to structure the research in order to focus on required 


contributions to knowledge. 


II.1 Contributions to Knowledge 


Research about flying techniques which extract energy from the wind started in late 
10" century with observations of albatrosses from Lord Rayleigh [27]. In 1925, a first analytical 
analysis was undertaken by Idrac [45], in order to underline the fundamental differences 
between static and dynamic soaring. Idrac imagined different trajectories, which are 


reproduced on Fig. II.1, that could take advantage of a vertical gradient in horizontal winds. 





Figure II.1: Various eventual shapes of DS flight path, reprint from Idrac [45]. 


A few numerical approaches were attempted later in 1975 by Wood [46], taking 
advantage for the first time of computer power. Then in 1990, the first satellite tracking 
campaign of albatrosses enlightened the biological community with unexpected flying 
performances from albatrosses [30]. As reported before, further tracking campaigns were 
conducted during the following decade. From late 1990s, the improved knowledge of 
albatrosses performances combined with the emergence of UAVs highlighted the potential of 


DS research. Major publications on the matter appeared from 2000, while certain biologists 
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started to criticize the limited scope of numerical models developed by researchers [41]. Several 
further studies, from different institutions, followed during the decade, while research on 
miniaturized UAVs with autonomous flying capacities started to flourish from 2005. From 2010, 
researchers started to implement in-flight measurement cells on albatrosses with recent 
publications in 2011 and 2013 [43, 44]. Also in 2011, for the first time a publication from an 
oceanographer mentioned and detailed the importance of waves as part of the energy 


extraction mechanism [42]. 


II.2 Literature Background 


II.2.1 Point of view 


The literature available on the subject comes from different authors with different 
backgrounds. Although we are looking at a type of flight which exploits DS, other types of 
potential contributions are reviewed. Indeed, multi-disciplinary aspects of the research involve 


different sciences and give a pretty broad literature background. 


I1.2.2 Simulation of Dynamic Soaring 


The first numerical approaches to DS flight appeared in the late 1990s. Sachs et al. 
provided one of the main publications on that matter [47]. His aim is to compute typical 
optimized DS trajectories and to understand the influence of external parameters such as the 
wind strength. Very early in the paper, the author acknowledges some limitations in the 
understanding of the energy extraction phenomenon and proposes to describe his vision of the 
energy extraction mechanism. Sachs therefore suggests a different understanding of the energy 
extraction phenomenon than the classical DS theory, which claims that the wind gradient is 
responsible for providing energy to the bird. He writes down equations of motion in a three 
dimensional reference frame linked to earth, thus inertial. He isolates a point mass vehicle 
within an environment described using a logarithm model for the wind gradient over a flat 
ocean. The aim of his model is to determine energy neutral trajectories using optimization in 
order to minimize the required wind shear strength, although the optimization process is not 
clarified. He therefore provides 3D trajectories corresponding to DS flight, such as the one 
displayed in Fig. I. 2. The UAV is apparently able to maintain a crosswind net heading while 
taking advantage of the wind through a curvy flight path. Sachs provides an energy-point-of- 


view-interpretation: the energy is transferred from the wind to the vehicle during the upper 
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turn, when the vehicle finishes its upwind leg and starts to turn into the wind to start its 


downwind descent, without detailing the underlying mechanisms. 


Wing | 
“tion 





Figure II.2: Overall open loop obtained by Sachs et al. [47], reproduced with permission from Ibis. 


Zhao et al. provides a slightly different approach of the same overall problem [48, 49]. 
His aim is to derive trajectories which minimize the average thrust required by the vehicle, 
taking benefits of the wind gradient over the ocean. He also isolates a point mass vehicle but 
this time in the reference frame of the wind, which is not inertial, since the wind increases with 
height. The wind gradient profile chosen is linear, which is again a strong difference compared 
to Sachs. The equations of motions see the apparition of a fictitious force (or inertial force) due 
the non-inertial reference frame. He derives an optimization problem clearly stated out of 
equations of motion, where constraints are derived from vehicle operational limits, surface 
avoidance consideration and periodicity consideration while the objective is to compute critical 
trajectories with minimal wind condition, the same as Sachs et al. He then converts the infinite 
dimensional optimization problem into a parameterized optimization problem using collocation 
techniques and finally solves it using NPSOL software. It is to be noted that he suggests 
tracking variations of energy by computing air frame based energy, i.e. energy calculated 
towards the mass of moving air, which is fundamentally different from Sachs’ earth-based 
energy. With this choice, as the lift is directed perpendicular to the relative airspeed vector, the 
lift does not work and hence does not contribute to the energy variation. He then provides 
results through optimized 3D trajectories for different case study. He concludes that the 


average thrust required for periodic trajectories is lower when the UAV aerodynamic efficiency 
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increases and when the wind strength increases. It is to be noted that the average thrust 
mentioned is positive, which means that an energy contribution from the vehicle is required to 


perform periodic trajectories, which was not the conclusion of Sachs et al. 


Another numerical approach was proposed by Deittert et al. [50], which aims at 
providing clarification about the flight performance and the likelihood of favourable winds. 
Like Zhao et al., he also opts for a point mass model observed from a wind reference point of 
view. The wind model is the same as Sachs’ and he also assumes a flat ocean surface. However, 
he decides to optimize his trajectories with a differential flatness technique which is different 
from Zhao’s collocation. He then validates his results by comparing his critical trajectory 
parameters with the one obtained by Sachs. His results consist in a detailed analysis of how the 
vehicle design variables influence the required minimal wind strength and he provides travel 
performances polar depending on wind strength. Deittert et al. identifies the ability to fly close 
to the surface as a key factor governing DS performance, as the strongest wind gradient is 
located in the first metres above the surface. The author concludes that DS would benefit from 
further research investigating the airflow over water surfaces; including the flow between 
waves and that a higher degree of fidelity would be reached by taking into account a six degree 


of freedom vehicle model. 


Lissaman [51], focuses on the principles of energy-extraction, applies DS to a linear 
wind profile to finally derive some general approximation rules. Lawrance et al. [52] derives 
low-complexity guidance and control algorithms and apply it to the case of a UAV travelling 
with the wind. Sukumar et al. [53] derive DS trajectories over open field and study the effect of 
aircraft properties. Other contributions to the topic include Akhtar et al. [54], Barate et al. [55] 
and Gao et al. [56]. 


11.2.3. Jet Stream 


Dynamic soaring flight could potentially be extended to any environment where wind 
gradient form. Grenestedt et al. investigates the feasibility to fly perpetually in the jet stream, 
where gradients are generated in the upper atmosphere between bands of strong winds [57, 
58]. The author undertakes the same approach as Zhao et al. by deriving air relative equations 
of motion from a point mass vehicle within a linear wind gradient and using collocation 
techniques. His main contribution is to extend the optimization process to the aircraft design, 
where he provides a clear definition of the aircraft model. He then goes another step forward 
by optimizing solar-assisted DS trajectories. Therefore the optimization process takes into 


account control and state variables as well as design parameters assuming solar augmented 
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trajectories. However, the author does not go into much detail regarding the results. The 
resulting trajectory is displayed in Fig. II.3, authors force the vehicle back to its initial position 
such that the flight path is closed. It seems quite similar to a circular flight path in an inclined 
plane performed at high altitude (over 10,000 metres). The overall amplitude of the flight path 
is wider than the trajectory obtained by Sachs, with a vertical amplitude of the order of 80 


metres. 
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Figure II.3: Dynamic Soaring closed-loop within a linear gradient, reproduced with permission from 
Grenestedt et al. [57], © 2011 IEEE. 


11.2.4 DS in hurricanes 


Another environment where strong wind shear from is within hurricanes, and the 
same authors Grenestedt et al. give a go at DS simulation within this peculiar wind field [59]. A 
standard hurricane wind model is chosen, where the wind speed, for a constant height, varies 
with the radius of the hurricane considered. The methodology is otherwise rather equivalent to 
their paper in the Jet Stream [58]. They conclude that the theoretical feasibility to perform DS 
within a hurricane depends on the wind section profile encountered. We see the main 
contribution of this paper as a demonstration that DS can be exploited where any wind 
gradient form. Indeed, whether the wind sees a vertical variation in its horizontal component or 


a horizontal variation of that same component, DS trajectories can potentially be found. 
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11.2.5 DS at Ridges 


Another potential source of wind gradient comes from wind blowing over mountain 
ridges. Richardson provides an analysis of the DS flight performed by RC gliders which enable 
vehicles to reach speeds up to 450 mph without any engine [60, 61]. His contribution consists in 
modelling ridge soaring through a simple Rayleigh cycle, which consists in assuming a circular 
loop trajectory along a plane tilted upward into the wind and crossing a two layer wind shear 
structure. This highly simplistic model still enables the author to compute the overall wind 
strength to vehicle maximal velocity behaviour. He also outlines optimal values for loop 
periodicity thanks to simple analytic calculation from the model. The main advantage of his 
approach is to be able to compare his model with achieved RC gliders flights thanks to 
measurements conducted from the ground (mainly radar speed measurement) He concludes 
that further research would consist in implementing instruments on vehicles in order to get 
more accurate elements of comparison between theory and experiment. Besides, he suggests 


that some numerical modelling would help to refine the flight model. 


II.2.6 Gust Soaring 


Gust soaring cannot quite be considered as classical DS technique. Indeed, the vehicle 
does not perform regular cycles, but rather aims at taking the maximum out of a sudden and 
unpredictable gust. However, it is nonetheless a way to extract energy from a horizontal 
variation in wind, for which the dynamics of the vehicle and the energy extraction are also 
bound up together. The main difference with DS is the time scale and the stochastic nature of 


the problem, which makes it impossible to plan trajectories. 


Patel et al. [62] detail the basic principles of gust-energy extraction in Fig. II.4, in the 
case of vertical gusts. It consists in orientating the lift in the direction of the vertical motion of 
air met by the vehicle, respectively upwards and downwards for an updraft and a downdraft. 
Although authors do not provide a scale length on the figure, those variations, both in time and 
space, are at high frequency, such that conventional pitch control is inappropriate to control 


the lift in such a way. 
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L 


Figure II.4: Fundamental concepts of gust soaring, with the vehicle performing pull-ups within an updraft 
and pulling negative acceleration during downdrafts. Adapted from Patel et al. [62]. 


It is yet probable that albatrosses take advantage of gust soaring along their overall DS 
cycles [41, 42]. Several authors have focused on gust soaring [63, 62, 64, 65]. Lissaman et al. 
concentrated on the general understanding and simulations of the energy-extraction principles 
[63] with a methodology which is rather close to what can be done for DS, while Langelaan et 
al. [64, 65] as well as Patel et al. [62] investigate control laws. Gains were demonstrated to be 


theoretically and practically feasible. 


1.2.7. Practical Aspects 


One of the challenges of DS flight is that inherent dynamic manoeuvres make it 
difficult to comprehend and to analyse from an experimental point of view. Dynamic 
conditions obviously cannot be recreated in a static wind tunnel environment. Some 
publications have focused on the way DS flight could be practically implemented with an 


autonomous vehicle as well as on experimental processes to improve the knowledge of DS 


flight. 


Even before reaching the capabilities to perform DS flight, vehicles would need to gain 
autonomous capacity, including path planning and environment recognition. In the case of DS, 
the knowledge of the wind field is essential and usually assumed to be known in research 
approaches so far. Langelaan suggests a way to estimate the wind field for a small flying 
vehicle [66].The approach utilizes sensors which are already part of a standard autopilot sensor 
suite available on the vehicle and its primary motivation is energy-harvesting perspectives. 
Therefore, hardware only consists of on board GPS, Inertial Measurement Unit (IMU), 


magnetometers, static pressure and dynamic pressure. A flight model associated with sensor 
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fusion capacity is used to derivate a wind field estimation algorithm. The author concludes that 
errors in the estimation can be maintained low by increasing the accuracy of airspeeds 
measurements. Simulations are performed with this wind estimation algorithm and a gust 
energy-harvesting controller. It shows that energy-harvesting is still possible using a wind 
estimate and not an implicitly assumed wind field at the cost of greater control actuation, since 


the vehicle is discovering its environment in real time. 


Bower undertakes another approach to bring DS flight closer to practical feasibility 
[67, 68]. He sums up the different enabling technologies required for the practical use of DS for 
a small UAV and he underlines that technology in the field of sensing and communications 
already reached a sufficient level of maturity and miniaturization to be utilized in the purpose 
of his research. He identifies the need for measuring height over the ground in order to control 
the proximity to the waves, without being able to select an appropriate ready-to-use technical 
solution. The author then undertakes the same approach as Zhao et al. by deriving optimized 
solar augmented trajectories for a point mass model of a vehicle within an environment 
simplified to a logarithmic vertical evolution of the wind. The optimization process uses 
collocation techniques and is solved with SNOPT. He finally undertakes a vehicle design trade 
study to analyse trajectory optimization parameter sensitivity about a baseline design. He 
mentions the need for a more realistic environmental model as well as for a refined vehicle 
model with inertias He then concludes that trajectory generation and guidance capabilities are 
major challenges to overcome before DS flight can be performed and underlines that accurate 


estimation of aircraft state and wind field must be developed on the experimental side. 


Rather than recreating DS flight conditions so as to eventually validate the flight 
model, one alternative option is to compare the theory with the biological system which 
initially inspired DS. Sachs intends to prove albatross can make progress against the wind 
using DS, which is one key parameter to the feasibility to perform long endurance flight in any 
direction. Papers [43, 44] sum up the results of the first in-flight measurement campaign 
applied to DS research. The experiment is based on a GPS logger attached on the back of an 
albatross, as seen on Fig. II.5, from which raw data are post-processed with a trajectory 
reconstruction algorithm. This requires liaising with biologists on the field to set up and 
recover the device as-well-as significant logistics to perform such a test at those distant and 


remote locations. 
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Figure II.5: GPS logger attached on the back of an albatross. From Sachs et al. [43]. 


In the meantime, wind parameters were obtained using SeaWinds. Different scenarios 
of strong and weak winds are recorded and resulting trajectories are computed. The author 
then analyses and compares the patterns which show that progress against the wind was 
achieved by the tagged albatross. The bird’s trajectories are clearly different depending 
whether it goes with or against the wind. The wind strength also seems to have an influence on 
the way the bird makes progress. One of the main observations was the use of more 
pronounced tacking manoeuvres, when progressing against a stronger wind. Reconstructed 
altitude time history cycles presented in annex shows that the height reached by the bird is on 
average much lower than predicted on Sachs and Deittert numerical models. It also shows that 
an important part of the time, albatrosses are flying below 2 metres altitude, i.e. in the vicinity 


of waves. 
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II.3 Overview and Critical Summary 





The subject of DS has provided several points of views which seem to lack 
consistency so far. Even with the same and simple environment model, authors still remain 
vague in describing the energy-extraction process in their model. On this matter, Sachs 
underlines that the energy-extraction occurs during the higher turn, without providing further 
details. Zhao highlights that with an air-based point of view, energy is extracted through the 
work of a fictitious force, also called dynamic soaring force by Deittert, when the vehicle 
crosses the region of strongest wind shear, that is to say at low altitude. Lissaman details how 
energy can be locally gained from convective mechanisms via the work provided by the lift, 
without focusing on overall DS flight path, while Patel et. al applies it in the case of vertical 


drafts. 


Models developed by numerical studies hardly seem to represent the reality of the 
field observed by biologists. While biologists provides useful qualitative data but lack 
parameters to go into quantitative details, one of the main limitations about numerical studies 
is to rely on a clean model of wind gradient over a wave-less surface, which all biologists 
acknowledge as a limitation. All authors from numerical studies agree on the need to improve 
the environmental model in order to get closer to reality, by taking into account wind-waves 


interactions. 


Furthermore, all theoretical studies lack some experimental validation. Sachs in-flight 
measurement campaign was a first but lacks the measurement of further flight parameters to 
permit a thorough comparison with the flight model developed by the theory. Among those, 
the relative airspeed, the air relative flight path angles as well as the accelerations would give 


more elements of comparison. 


Another limitation acknowledged by different authors is the lack of practical point of 
view among the numerical studies. Those papers investigate DS, without assessing changes in 
vehicle properties, in flight path constraints or in environmental conditions, which is 


necessary before concluding on the eventual feasibility to exploit DS. 


To put in a nutshell, an overview of existing literature shows a lack of consistence 
between the different points of views as well as a lack of maturity in the concept of exploiting 


DS. Those limitations comfort us in our choice of research question and of research objectives 
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(CHAPTER |[[]| 


MODELLING A VEHICLE AND ITS ENVIRONMENT 


The scope of DS research involves interdependent concepts, which have been 
introduced to be the vehicle, the environment and the flight model. Each has its own variables, 
as well as its own representation with respect to other models. Variables translate into 
properties for each respective representative model. Design variables for the vehicle translate 
into aerodynamic and inertial properties; direction and strength of high altitude winds 
combined with surface properties form a representation of a three dimensional wind field 
above the surface and finally, state and control variables are varied over time to form a 
consistent flight path. The core of our subject is the way for a vehicle to harvest energy from its 
windy environment in a specific fashion. Hence a vehicle and an environment model must be a 
priori determined in order to establish a flight path that relates them. This part aims at 
presenting the variables involved in both model, as well as quantifying the way those influence 


the respective representation of each. 


IlI.1 The Vehicle Model 


The vehicle flies a whole range of functioning point during a DS cycle, which makes it 
unappropriated to optimize a specific design. Besides, as wind field conditions change, and as 
the type of mission scenario relative to this wind field are most likely to alter, it makes it 
difficult to optimize a vehicle over such various conditions. Hence it was chosen, so far, to 
break down the research problem by isolating the vehicle design to something conventional 
and well-known. Without getting into great details, any glider-shaped vehicle with good lift to 
drag ratio has the potential to fly DS trajectories and classic architectures have been 


demonstrated to perform best in this area. 


IlI.1.1 Baseline Designs 


In order to understand the way general design variables can influence DS 


performances, it was still chosen to try out different vehicles, of various sizes. Indeed, one of 
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the objectives of this research is to assess whether the flight of albatrosses can be transposed to 


vehicles that could differ from albatrosses, in particular in terms of size. 
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Figure III.1: Planforms of the UAVs considered, on the same scale. From top to bottom, Cloud Swift [19], 
Wandering Albatross, Mariner, reproduced from Bower [67], DT-18, reproduced from Delair-Tech [69]. 
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Figure III.1 presents the different vehicles chosen. It naturally starts with what will be 
called Wandering Albatross in the remaining of the document. Based on data from different 
authors on the biological field, an average geometry is outlined by Sachs [47]. Although 
variables somehow differ slightly from data gathered in Table I-1, the advantage to stick by 
Sachs’ chosen geometry is to have a support for validation. However, for the vast majority of 
simulations, UAV models were chosen, based on the architecture of a small-scale glider rather 
than that of a bird. That suppresses the need to estimate and to average a geometry between 
different albatross specimens and it makes sure that the performances of the model are 


achievable by an engineering-designed vehicle. 


Among those vehicles, the biggest is named Cloud Swift from Allen’s work [19]. It is 
based on the radio-controlled cross-country glider SBXC from RnR Products. It was chosen for 
its wide representation in other research works [19, 70] and in the way that it represents a 
performant glider in the upper range of small UAVs. It has the highest aspect ratio (AR) of the 


four designs and can be expected to have the best gliding performances. 


Next on the list is Bower’s Mariner [67], which has been developed to minimize the 
wind strength required to perform DS, under certain wind field conditions. Contrary to 
albatrosses which can have their wing tip touching the water, as presented in Fig. I.5, the ability 
of a UAV to fly very close to the surface is limited. However, in order to keep the centre of 
gravity as close to the surface as possible during manoeuvres at low height and high bank 
angles, the methodology developed by Bower limits the span, at the expense of sacrificing the 


aspect ratio. 


Finally, the smallest model is based on the DT-18, engineered by Delair-Tech [69], a 
French start-up based in Toulouse. It takes the most of the 2 kg UAV class (including mission 
payload) and is the only small civilian UAV to be approved for Beyond Visual Line-of Sight 
operations (BVLOS) by an official rule-making body [69]. Main features include an elliptical 
shape of the wing planform, which is assumed to optimize the span-wise lift distribution. 
Furthermore, the aspect ratio is lower than the other two UAVs and we interpret that as a way 
to keep the mean aerodynamic chord (MAC) at a relative high value and thus avoid local low- 


Reynolds numbers on the majority of the wing. 


Each of the three UAV designs has been selected in its powered version, where an 


electric motor and a folding propeller allow the vehicle to sustain flight under its own power. 
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IlI.1.2 Quantification of Vehicles 


The baseline design characteristics of the four vehicles, in terms of size and weights, 
were given by respective authors and are summed up in Table III-1. It is to be mentioned that 
for the sake of comparison, the baseline mass was taken for vehicles in a ready-to-fly 
configuration, including battery, propulsion hardware, sensors, autonomous capabilities, but 
excluding any extra payload. The mass would then be eventually increased from their baseline 


values for later simulations, hence virtually adding a payload. 


The aerodynamic behaviour of small-scaled slender bodied vehicle, like the ones we 
are interested in, is quite well modelled by tools such as Xfir5 [71] or AVL [72]. Those software 
extrapolate a 2D pressure distribution around airfoils to a full 3D behaviour, by means of 
different theories, such as lifting line, 3D panels or vortex lattice methods. Initially designed for 
Radio-Controlled (RC) gliders applications, they are particularly adapted to the geometries of 
chosen vehicles. Mariner aerodynamic characteristics were obtained by Bower through AVL 


[67], while those of Cloud Swift were obtained by building a strip model from XFOIL [70]. 


Another approach is to apply a technique called Prandtl's lifting line theory in order to 
get first-order approximation for the drag coefficient [73]. It gives fairly good results for a finite 
wing with no sweep and a reasonably large aspect ratio. By modelling the wing as a fixed 
vortex with a series of trailing vortices extending behind it, contributions from each vortex are 
summed into a resulting force called the induced drag, represented by its coefficient Cp;. The 
zero-lift drag, represented by Cpo, is added so that the overall drag is represented by a quadratic 
drag polar as shown in Eq. III.1. The Oswald coefficient (e) is a correction factor that accounts 
for disparity between the elliptical lift distribution assumed in the theory and the realistic lift 


distribution. 


C,’ 
Cp = Coo “Fr Cni = Coo v BAR ( III.1 ) 


For wings of relatively high aspect ratio and no sweep, the theory is rather consistent 


and an Oswald coefficient of 1 can be assumed. It is what Sachs has done to estimate the drag 


coefficient of the Wandering Albatross. 


The DT-18 is the smallest and lightest vehicle of the three UAVs considered. Besides, 
as it was mentioned before, the aspect ratio is also the lowest and a slight sweep is integrated in 


the wing design, which is apparent from Fig. III.1. Since conventional modelling techniques and 
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tools might reach some limits for the design of the DT-18, a wind tunnel testing campaign was 
conducted in order to get accurate data. Among others, the objective of the experiment is to 
provide an aerodynamic polar over the whole range of angles of attack and assess its sensitivity 
to the Reynolds number. Different vehicle configurations are tried out by combining variations 


in some of the vehicle degrees of freedoms, in terms of engine rate, control surfaces and flaps. 


Among the two wind tunnels ISAE has at its disposal, the wind tunnel S4, (located on 
the ENSICA Campus) was selected thanks to the wide 3 mx 2m elliptical test section which 
enables to set up a full-scale instrumented replica of the DT-18. Besides, the range of speeds 


available made it possible to get data for realistic airspeeds seen in flight by the UAV. 


The S4 wind tunnel is an Eiffel-type wind tunnel with an open elliptical test section 
and a closed, non-streamlined, recirculation zone. Installation can be seen on Fig. III.2. The 


airspeed ranges from 0 to 42 m.s’ and the associated turbulence rate is 0.52 %. 


123 mm gap 








Protection 


Engine-fan 
eroup 


Collector - Experimental section 


grid 
Convergent filter hea: 


-_ % . Suction chamber 
Anti-turbulent filter 


Figure III.2: Top view of the $4 wind tunnel. All distances are in m, unless otherwise stated. The DT-18, 
colored in orange, is schematically included in the test section. Adapted from [74]. 


A factory-built DT-18 was modified for the sake of the experiment. The external shape 
was not altered and main changes were conducted in order to link the replica to the rest of the 


experimental set-up. A ventral-docking configuration was selected, where the mock-up is hold 
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on top of a single mast, see Fig. III.3. Aerodynamic forces are determined via a measurement 
unit, sheltered inside the fuselage, which structurally links the mast with the aerodynamic 
mock-up. On the mock-up-side of the measurement unit, a specific holding plate was designed 
to link the standard cylinder-shaped-measurement-unit to the geometry of the DT-18 while also 
maintaining the mock-up in the correct attitude. A cut was made on the ventral panel of the 


DT- 18, just enough wide so that the wires and the mast could go through. 


The mast is connected to a cradle on the ground of the test section. The cradle can 
move along two curved rails such that mock-up rotates along a fixed pitch axis (the top of the 
mast is the centre of the rotation). This change in longitudinal attitude with respect to the wind 
tunnel section is used to modify the simulated angle of attack (AoA). Another rotation is 
possible along the axis of the mast itself. This variation in yaw permits to simulate a side slip 


angle. 


A majority of the on board electronics were kept inside the fuselage and powered from 
outside. It enabled to control the aerodynamic surfaces and the engine output through the same 


user interface, developed by Delair-Tech, as the off-the-shelf DT- 18. 
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Figure III.3: Rendering of the mechanical set-up of the mock-up on the mast. The holding plate (red), 
supports the mock-up on its upper surface and clasp the measurement-unit (horizontal, green) in one of 
its forward extremity. The measurement-unit is encircled at its aft extremity by the ventral mast 
(vertical, green). Copyrights Delair-Tech. 





Figure III.4: Mechanical set-up, which evidences the holding plate (top, horizontal), the ventral mast 
(bottom, vertical) and the measurement unit (middle, horizontal) in between. Here, a spirit level is laid on 
the holding plate, in order to verify its horizontality and ensure that the mock-up will be in a wings- 
level-configuration, before locking the holding plate into position with respect to the measurement unit. 
The yellow cable transmits measurement data from the unit to the acquisition centre. It runs downwards 
along the mast out of the lower side of the fuselage. 
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Figure III.5: The DT-18 experimental mock-up into place, through the single ventral mast, inside the 
elliptical test section of the $4 wind tunnel. The open test section is clearly visible, as the workshop is 
visible (left) outside of the section. The suction fans are visible (top right) at the end of the suction 
chamber, leeward of the diffuser. The curved rails of the cradle can also be seen (bottom), as well as the 
external feed (bottom left) for powering on board electronics. 


Forces and moments were measured locally on different places of the measurement 
unit and then could be calculated on different virtual points. Those were calculated at the 
virtual position of the centre of gravity of the DT-18 (which is different than that of the mock- 
up). Practically, what is needed for DS simulations is the variation of the drag coefficient with 
the lift coefficient. For the sake of simplification, it can be reduced to a polynomial form of the 


following. 
C5 = Cot CpG, HC ps, FOys6 Opi, (1112) 


The respective drag coefficients can show some sensitivity to the airspeed, all the 
more if that range of considered airspeed sees strong Reynolds-number effects. The Cp = f (C,) 


behavior defined by Eq. III.2 is hence dependent on the airspeed to some extent. 
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Figure III.6: Evolution of the coefficient of lift with the angle of attack (alphac) for the DT-18, for different 
wind tunnel speeds and their associated Reynolds number for the vehicle. 
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Figure III.7: Evolution of the coefficient of drag with the angle of attack (alphac) for the DT-18, for 
different wind tunnel speeds and their associated Reynolds number for the vehicle. 
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Figure III.8: Evolution of the coefficient of lift versus the coefficient of drag for the DT-18, for different 
wind tunnel speeds and their associated Reynolds number for the vehicle. 


For wind tunnel speeds above 15 m.s’, the angle of attack was limited in order not to 
damage the mock-up or the balance. A way to generally represent glider performance is to 
represent the speed polar which shows the evolution, when the vehicle is in steady equilibrium 
gliding state, of the vertical sink speed versus the horizontal speed. This curve, representative 
of steady states of the vehicle at different airspeeds, can account for variations of the Cp — C;, 
behavior with the airspeed. However, DS flights involve dynamic manoeuvres where the 
vehicle is rarely in an equilibrium position. Still, a relationship between lift and drag is needed 
for the vehicle and for the sake of simplification, Reynolds-number effects are neglected and 
the relationship is supposed to be independent of the airspeed. Figure III.8 displays how curves 
are mainly sensible to the airspeed for Reynolds numbers below 1.5. 10°. Gaps between the 
curves are narrowing for higher Reynolds numbers. Given those considerations and as the 
expected airspeed of DS flight are rather high, it was estimated that results obtained at 18 m.s” 


would be quite representative of aerodynamic performances of the DT-18 for DS simulations. 


Data of the four vehicles are gathered in Table IIJ-1 and would serve to characterize 


their respective model in the remaining of this thesis. 
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Table III-1: Comparison data between the four vehicles considered. 





Wandering 





Hueanscier Cloud Swift Albaivos Mariner DT-18 
m (kg) 6.8 8.5 2.0 7 

b (m) 4.32 3.3 2.5 1.8 
S(m?) 0.957 0.65 0.485 0.248 
AR 19.5 16.8 12.9 13.1 
WL (N/m?) 69.7 128.2 40.44 67.2 
Coo 0.017 0.033 0.0173 0.0259 
Csi 0 0 -0.0022 -0.0002 
Cp2 0.0192 0.019 0.0629 0.0735 
Cp3 0 0 -0.0578 -0.0858 
oo 0 0 0.0314 0.0607 
Coe 1.0 15 1.17 1.195 
(L/D) max 27.7 20 20.5 15 
VZmin (m/s) 0.39 0.6 0.42 0.76 
PoweTmin (W) 25.7 49.7 8.2 ey, 
ConsOminJ /km) 2410 4174 956 1113 


Mariner stands out in terms of wing loading, which is the lowest of all four. Besides, 
Mariner manages to reach a maximum lift to drag ratio ((L/D)magx) comparable to the 
Wandering Albatross, which is has a higher aspect ratio and a longer wingspan. This is partially 
achieved through a lower zero-lift drag coefficient Cpg, which could only be estimated for the 
Wandering Albatross and is therefore subject to greater uncertainties. However, Pennycuick’s 
estimate of the (L/D) mq, for the Wandering Albatross does not come far at 21.2 [25]. Although 
the zero-lift drag coefficient seems rather low for Mariner, a comparable value is obtained for 
Cloud Swift which strengthens the assertion that it is an achievable value for this class of 
vehicles. Besides, another important piece of information regarding glider’s performances is the 
minimum sink speed (VZj,). Both Cloud Swift and Mariner achieve lower minimum sink 


speeds than the Wandering Albatross. Cloud Swift has the lowest minimum sink speed of all, 
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which underlines its background conception for thermal soaring [19]. Indeed, this low value of 
minimum sink speed allow it to maintain level flight by exploiting thermals rising at a 
correspondingly low minimum rate of climb, enabling it to take the most out of a wide range of 
potential thermals. The smaller DT-18 does not reach such high value in (L/D) mq, , nor low 
value in VZ,,;,, which can partly explained by the lower Reynolds number, which hamper the 
zero-lift drag coefficient. The DT-18 is still rather interesting for our study case, precisely for its 
reduced dimensions, especially span-wise, which would allow it to get closer to the surface 
during DS manoeuvres. To put in a nutshell, the comparison between vehicles highlights that 
the Wandering Albatross has aerodynamic gliding performances that can be reached and even 
exceeded. Mariner is a sound illustration of this conclusion and combines a streamlined design 
with a lightweight architecture to outperform the Wandering Albatross. Therefore, the gliding 
performances of UAVs are not a deadlock regarding the feasibility to exploit DS flight. 


Vx 
6 8 10 12 14 16 1a 20 2? 24 76 


— Cloud Swift 

—— ‘Wandering Albatross 
— Mariner 

—DT-18 





Figure III.9: Comparison of speed polar for each of the four vehicles. 


Additional metrics can be compared when it comes to flight performance. The first is 
directly related to the sink rate and it is the minimum power required to sustain the vehicle 


airborne (POWe7y;7,). It relates to the minimum sink rate as follow. 
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PoWelmin =MGZ=Mg |VZmin| (IIL.3 ) 


It corresponds to the minimum power expenditure in flight, and intrinsically relates to 
the endurance of the vehicle. Indeed, if a vehicle needs to remain in the air for as long as 
possible, it should fly at the airspeed which allow for the minimum sink rate. If the vehicle is 
propelled, then by sustaining that airspeed and with a power output of its propeller equal to 
PoWeT nmin it will maximize its endurance. For a given charge Baty of its battery, and a 
propulsion efficiency given by Nyrop , the vehicle can theoretically sustain level flight for a 


nominal period of time called Endurance,,,, and given by the following equation. 


= Nprop B aly 


Endurancemax = Power® (111.4 ) 
min 


Where prop is given by the following formula, T being the thrust and Bat the battery charge. 





(IIL.5 ) 


T.Va 
prop = t 


Ba 


The energy expenditure of a gliding vehicle in steady flight over a horizontal-travelled 
distance Dist can be related to the associated loss of height h and to the lift to drag ratio as 


follow. 


Dist 
Eexpenditure =mg*h=mg* ( IIL.6 ) 


L/D 
This expenditure corresponds to the energy needed by the vehicle to sustain level 
flight over the distance Dist. Therefore, the lowest nominal energy consumption per unit 


length travelled, called Conso,;,, is given by the following formula. 


mg 


CONSOmin = (L/D). 7D) yan (111.7 ) 


The lowest energy consumption per unit length travelled is obtained by flying at the 
functioning point corresponding to the maximum lift-over-drag ratio. It should be noted that 


CONSOmjn is homogeneous to a force, expressed in Newton, and that it corresponds, to a slight 
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approximation, to the thrust required from the vehicle to sustain that functioning point. For a 
given charge Baty of its battery, the maximal theoretical range of the vehicle called Rangema, 


is given by the following equation. 


- Nprop B aly 


Rangemax = Benepe ( IIL.8 ) 
min 


It follows from those calculations that the maximal endurance and maximal range of a 
vehicle are respectively related to the minimum sink rate and the maximal lift-over-drag ratio 
of the UAV. Those performance parameters indeed permit to establish respectively the 


minimum in-flight power required, Power,,;,, and the minimum ConsO,pjn. 


Those latter metrics are summed up for each vehicle in Table III-1. It can be seen that 
Mariner reaches both the lowest power required and the lowest consumption. Therefore, if the 
four vehicles were equipped with the same battery and assuming they all share the same 
propulsion efficiency 779), Mariner would fly longer and farther than any of the other three 


vehicles. 


II.1.3 Point Mass Model 


For the sake of simplification, a point-mass model was chosen to represent vehicles. 
This type of representation allows simplifying the motion of the vehicle to three degrees of 


freedom, which are the three translations. 


The main advantage is that the vehicle input to the flight model will be kept to the 
minimum, which is a strong plus for the methodology that lies behind the formulation of DS 
trajectories. Furthermore, the scale of the wind gradients is such that span-wise variations in 
wind can be neglected and that a point mass model is an adequate representation of vehicle 
dynamics. The main limitation is that the rotational behaviour of the vehicle around its centre 
of gravity is not modelled. However, it does not mean that rotations cannot be indirectly 
simulated or controlled to some extent. The lack of equations representative to the rotational 
behaviour of the aircraft would have to be partially covered by further assumptions. For 
instance, by assuming that the vehicle is in symmetric flight with respect to the surrounding 
air, there is hence no sideslip angle, and the aircraft can be simulated to be directly aligned by 
the airspeed. For a hypothetic six degree of freedom model, symmetric flight would indeed be 


sought, but this orientation would be physically modelled, rather than simply assumed. 
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Practically, all forces that apply to the vehicle are concentrated at the centre of gravity 
G. Those are respectively the lift (L) the drag (D) and the weight (W) and are expressed in Eqs. 
III.9 to II.11. 


1 2 

L= 3-5. C,. Va (IIL.9 ) 
1 Z 

D= 3-5. Cp. Va ( III.10 ) 
W =mg (1.11 ) 


The orientation of the weight comes trivially, but the direction of aerodynamic forces 
is directed by the airspeed. The orientation of the lift requires the introduction of a parameter 
that is the bank angle. Once again, practically, this angle is a state of the aircraft which is 
piloted by deflections of control surfaces and which variations depend on the vehicle's inertia 
along its roll axis. With a point mass model, this angle will be directly piloted. Further 


precisions are detailed in the part dedicated to equations of motion. 


II.2 Environment Model 


Dynamic Soaring research is faced with a significant limitation on the environment 
model it uses. Although the ocean surface holds the significant advantage to be rather flat and 
uniform compared to earth variations, the wind profile encountered is not that of a flat plate. 
So far, the environment was only taken into account through steady boundary layer vertical 
wind profiles, either following power-law behaviour [50, 56]or logarithmic evolution [47, 67] or 
even selecting a linear wind profile [49]. However, those profiles are constant in space and time 
and are not correlated to any change in environment condition, apart from changing the 
reference wind itself. The purpose of this part is to refine the environment model used in 
simulations so as to correlate the wind field with environment conditions and to describe how 


DS could be affected by the presence of a moving wavy surface. 


IlI.2.1 Wind Profiles over rather Flat Surfaces 


In the present case of this study, we consider neutral conditions at sea, that is to say 


that convective exchanges or heat transmission are absent. Those conditions are met whenever 
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the surface is cooler than the air above. The subsequent boundary layer is therefore solely 
driven by wind shear turbulence and classical theory of the turbulent boundary layer can be 
used. A distinction between two cases is made, either the surface is considered smooth and 
therefore the mean velocity profile is that of a flat plate, or the roughness of the surface induces 
changes in the velocity profile. Some concepts related to that theory are introduced hereafter. 
The friction velocity u, is defined by Eq. III.12. and is a representation of the stress T, exerted 


by the wind on the ocean surface. The air density is represented by p. 


Tp 
uy, = | 11.12 
0 ( ) 


The friction length z, of the boundary layer is the length scale of the internal sub-layer 


and is related to the viscosity v by Eq. III.13. 


a. (11.13 ) 


The general form of the dependence of the mean velocity profile over a flat rigid wall 


is called the universal law of the wall and is expressed in Eq. III.14. 


u(z) =u, .f (=) (111.14 ) 


The former relation is only valid for surfaces considered as dynamically smooth, that 
is to say when eventual protrusions on the surface are small enough not to entail a change in 
the mean velocity profile. Monin and Yaglom [75] propose a quantitative explanation of the 
requirement of the smoothness: the wall will be considered as dynamically smooth if the mean 


height hp of the protrusions satisfies the condition given by Eq. III.15. 


< = — 
No S 2s uu. (111.15 ) 


In the ocean atmosphere, for friction velocity of the order of 10 cm/s, the friction 
length does not exceed some tenth of a millimetre and quite trivially the ocean surface cannot 
be considered as dynamically smooth. Therefore, the main velocity profile departs from the 


case of a flat surface and will depend on irregularities of the surface. Those are considered in 
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the expression of the main wind profile, via the roughness length Z) defined in Eq. II.16, which 
is characteristic of the dynamic interaction between flow and surface [75]. There is a one to one 
correspondence between the state of the surface, with a set of protrusions ho irregularly 


spaced and the roughness length Zo. 





er (===) 
0= p CZ) 


(III.16 ) 


Cy is the friction coefficient at the altitude z, and x is the von Karman constant. The 


fact that the shear stress is almost the same within the boundary layer entails that Zp is a 
constant [75]. The mean velocity profile over a rough surface is subsequently given in Eq. III.17 


[75]. 


u(2) = — log (=) 
u(z) = —log |— 
Y gJ Zo ( 11.17 ) 


For the resulting log profile, the local increase in horizontal wind with altitude, or local wind 


gradient is expressed in Eq. III.18. 





a (111.18 ) 


The vertical gradient in horizontal wind is highest when closest to the ground. 
Besides, the local wind gradient is directly proportional to the wind friction velocity and the 
resulting effect is graphically displayed in Fig. III.10 (left). An important remark to be deduced 
out of Eq. II.18 is that the local wind gradient is independent from the roughness length, which 
is displayed in Fig. III.10 (right). Indeed the curvature remains identical, for a constant altitude, 
from one profile to the other. Therefore, surface roughness conditions do not alter the shape of 
the mean wind profile but shift the local mean wind speed by an identical value for the whole 


profile. 
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Figure III.10: Influence of variables from Eq. III.17 on the main wind profile. Left, variations in friction 
velocity for values representative of the atmospheric boundary layer. Right, variations in surface 
roughness length, representative, from the lowest to the highest, of open sea, open tarmac, open grass, 


low crops, and high crops respectively. 


Earlier DS studies, such as Wood [46], Sachs [47], Barate [55] and Bower [68], use and 
mention values of Z) around 3 cm, which is supposed to represent a typical value over very 
rough seas. They implicitly assume that ocean surface irregularities are of the order of the 
meter, which seems a priori fair for rugged conditions at sea. However, several limitations are 
going against the use of this estimation of Zp) for defining the mean wind velocity profile in the 


case of DS over oceans. 


The first limitation is that specific literature on the topic refers to significant lower 
values. Stull [76] mentions values for Zp of the order of the millimetre for off-sea wind in coastal 
areas, while all existing data indicate that the surface of the sea is considerably smoother than 
the majority of land surfaces, with Z) < 1 mm even for a fairly strong wind. The same author 
mentions roughness lengths of the order of the cm for open flat terrain, typically covered with 


low grass and with few obstacles. 
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Moreover, the velocity profile defined in Eq. II.17 accounts for irregularities from the 
surface only in a statistical way. Although that velocity profile is mathematically defined down 
to Zp, in fact the logarithmic equation loses its physical consistence at much larger values of 
altitude. Indeed, for altitudes comparable to the height hp of protrusions on the surface, the 
velocity profile would vary depending on the shape and spacing of the irregularities and on the 
relative position towards those. This is especially critical as it was established before that DS 
trajectories are composed in particular of a low turn close to the surface (Sachs [47] maintains a 
1.5 metres ground clearance for the centre of gravity of the bird). So Eq. III.17, used with a Z9 of 
3 cm, may reach some limitations in terms of accurate modelling of the wind field in the first 


metres above the surface. 


A refined model must be approached in order to provide the roughness length Zo of 
the ocean surface. Wind-waves interaction is a vast field of research, far from being completely 
explored. Nevertheless, Charnock’s roughness length model, developed by the eponymous 
author, is a generally recognized way of relating roughness length to the wave spectrum [77]. 


The Charnock’s relation follows in Eq. III.19. 





(111.19 ) 


Stronger shear stress induces higher irregularities, which results in a greater 
roughness length. The parameter q@ is called the Charnock’s parameter and is estimated to be 
within the [0.01; 0.035] range. This parameter was shown to depend on the wave fetch, that is 
to say the length of water over which a given wind has blown. It is directly related to the wave 
age, but it can be assumed constant for some cases. Indeed, the Charnock’s parameter a, = 1/80, 
which is then called the Charnock’s constant, was demonstrated through wind field 
measurements [78] to be accurate for long fetch cases, corresponding to old waves. The use of 
Charnock’s constant might reach some limits in coastal areas, where the surface roughness of 
young waves is higher than predicted. For a friction velocity of 50 cm/s, the corresponding 
surface roughness length obtained using Charnock’s model is 0.28 mm, which is much more 


consistent with the order of magnitude given by Stull [76]. 
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III.2.2 Ocean Waves 


The way the mean wind velocity is subsequently defined with Eq. III.17, for a 
roughness length that follows Charnock’s model with a = a, , gives a model of shearing flow 


over a rigid ocean surface, where irregularities are taken into account statistically. 


However, the ocean surface is obviously constantly moving, and so to the effect of 
turbulent shear flow over rigid surfaces must be added the effect of surface displacement. 
Besides, only small scale structures of the surface, of the order of the cm are taken into account 
using Charnock’s model. It must be superposed an effect that accounts for larger structures, if 


the wind field in the vicinity of significant waves is to be modelled. 


Ocean waves are stochastic; the surface appears to be composed of random waves of 
various lengths and periods with no simple repeating pattern. Yet, one way to model this 
surface is the concept of wave spectrum which distributes wave energy among different wave 
frequencies. This decomposition of the ocean surface into an infinite sum of independent 
propagating sinusoidal waves relies on the fair assumption that non-linear interactions 
between waves are weak. The wave spectrum is established by measurement of the height of 
the sea surface from a fixed location, independently of the direction of waves. Pierson and 
Moskowitz did such measurements at a deep-sea location in the North Atlantic and made the 
assumption of a fully-developed sea to propose a simple, yet commonly used Pierson- 


Moskowitz spectrum [79], where Spy is the wave spectral density. 


8.1 g? —5 (Wy)4 
Spu(w) = 103 ws °P (= t} (111.20 ) 


The notation Wy, refers to the peak angular frequency of the spectrum and is expressed in terms 


of wind speed measured at 19.5 metres, W195, which was the height of the speed taps used in 


experiments. 


g 
Wy = 20 fp = 0.8772 W ( 111.21 ) 


19:5 





The following condition is indeed verified. 
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dSpy 
dw 





(oO (111.22 ) 


Assumption is going to be made that the wavy surface used in the reminder of this 
part consists in a single sinusoidal wave at the peak angular frequency, as presented in Fig. 
III.11, obtained after the wind blew steadily for a long time over long distances. Wind and 
waves have reached equilibrium: the sea is fully-developed. For notational convenience, it will 
be assumed that the wind comes from the North, the x-axis is pointing northbound against the 
wind and z is oriented downwards. The local wind considered is at the origin of the formation 
of the wave system so waves are propagating with a phase velocity c in the same direction as 


the wind. Dispersion relations follow in Eqs. III.23 to III.25. 


_ Jg 
= Wp (11.23 ) 
2 
W@W 
k= of (111.24 ) 
 - 21 
= (11.25 ) 


The equation of the surface Zyqye is given by Eq. III.26. 
Dining = =a cos(kx + Wyt) (111.26 ) 


The amplitude a of the wave is also related to the wave spectrum by assuming the 
peak frequency concentrates all the energy from the wave spectrum. The standard deviation in 


surface displacement obtained from the spectrum is expressed in Eq. III.27. 


2.74 Wigs” 


(y= | Spm (w) dw = a 





(111.27 ) 


For the sinusoidal peak wave expressed in Eq. III.26, this translates into Eq. III.28. 


az 


(Surfwave) = = (111.28 ) 
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Figure III.11: Aspect of a sinusoidal wave induced by a wind blowing from the North. The wind direction 
is pictured by the green arrow 


So the amplitude considered is given in Eq. II.29. 


(oe) 1/, 
a= {2 [ Spy (w) du| (111.29 ) 
0 


The wave considered is therefore highly dependent on wind conditions. It should be 
noted by combining Eq. III.21 and Eq. III.23 into Eq. II.30 that the wave travels faster than the 


wind, as noted by Pierson and Moskowitz. 
c=1.14W95 (111.30 ) 


Therefore, the overall surface Zyqye of the wave is described by Eq. III.26 and it is 
associated with a surface roughness entailed by small irregularities from the main surface. 
Those are only taken into account statistically following Charnock’s model, as described in Eq. 


H1.19. 
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III.2.3 Wind field over Waves 


The undisturbed flow over a rather flat surface has been investigated, together with 
properties of a peak-frequency wave which concentrates all the energy of the wave spectrum. 
The remaining of this part focuses on the way the latter wave alters the wind field around it. It 
is to be noted that several tens of metres above the surface, the mean velocity profile would be 


roughly unaffected so that U; 9.5 is quasi constant, for a given friction velocity. 


Several authors investigated turbulent flow over such a wavy surface, using numerical 
simulations [80]or a theoretical approach [81]. It was decided to use here expressions derived 
out of Benjamin’s development [81], which provides approximations of analytical solutions 
applying stable laminar flow theory. Validity of the solution only requires that the wave 
amplitude is small relative to the wavelength in order to remain out of flow separation cases. 


This condition was chosen following results from [80, 81] and is given in Eq. III.31. 


< 0.02 ie ak < 0.126 ( III.31 ) 


~»|& 


In a region where there is no large adverse pressure gradient, the flow tends to follow 
the contour of the wave in such a way that main features of the undisturbed boundary layer are 
preserved. The flow is assumed to be two dimensional in the {x, z} plane defined earlier, with 
the wind coming from x. An inertial reference frame Ea Vp» ea in uniform rectilinear motion 
at phase velocity c is defined such that it translates with waves but keeps the same orientation 
as {x,y,z}. A surface-fitted curvilinear system of coordinates {€,7}is defined, on the upper 


surface of the wave, as in Eq. III.32, from the set of Cartesian coordinates ie, 25). 


€ = Xp — iae ~K(-Zp—ixp) 
( III.32 ) 


N=-Zp)-—a eo ~k(-Zp-ixp) 


Only the real part of Eq. III.32 is to be considered, which gives Eq. III.33. 


(=x, a sin(kxp)e*”P 


N = —Zpy — acos(kxp)e*P (11.33 ) 


It should be observed that the equation of Zy ye in the inertial frame of reference 


[ps Voi Zot is given by 7 = 0, to the first order in ak. Periodical variations of § andy with x, 
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and Z, are damped by the exponential term inkz,. Hence, far away from the surface, the 
curvilinear coordinates {,7} fits with the Cartesian system 1X5) Vp Zi. The undisturbed mean 
velocity profile is transformed into curvilinear coordinates to follow the contour of the wave so 


that it varies with 7 only. 
ae Uu, n 
u=iu(n) = ——log (+) ( 111.34 ) 


In the present case, the wind is blowing against the x-axis such that the wave moves in the 


same direction. It means that the celerity c is expressed by Eq. III.35. 


u, 19.5 
c=1.14W,,5 = —1.14 —log (—=) ( IIL35 ) 
x ZO 


In the absence of waves, the stream function Wo, characteristic of the flow observed from 


Car Vp) Zz is expressed in Eq. III.36. 


n 
Wo = | {u — c}dn (11.36 ) 
0 


For the disturbed flow, a periodic perturbation is introduced to Wo, under the form specified by 


Eq. II.37. 


p= Wo ta{F(m) + fu —che*"} etks (11.37 ) 


F is a perturbation velocity that must be determined. Components of the velocity parallel 


to €and 7 are expressed respectively in Eq. III.38. 


rey id 
U(n,€) =J /2 an sum 
__ 1/ aw III.38 
Vy, €) = -J /2 FTG 


Where J is the Jacobian of the transformation given in Eq. III.39. 
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az)’ (ae) 
jJ= ea +f ( 111.39 ) 


The set of velocity components (U,V) abides by non-slipping boundary conditions at the 


surface of the wave. 


ree é)=-c 
V(0, E) — 0 ( III.40 ) 


Benjamin [81]decomposes F in an inviscid solution @ completed with a rapid varying 
solution f which is a valid approximation of F close to the surface, where viscosity prevails. The 
region where f is significant is only several times the friction length, to the order of the 
millimetre. It was chosen to ignore that contribution that would have no incidence on flight 


dynamics since the vehicle flies well above. The following approximation for F was chosen. 
F=p~=-{u-c}e™ (11.41 ) 


Speeds in curvilinear coordinates can then be expressed in Eq. III.42. 


(111.42 ) 


[HC ge) = sh fi-¢} 
Vin, €) = 0 


The €—lines are thus streamlines according to this approximation, as visualized in Fig. III.12, 


and the wave influence on the streamlines decays with altitude. 


From Albatross to Long Range UAV Flight by Dynamic Soaring 64 





se Ses ego” if 
7Q = -300 


Figure III.12: Streamlines, pictured in a 2D plane, viewed from a reference frame translating at wave 
celerity. The wind direction is pictured by the green arrow. 


Benjamin concludes that a fair approximation to the flow pattern is obtained simply 
by bending the primary profile. The transformation into Cartesian coordinates{x,, Vp) a) gives 


the respective components of the velocity parallel to y, and Z,. 








— Um $§) Ge {i —c} oe 
*p ple OXy OXy 

(111.43 ) 
— Um) oe fi —c} as 
2p ple OZ, OZ, 

Finally, in the Earth’s reference frame, the wind field is described by Eqs. III.44 to III.48. 
W, = ween a Wore 
‘i = pent 4 yrorbital ( III.44 ) 


wert = a {1+ ake cos(kx + wyt)} (11.45 ) 
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w,pert = a {ak e* sin(kx + wyt)} (111.46 ) 
werbital — —¢ fak e cos(kx + Wyt)} (111.47 ) 
weorbital — _¢ {ak ek sin(kx + wyt)} (111.48 ) 


The superposition of two contributions can be observed. The first comes from the 
main velocity profile being “bent” to follow the curve of the wave. Equation III.45 is applied 
successively at the wave crest and at the wave trough, in Eqs. III.49 and IIL.50, so as to highlight 


the speed up at wave crest and the slow down at wave trough, induced by the bent profile. 


W,Pent (wave crest) =u {1+ ake*7} (11.49 ) 


W,Pent (wave trough) = u {1 — ake*7} (IIL50 ) 
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Figure III.13: Velocity profiles of the wind component due to the wave curvature. The vertical amplitude 
scale of the wave is voluntarily exaggerated. 


Moreover, this bent wind field also induces an upward deflection of the wind on the 


windward side of the wave and a downward deflection on the leeward side, in a wind-over-hill 
fashion, as can be seen in Fig. III.13. 


The second contribution is entailed by the orbital motion of the wave surface, with 


perturbations opposite to those due to the “bent” profile, see Fig. III.14. Indeed, the moving 


wave creates an upward motion of the surface on the forward moving side which is the leeward 


face of the wave, whereas the “bent” profile induces a downdraft component. 


An opposite behaviour between “orbital” and “bent” contributions can also be 
observed on the other face of the wave, windward. It is to be noted, from Eq. III.46 and Eq. 
III.48, that wherever the celerity c of the wave exceeds the local mean wind speed w, the orbital 
vertical contribution is predominant. This condition is verified up to large heights with the type 


of wave considered in the present case. Therefore, the overall wind field sees an updraft on the 
leeward face of the wave, as can be seen in Fig. III.15. 
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Figure III.14: Velocity profiles of the wind component entailed by the orbital motion of the wave. The 


velocity scale is five time that of Fig III.13. 
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Figure III.15: Velocity profiles of the wind established over a moving wavy surface. 


lambda/2 


The methodology followed above enables to set up a model of the wind field within an 
environment that includes moving waves. There is hence an implicit dependence on time, even 
though all phenomena are steady. The vertical wind behaviour, in vicinity of the wave, is 


dictated by the orbital motion of the wave and not by its bending effect on the wind profile. 
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I1.3Summary of Chapter III 





One top of the Wandering Albatross model, three vehicles architectures are 
considered, all based upon classic glider planforms, but equipped with propulsive systems. It 
enables to benefit from known designs, rather easy to quantify and to get rid of uncertainties 
linked to the dimorphism measured in albatrosses. Geometries chosen for those vehicles reflect 
variations in scale and would therefore question the feasibility of DS for various UAV sizes. 
Their aerodynamic quantification is done through simple models that are well-verified for this 
kind of high-aspect-ratio wings. Yet, the smaller UAV is tested during a wind tunnel campaign 
in order to address the issue of smaller Reynolds number. A point-mass model was chosen to 
represent the vehicle dynamics, for its simplicity. It is a sound choice given the scale of 
gradients that are involved in DS, however it bears limitations since any further degree of 


freedom would have to be synthetically constrained in its evolution rather than modelled. 


The description of the environment is a two-fold process. First the theory of turbulent 
boundary layer is applied to a rough flat surface where irregularities are taken into account in 
a statistical way through the surface roughness length. The influence of the wind friction 
velocity and of the surface roughness length on the shape of the wind profile is assessed. This 
later value, which is representative of the nature of protrusions on the surface, must be 
adapted to match conditions encountered. In particular, the surface state, in the case of oceans, 
depends on the interaction between wind and waves and its evolution over time. A simple 
Charnock model is chosen to link the surface roughness length with the wind friction velocity, 
which is a good approximation for a long fetch. Secondly, the influence of surface variations of 
bigger scale is assessed. A regular sinusoidal wave is considered out of the Pierson-Moskowitz 
spectrum, at the peak frequency, and it is assumed to concentrate all the energy from the 
spectrum. The subsequent amplitude and wave length obtained make it correspond to a swell. 
The simplicity and regularity of the wave pattern is suitable for an analytical approach of the 
wind field. Stable laminar theory is applied to this shallow wave train, which highlights a 


predominant influence of the orbital motion of the wave on the vertical wind profile. 


Models of the vehicle and of the environment are therefore set and ready to be 
considered as inputs to the flight model, which focuses on the motion of the vehicle within its 


environment. 
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CHAPTER [VY 


MOTION & OPTIMIZATION 


Once respective models of the wind and the environment have been investigated, the 
trajectory that makes those two interact must be found. There lays a methodology which is 
specific to the general topic of atmospheric energy-extraction topics, and which consists in 
providing a trajectory that is considered an optimum. The success of the approach undertaken, 
which would lead into the formulation of a three dimensional trajectory, would not mean that 
DS flight is feasible for a UAV, but rather describe the trajectory it would fly, if it were to 


perform DS. 


IV.1 Equations of Motion 


IV.1.1 Point of View 


Any motion is relative to the point of view of its observer. And so are the physical 
models built to explain it. If well expressed, they would eventually lead to a consistent picture, 
where the two versions of the analysis are simply two ways of approaching the problem. As it 
has been underlined already, two relative motions, and therefore point of views, are involved 
when it comes to analysing the flight of a vehicle in a windy environment. One is relative to 
earth, while the other is relative to the surrounding air. The following lines have more about 
how the choice of one of those two points of view affects the understanding of the physics 


involved. 


Let’s consider a vehicle and the mass of air altered by the vehicle path from instant t 
to instant t + dt, such that the system {vehicle + mass of air} is isolated. Hence, it does not 
exchange any energy with the outside; all energy transfers are internal to the system. We aim 
here at getting a qualitative understanding of the internal transfers of total energy between the 


two systems {vehicle} and {mass of air}, depending on the point of view chosen for observation. 


From an Earth based point of view, it has been established that the vehicle gains 


energy during an upwind climb. As the system {vehicle + mass of air} is isolated, when the 
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energy of the {vehicle} increases, the energy of the {mass of air} decreases by the same value. 
This case is represented in Fig. IV.1. The {vehicle} extracts energy out of the {mass of air} when 
the vehicle flies through the mass of air. The concept could be compared to wind turbines, 
where downstream air is less energized than the upwind air as the wind turbine gets energy 


out of it. 





Wind field (t) o 
T Vehicle total a — total Ene rey = 
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Figure IV.1: Energy transfer from an Earth based point of view. 


When the same manoeuvre is observed from an air relative point of view, in Fig. IV.2, 
the {mass of air} has initially no relative velocity, hence no kinetic energy. When the vehicle 
flies through it, the circulation around its wing entails some perturbations within the mass of 
air, creating vortexes and unsteadiness. The {mass of air! receives some energy during the fly- 


through which is extracted from the {vehicle} energy level. 
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Figure IV.2: Energy transfer from a wind based point of view. 


The very simple understanding of energy transfers differs depending on the point of 
view chosen. The notion of energy is therefore relative to the reference frame chosen to 


describe kinematics. 


Three dimensional inertial state variables as well as their air relative counterparts are 
introduced in Fig. IV. 3 and Fig. IV. 4 respectively. The flight path angle (y) and heading angle 
(w) are introduced, as well as the bank angle (). Lift (L) and drag (D) keep the same notations 
as before, as well as the speed (V). An inertial earth reference frame Ry (X, y, Z) is arbitrarily 


defined by orientating axis X, y and Z northwards, eastwards and downwards respectively. 


The inertial speed vector is oriented with respect to po through the inertial heading 
angle and flight path angle, as defined by Fig. IV. 3. The inertial speed vector orientates a frame 
R; (X;, Y;, Z;) such that it stays directed along X;. The subsequent frame ; is then obtained from 
Ry by rotation of w; along Z and of y; along y;. Despite the subscript “i”, the frame &; is not an 
inertial frame, since it rotates with the inertial speed. The same applies with the airspeed, with 
a subsequent frame Sa, as can be seen in Fig. IV.4. Aerodynamic forces are oriented by the 
airspeed such that the drag is opposed to the airspeed while the lift can rotate by ¢ in a plane 
orthogonal to the airspeed vector. It should be mentioned that by assuming no sideslip angle, 


the attitude of the vehicle can be determined. Indeed, the longitudinal axis is therefore included 


within the (X,,Z,) plane, while wings are orthogonal to the lift vector. 
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Following those early definitions, the inertial and air relative velocity vectors can 


hence be expressed as follow. 


Vj = Vj. %; (IV.1) 
Va =Vy.Xq (IV.2 ) 


The relation between speeds is given by Eq. IV.3. 


+W (IV.3 ) 


Nl 


V;, = 


a 
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Figure IV.3: Orientation of the inertial speed with respect to the reference frame Ro 
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Figure IV.4: Orientation of the airspeed and of aerodynamic forces, with respect to the reference frame 


Ro. 


Unless stated otherwise, it is assumed in the remaining of the document that the wind 
is unidirectional, coming from the North, such that it is opposed tox. This decision is 
particularly convenient for both calculation and trajectory visualization since it emphasizes the 


behaviour of the vehicle with respect to the wind. Although this choice is arbitrary, the 
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decision to align & 9 with the compass rose was all the same. Therefore no limitative 
assumption is involved in this choice and the whole spectrum of potential vehicle's kinematics, 


with respect to the wind, is unchanged. Therefore, the wind vector is expressed as follow. 
W = W,(x,y,z,t).% = Wy. (IV.4) 


The airspeed can be expressed as a function of the local wind strength, the inertial 


speed and the inertial flight path and heading angles. 





V, = _|V;2 —2.V;.W,.cos w;.cos y; + W,” (IV.5 ) 
Besides, air relative angles can also be deduced as follow. 
. Vi. 
SIN Yq ~ 7 (IV.6 ) 
a 
V; cosy; siny; 
sinh, =——T— (IV.7) 


Va COS Vq 


Practically, the flight path angle will be limited to the interval] ~"/5; /5[, such that Eq. IV.8 
can be deduced from Eq. IV.6. 


COSY, = V1-sin* yy (IV.8 ) 


Eq. IV.9 completes the set of respective sinus and cosinus expressions. 


_ Vicos yj; cos pi — W, 


COSWqa = (IV.9 ) 


Va COS Vq 


IV.1.2 Earth Point of View 


Equations of motion of a point-mass model flying through a windy environment are 
derived in this section. Equations of motion will be referred to as EoM in the remaining of this 
thesis. Kinematics is observed from the earth-based inertial reference frame y. Newton’s 


second law is written down in Eq. IV.10. 
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dV, se 
m Tp Re = 3 FOrceSext-vehicle (IV.10 ) 


The left-hand side of Eq. IV.10 is developed by applying the derivation formula to the 


inertial speed, as follow. 


dV, dV, : , 
Tr Re = GR + QR/R,N V; (IV.11 ) 


The rotational speed of the reference frame ; with respect to Ro is given by Eq. IV.12 
DR Ry = ViVi + Wi Z; (1V.12 ) 
Eq. IV.13 is obtained by combining Eq. IV.11 and Eq. IV.12. 


dV, 


Tp Re = V; Xj + V; COS Yj Wi; WA —V; Vi- Zi (IV.13 ) 


External forces applied to the vehicles are the Lift, Drag and Weight. The lift is 
oriented by the bank angle with respect to the reference frame Ra. The expression of the lift is 
reminded in Eq. IV.15. It should be mentioned that the air density p is considered constant over 


the small altitude ranges of interest. 
L=Lsingoy,—Lcos oz, (1V.14) 
1 2 
L = 5P.5. C,. Va (IV.15 ) 
The drag is opposed to the airspeed vector, along XQ. 
D=-D x, (IV.16 ) 


1 Z 
D = 50-5. Cp. Va (IV.17 ) 
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And the weight is trivially expressed as follow. 


Weignt = +mgz (IV.18 ) 


Equations are chosen to be projected on i. All forces must therefore be successively 
projected on a and then on Ri. Contributions from the three forces listed above are summed 


up along the main axis (X, y, Z) of Ro. 


FOrCeSoesveniia = Eye + BR, y+ FZ (IV.19 ) 

FE, = —L (sing sinw, + cos @ cos Wy sinYq ) — D COS Yq COS Wa (1V.20 ) 
Fy = L (sing cos Wq — cos h sing Sin Yq) — D cos Yq Sin Wq (IV.21) 
F, = —L cos @COS Yq + D COS Vg + mg (IV.22 ) 


Then, a further projections along (X;,y;,Z;) of R; is conducted in order to get to the 


first three EoM. 


mV, = Ff. cos y; cos W; + Fy, cos yj sin Ww, —F, siny; (IV.23 ) 
mV; COS Yj Wi = =i, sin Wi oF Fy COS Wj ( IV.24 ) 
—mMV; ¥; = F, siny; cos p; + Fy siny; siny; + F, cosy; (IV.25 ) 


Those are completed by the three dimensional evolution of the position of the vehicle. 


x =V; cosy; cosy; ( IV.26 ) 


y =Vi, cosy; siny; (IV.27 ) 
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zZ=—Vjsiny; (IV.28 ) 


Therefore, Eqs. IV.23 to IV.28 describe the evolution rate, with time, of inertial state 
variables {V;, Wj, V;, xX, y, Z}. Those equations indirectly include variables that are not piloted by 
any others, that are {C,,@}. Those are the variables which pilot the evolution of the system, 
they are control variables. Besides, it should be mentioned that the wind input consists only in 
the local wind strength W,, which is taken into account as for {V,, Wg, Yq}. Therefore, the local 
evolution of inertial state variables depends only on the local wind strength encountered by the 


vehicle and is independent of the local wind gradient. 


IV.1.3 Air relative point of view 


An alternative choice is to base the point of view on the surrounding air, such that air 
relative kinematics is observed. The subtlety in the case of DS simulations is that the 
surrounding air moves at different speeds depending on where the vehicle is, due to the 
presence of wind gradients. Hence, a reference frame linked to the air around the vehicle would 
see accelerations when the vehicle crosses regions of wind shear, such that this reference frame 
is not inertial. Fictitious forces must be added in order to describe the motion from this point of 


view. 


An air-based frame is defined such that it is moving at the speed W,(x, y, Z,t).X, 
where {Xx, y, Z} is the position of the centre of gravity of the vehicle. All axes remain parallel to 


those of Ro, The inertial force is given by the local air acceleration at {x, y, z}. 

















+ =<) z ae OW, . OW, . OW, ] 3 
x = —mM. Z| .X 
Ro 


Finertial = —™M. Dt Ot ah Ax XxX + ay yr ax” (IV.29 ) 


The wind is taken to be unidirectional and steady, coming from the North, defined by 


the following equation. 


ux, 
W, = -—log — 
2 Y Og Zo (1V.30 ) 


Hence, the inertial force is expressed as follow. 
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This yields the following expression as for the inertial force. 


. . — iS x 
inertial yz 
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(IV.31) 


(IV.32 ) 


(IV.33 ) 


The derivation formula is applied to the airspeed vector and generates the following 


components. 


dV, 


TH Re = Va Xa + Vair COS Yq Wa Ve —Va Vea 


(IV.34) 


Finally, vector equations are projected on &. to entail the following system of EoM. 


, Uu, Z 
mV, = —D-—mgsinyg +m — . COS Yq COS Wa 
U, Z| 
MV, COS VqgWqg =Lsing-m oe 7 sinWga 
—MViVq = —Lcos@+mgcosyqg +m = poe Wa SINVq 
x = V, COS Vq COS Wg + Wy 


yY=V, COS Yq sSinWq 


Z=—V,sinVq 


(IV.35 ) 


(IV.36 ) 


(IV.37 ) 


(IV.38 ) 


(IV.39 ) 


(IV.40 ) 
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The set of Eqs. IV.35 to IV.37 shows that air relative state variables have their rate of 
evolution as a function of the local wind gradient, described in Eq. IV.32. This dependence 
comes through the presence of the inertial force that itself accounts for the acceleration of the 
air relative point of view. This term, which has indeed the dimension of a force, is sometimes 
referred to as the “dynamic soaring force” or the “fictitious force” [50]. Control variables{C,, } 


are trivially the same as before. 


To put in nutshell, two different sets of equations of motion can be considered. Those 
are representative of the same system evolution observed from two different points of views. 
One focuses on inertial variables, and highlights that their rate of evolution depends, among 
others, on the local wind strength encountered. From that perspective, the significance of wind 
gradients is not emphasized at that stage and the set of EoM is not much different than that of a 
point-mass glider in still air. The alternative is to use air relative variables and equations, which 
then makes the local wind gradient appear. The role of wind gradients in the energy extraction 
process can already be partially comprehended when observing the motion from an air relative 
point of view. Besides, different systems of EoM can be derived out of a single point of view, 


such that the proposed equations here are not exclusive [50, 67, 49, 47]. 


Yet, whatever the chosen point of view and the related set of equations, the local 
evolution rate of state variables, either inertial or air-based, is not sufficient to describe and 
simulate DS flight. Indeed, as it was underlined during Chapter I, DS can only be approached 
along a whole cycle of trajectory. For static soaring flight techniques, an instantaneous 
description of the system is repeated identically over time and can therefore be studied over 
any fixed time position, independently of the state and control histories. However, DS requires 
integrating the evolution rate, given by EoM, over an undetermined cycle, in order to simulate 
and understand energy-harvesting mechanisms. Hence, the fact that wind gradients terms do 
not appear in inertial EoM does not mean that wind gradients are not significant for the motion 


as a whole. 


This raises the next challenge regarding DS flight simulations, which is to integrate 
EoM to form state and control histories over a cycle yet to be determined. The way control 
variables pilot the evolution of state variables over time is also unknown, such that state 
variables cannot be explicitly integrated. The problem consists in finding a cycle of coherent 
state and control evolution over time that permits to extract energy from the wind in the 


process. 
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It was chosen to conduct all calculations using the set of inertial equations of motion 
Eqs. IV.23 to IV.28. It was considered that it would be more intuitive to deal directly with 
inertial variables since ultimately earth-based travel performances are those of primary 
interest. Other studies opted for the same option [47, 43]. Still, the choice to go for air relative 
variables could also have been defended, as those are somehow more intuitive to deal with 
when it concerns a flying object. That was the choice of various authors [46, 49, 48, 67, 68, 50, 
57]. 


IV.2 Optimization 


One way to address the issue is to set and solve an optimization problem in order to 
yield the best achievable solution. It goes through a particular methodology that starts by 
identifying the objective and the constraints and then follows by converting the subsequent 
analytical problem into one of finite dimension, involving a limited number of parameters. 
Eventually, specific numerical techniques are applied to the underlying parameter optimization 


problem to converge to a solution. 


IV.2.1 Optimal Control Problem 


The classical approach to set-up an optimization problem is to identify the variables, 
the objective and the constraints. The general formulation of the optimization problem is then 


defined as follow. 


Maximize/Minimize: Objective(variables) 
with respect to: variables 


subject to: constraints 


In our case, variables would regroup all variables that must be iterated on as part of 
the optimization problem. Since both state and control variables are unknown, the set of 
variables includes at least those variables. Besides, the cycle duration is also unknown and 
should be set free for the optimization to run on it. Indeed the trajectory which optimizes the 
objective would have a specific duration, to be determined. Practically, an initial time ¢, is set 


to zero and the final time ty is chosen as a variable. So the set of variables passed to the 


optimization problem is at least {V;,W;,V;,x, y,Z, Cz, , tr}. However, further variables can be 
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considered, such as vehicle design variable or environment variable. Especially if the wind 
strength is also unknown, which would happen in many cases, the friction velocity u, is also a 


variable to be determined. 


Regarding the objective function, numerous choices can be considered. We would 
focus on two different ones. The first and most trivial consists in minimizing the wind strength 
required to perform DS under certain operational constraints. Then, another objective would be 
to minimize the consumption per unit distance travelled. This latter obviously only makes 
sense in the case of a powered vehicle. The battery state (Bat) must then be taken into account, 
its derivative is given by the power exerted by the thrust 7. Just as a reminder, 1y7op is the 
efficiency of the propulsive block, including electrical, mechanical and aerodynamic 


efficiencies. 


a 


Bat = — 





=. (IV.41) 


Table IV-1 sums up the different objective functions that are approached in the 
framework of this research project. Once again, it constitutes a deliberate choice to focus on 


certain problematics and many further objectives can be considered. 


Table IV-1: Variation in the objective function in the framework of this thesis 








Objective Function Function Associated 
Explicit formulation title scalar Constraint 
Obtain the lowest wind at — Energy neutral cycle 
which DS GidnOis Minimize (OU u, No Thrust 

velocity tens 

sustainable. Periodicity 
Obtain the lowest electric Consumption 
consumption per unit Aimee per unit ABat Energy neutral cycle 
distance travelled by distance [Ax2 + Ay? Periodicity, all but Bat 
taking advantage of DS. travelled 


Constraints encompass all the standards that must be complied with by the eventual 
solution of the optimization problem. To start with, equations of motion must obviously be 
verified such that the solution is physical in view of the models considered. Then, periodicity is 
an essential constraint regarding eventual DS trajectories. Indeed, the problem does not consist 
in extracting energy over a certain period, but in finding a way to make that energy-extraction 


process repeatable and therefore sustainable. The solution must hence consist in a path where 
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at least some of state and control variables come back to their initial values at the end. 
Depending on the mission scenario, those periodicity constraints can be eased out to allow for 
x, y displacement. Then operational constraints are considered. Some can be expressed quite 
straightforwardly, such as avoiding stall, or maintaining ground clearance. Others are 
implemented to make up for limitations inherent to the point-mass model of the vehicle. 
Limitations on control rates and angular accelerations would ideally be implicitly accounted for 
through the presence of further equations of motion, relative to further degrees of freedom. 
They can be rather estimated using alternative analysis, in order to augment the reality of the 


point mass model. 


The control rate bounds, respectively C 7, and d, are each associated to limitations in 
control surface deflections and size. Although the coefficient of lift is considered as a variable, 
it is in fact not piloted directly, since the attitude of the vehicle is piloted, not the angle of 
attack. However, limitations in pitch rate induce subsequent constraint on the feasible C,. 
Besides, using conservative values for this rate limit allows ensuring that the evolution in 
coefficient of lift corresponds to a feasible variation, despite the mentioned inaccuracy inherent 
to the point-mass model. A roll rate of 90°/s was given for Mariner [68], while a much lower 
value of 30°/s was given for Cloud Swift [70]. Such a strong disparity can be explained by the 
ailerons representing a large portion of the chord (35 %) over 90 % of the span for Mariner, 
while Cloud Swift was design as a classical glider, with ailerons on the outboard part of the 
wing only. The maximum rolling rate for the DT-18 is considered to be 60°/s. Given the 
approximate similarity in horizontal tail control surface sizing for all three vehicles, the same 


bound value of 0.5/s for C,,, given by Bower [67], is chosen. 


Bounds on angular acceleration translate into limits in C,, and @. Those are significant 
to ensure that control variables follow a rather smooth evolution over time. One approach to 
get to a first-order approximation of the roll acceleration is to calculate the maximum rolling 
moment that ailerons can produce for a certain speed. The following equation expresses this 


dependence. 


ee | 
Iyx P = 2 PS UCRou V7 (IV.42 ) 


I,,is the longitudinal inertia, / the mean aerodynamic chord and Cp,) the rolling 


moment coefficient. The maximum rolling acceleration is hence given by: 
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- 1 
= 2 


The maximum rolling moment coefficient is given for the DT-18 by the wind tunnel 
campaign and for Mariner from Bower [67]. No data is available for Cloud Swift and it is 


assumed that it achieves the same maximum rolling moment coefficient as the DT-18. The 


< ae ‘ e : P -2 : 
maximum value for C;,, is much less intuitive and is considered to be 2 s ° for all vehicles. 


As for remaining constraints, the maximum load factor was considered to be 
Nmax = 3 for all vehicles, although Lawrance [70] recommends only 2 for Cloud Swift. The 
reason for that is our wish to let structural considerations aside, when comparing vehicles in 
DS flight. On the contrary, a limitation that marks a difference between vehicles is the ground 
clearance constraint. One option is to limit the lowest altitude at the centre of gravity, which is 
the point where the wind is computed. A refined alternative is to impose a wing tip clearance, 
obviously on both wings, which permits to assess the influence of the span during turns close 
to the surface. The wing is oriented orthogonal to the airspeed vector and to the lift vector 


which yields the following expression for the semi span vector. 
GTip or Wf (cos Va + sing Za) (IV.44 ) 


Hence, by defining a positive ground clearance Geieg;, the wing tip clearance is 


expressed by Eq. IV.45. 


Zx b/. sin @ COS Vq S —Gelear (IV.45 ) 


A summary of the different bound and constraints values is summed up in Table IV-2. 
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Table IV-2: Summary of constraints expressions and bounds 


Parameters 
Formulation 
Constraint DT-18 Mariner Cloud Swift 


Equations of Motion Eqs. IV.23-IV.28 See Table III-1 
Flight path bound VAS Vac Vide = 65° 


Final time bounds bee, S te St to be determined and adapted 
min f sup ip 


{Vi, Wi Yiv%,y,Z, Ci, Po 


Periodicity ={(VWe Vr X yz, Ci Phy to be released for x,y in case of travel 





Net travelling direction Ay = tan Wnet AX Wnet to be adapted in function of heading wanted 


Rolling bound ld| < dmax max = 85° 


pSC, Va" 





Max. load factor jane Nmax = 3 
Ground Clearance Z + a sin @ COS Yq < —Gelear Getear = 50cm 


C,, rate \C, | GC mi C max — O-? il 


CRoll max = 0.32 CRoll max = 1.43 CRoll max = 0.32 






Rolling acceleration 
Iex=0.184 kg.m*| Ly~=0.378 kg.m’ | Lyy=1.344 kg.m? 


C,, double continuous Cr | <C, a GC. max = 2 e= 


The general formulation of the optimization problem has been stated and that its 
terms have been expressed. At this point, the problem is expressed throughout analytical 
expressions, whether those are objective function or constraints. Those must be converted into 
a limited number of parameters in order to take advantage of numerical optimization 


techniques. 


IV.2.2 Conversion into a Parameter Optimization Problem 


The optimization problem formulated above must be converted into a finite-dimension 
optimization problem. This process involves different calculations steps. First, the time interval 


is divided into a number of subintervals, which define the future calculation nodes. Although 
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the final time ty is unknown and part of the variables passed to the optimization problem, a 
time step At can be implicitly defined by fixing the number JN; of time steps. 
te — ty 


At = 
Nt —1 


(IV.46 ) 

Therefore a set of N; time nodes ae ey ly ty, = tf } equally spaced by At, forms a 
number N; — 1 of time subintervals. Variables are also broke down into discrete sub-variables 
at each time node, where X; represents the value of variable X at the time node t; . Each state 
and control variable is then represented by N; discrete variables over the time interval ic ty |. 
So far, the evolution of the system with time is modelled by equations of motion, which are of 


the following form, for a state variable X and a control variable U. 


X = f(X,U,t) (IV.47 ) 


A way to involve the set 2G seg X Pees A rh, of dimension N;, in order to represent the 
behaviour expressed by Eq. IV.47 must be found. The analytical integration of Eq. IV.47 over 


one time subinterval |t;; ty41]| gives Eq. IV.48. 


tk+1 
Xk+1 — Xx — f(%, U, t).dt ( IV.48 ) 


tk 


If the evolution of the control variable was the only unknown to be found and 
optimized, then Eq. IV.48 could be integrated explicitly through classical Runge-Kutta 
techniques. That can happen when a trajectory is to be followed (through some waypoints) but 
the optimal way to pilot that trajectory must be established. The Eq. IV.48 could be interpolated 
in one go from ¢ to tr by using a time history of control variables formed by interpolation. 


This method is called direct shooting in the optimization literature [82]. 


However, in our case, state variables are also unknown and therefore the value of 
X, is not available to evaluate f(X;,Ux,t,) to get to X;,4,through classical Runge-Kutta 
integration techniques. Still, the integral in Eq. IV.48 can be approximated by involving discrete 


variables at nodes k and k+1 via a numerical integration scheme. The most basic of those 
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schemes would be the rectangle rule, where f is supposed to be constant over the 


subinterval [t,; t,41], which provides the following. 


Ck+1 
Rectangle rule: f(X,U,t). dt ~ f(X,, Ux, ty). At (1V.49) 


tk 


Despite being of appealing simplicity, the calculation error inherent to this 
approximation is high. Indeed, the rectangle rule is a first-order rule, which means that for a 
given value of X;, the obtained X;4, only match the Taylor series expansion through the first 
term. The local truncation error (LTE), that is to say the error brought by the integration rule 
over one step, is proportional to At”, in the case of a first-order scheme. The global truncation 
error (GTE) is one order of the step size less than the LTE [82], that is to say is proportional 
to At in the case of a first-order integration technique. A more accurate approximation can be 
obtained by using a higher order integrator. Indeed, the function f can be approximated by a 
polynomial interpolant. One option for converting the optimization problem is named 
collocation technique and uses Simpson’s one-third rule to approximate the integrand function 
f (83, 84]. This alternative is commonly used for trajectory optimization problems [83]. It was 
hence chosen to approximate f by a quadratic polynomial which interpolates the function at the 
endpoints of the subinterval [t,; t,4,], as well as at the centre point t, of the interval. 
Lagrange polynomial interpolation can be used to find the interpolant. After calculation, the 


integration over the time subinterval [t,; t,44]| gives the following terms. 


Ck+4 At 
F(X, U,t).dt = re Lf (Xx, Ux, ti) +r Af (Xm, Um, tm) + f (Xen Usa tr+1)] (IV.50 ) 


tk 


The three interpolation points are called collocation points. It should be noticed that 
nothing guarantees that the slope of the polynomial is continuous between two time 
subintervals. Indeed, each interpolant is only defined over one subinterval, independently of 
the previous and the following one. Simpson’s one third rule is a fourth-order integration 
technique, which means that the GTE is proportional to At*. The integrand must be evaluated 
at the mid-point of the time subinterval, where state and controls are a priori not known, since 
they are only defined by their values at the endpoints of the subinterval. Hence, a state and 
control history must be approximated in order to get values at the centre of the interval. 
Because the interpolant f, which is the derivative of Xover time, is approximated by a quadratic 


polynomial over [t,; tx41] , X must be approximated by a cubic polynomial. Hence, four pieces 
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of information must be known about the state history in order to interpolate by a polynomial of 
order three. State values at the endpoints are used {X;,X;4,}, as well as the slope at the 
endpoints {f (X;, Ux, th)» f Xk41 Vest, te41)}. The interpolation is made with a Hermite-cubic 
polynomial P;,; that is constructed out of those four pieces of information. By using notations 
F(X, Uz, te) = fy and f (Xk41, V4.1, tk41) = feti, the polynomial Py represents the time 
history of X(t) as given by Eq. IV.51. 





(C= ta) ( 2(t — | 
Py (t) = ———— } | 1 — ——_].X, + (t — tz). ff 
“ (ty — te+1)? te —teai] * ~~" 
(e~ te)” |(,_ 2 - te QF ee) 
(tr41 — tr)? tr41 “Wir Jf vee cal 
Further calculations give Eq. IV.52 by applying Eq. IV.51 at the centre of the interval. 
te + tay Xk + Xe. + X44 
Pi (ln) = Poy (-&~E#4) = Ee OF — fi (vse) 


To put in a nutshell, EoM are implicitly integrated over each time subintervals by 
approximating the derivative function with a quadratic polynomial. Because state variables are 
not built through the numerical integration but rather involved in the calculation, the 
integration of EoM yield one subsequent constraint per time subinterval. For each one of those, 
this constraint is formed out of Eq. IV.50 and Eq. IV.52 and is called the Hermite-Simpson 
system constraint [83]. For each state variable {X > Gare ¢ ‘| , over each time 
subinterval [t;,; t+], it comes in the form of a residual R!, which is expressed in Eq. IV.53 and 


must be driven to zero in the iteration process. In the formulation below, X; no longer 


yt 
represents the variable X at nodek, but the vector of state variables X;, = Xing Xe | at 


node k. 
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. At 
Ry = Xk41 — Xk - ra Lf Xn Urs te) + AF Xv Ur tm) + f Kerr Urs te+1)] 





fy Akt Awe Ae 
m 2 ; co fraad (1V.53) 
; k k+1 
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; _ fe tlk 
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IV.2.3 Solving Methodology and Tools 


Let’s begin by defining two important terms that come into play in terms of 


optimization. A generic function of n-variables is defined in Eq. IV.54. 


RR" OR 
Py sey Xn) f (X1) 0 Xp) (1V.54) 


The gradient vector is given by Eq. IV.55. 








(IV.55 ) 


The Hessian matrix is symmetric and given by Eq. IV.56 and is the matrix of second-order 


derivatives. 








[af af) 
| Ox" 0x 0% | 
Hr = | : ( IV.56 ) 
|_ af af | 
lax, ax, OX? | 


Both the gradient vector and the Hessian matrix come into play when assessing the 
second-order condition, in order to conclude on the eventual optimality of a functioning point. 


This condition is stated in Eq. IV.57 [85, 86]. 
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For X* € R",if Hy ts continuous over an open neighborhood of X*,then 


X*is a local min. (max.) © ? is definite positive(negative) (IV.57 ) 
Vf(x*) =0 

The above standard comes from the Taylor series expansion of function fat the second 
order. It highlights that both the function gradient and Hessian are required in order to solve a 
basic unconstrained optimization problem where f is the objective function. Practically, in the 
field of numerical optimization, inequalities-constrained-problems are reduced to the form of a 
simpler unconstrained-problem, at the expense of adding further variables [85, 86]. For 
instance, constraints are taken into account through the set-up of a Lagrangian merit function 
which is the addition of the objective function with terms proportional to the constraints of the 
problem. The proportionality is set by Lagrange multipliers, one per constraint, that are new 
variables to the problem. An unconstrained problem, where the objective function is formed by 
the Lagrangian merit function is subsequently solved. It means that gradients of all constraints 
must be known, on top of that of the objective function. Those very brief lines are to introduce 
the basic features of the solver we chose to use. It was selected out of the different solver 
offered by the online NEOS server [87], as for nonlinearly constrained optimization. Further 


references are available about NEOS [88, 89, 90]. 


The Sparse Nonlinear OPTimizer (SNOPT) solver was selected after runs on the 
various solvers offered by the NEOS server, for its reliability as well as its rapid convergence. 
SNOPT uses sequential quadratic programming methods to solve nonlinear constrained 
optimization problems with smooth nonlinear functions and constraints [91]. Basic features are 
that the sequence of quadratic problems is guided by line-search techniques which requires 
first-order derivative to be available. Yet, the Hessian of the Lagrangian merit function is 
approximated by using a quasi-Newton method based on the BFGS updating formula [85, 91]. 
Besides, SNOPT assumes that constraint gradients are sparse. This fits quite well within the 
scope of our optimization problem, since a majority of our constraints come from EoM that are 
each discretized into a number N; — 1 of residuals, expressed in Eq. IV.53. Those residuals, at 
time step k, only depend on state and control variables at time step k and k+1, therefore 


resulting in sparse gradient vectors. 


SNOPT requires first-order derivatives to be available as well as the general structure 
of the optimization problem to be properly set. An intermediate modelling tool is used to that 


aim, it is A Mathematical Programming Langage (AMPL). It provides an interface through three 
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input files that sum-up the optimization problem formulation (.mod), the value of parameters 
(.dat) and the solving and output options (.run) [92]. This interface permits to build up large 
scale problems through a step by step declaration of variable and constraints, while AMPL 
detects eventual substitutions as well as the general structure of the problem. It then 
communicates the problem representation to the solver SNOPT. This includes first-order 


derivatives, that are computed by AMPL using Automatic Differentiation [93]. 


IV.2.4 Local Optimizer and Sensitivity to Initiation Variables 


The solver SNOPT is guaranteed only to find a locally optimal point [91]. However, it 
can be generalized into a global optimum point only if the objective function to be minimized 
(alternatively maximized) is convex (alternatively concave) and if the associated feasible 
constraint region is also convex [85]. Hence, results from the optimization process are a-priori 


local solutions. 


Furthermore, the feasibility of the convergence, as well as the nature of the optimal 
point may depend on the way variables are initialized. Indeed, within the space of optimization 
variables, the solver starts to iterate from a specific set of variables and then guides its search 
until it eventually converges to an optimal point. Hence different sets of initiation variables 
may lead to a problem that is feasible or not and may influence the convergence towards a local 
optimal point rather than another. In our present case, there are a total of 9 * N;variables 


passed to the solver. Assuming that each variable can take N,,values, the number of different 


combinations amounts to N,,”*"*. Any reasonable assumption as for N, and N; yields a number 
of combination of such a magnitude that testing the whole range of starting points is out of the 
question. Not to mention that every combination would be tested by actually running the 
solver which takes several seconds for problems of our scale. Hence, selecting appropriate 
initiation variables would need a trial and error approach, associated with an eventual physical 


intuition of what the optimal solution may look like. 


IV.3 Validation 


IV.3.1 Simulating Albatross Flight 


One computation is made with an optimization problem matching the case from Sachs 
[47], in order to validate the optimization methodology as well as provide a classical study case 
of DS over a flat surface. All vehicle properties, environment conditions and optimization 


constraints mentioned by Sachs are reproduced here. The wind field is that of a boundary layer 
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developed over a rather flat surface, the surface roughness length is 3 cm. The vehicle is the 
Wandering Albatross, the minimal altitude was set to 1.5 metres, the maximal load factor to 3 


and the bank angle is limited to 80”. 


Regarding discretization methods, the number of time steps N; is varied to assess the 
influence on the resulting objective function, in this case the minimum wind friction velocity 
u,, and on the calculation time from SNOPT, referred to as tsygpr. Results are summed up in 


Table IV-3. N,=51. 


Table IV-3: Variation in N, and its influence on validation results 








N,; Objective u, (cm.s’) tsvopr (s) tr (s) dt (s) 
41 60.533 3.07 7.018 0.175 
51 60.555 3.85 7.010 0.14 
61 60.566 8.13 7.008 0.117 
$1 60.574 8.44 7.007 0.088 
101 60.577 29.47 7.000 0.070 
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Figure IV.5: Energy-neutral trajectory for conditions representing Sachs [47]. The surface is rather flat 
with Z)=3 cm. The wind friction velocity required to sustain DS is 60.6 cm/s, the duration is 7 seconds. 
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In order to keep the calculation time rather acceptable without sacrificing on the 
accuracy of the solution, it was chosen to perform the validation case with N;=51. This value 


could also enable to draw comparisons with results obtained by Bower [67]. 


The resulting energy-neutral cycle described in Fig. IV.5 is obtained by minimizing the 
friction velocity required to perform the neutral cycle without any power input from the 
vehicle. Figure. IV.5 pictures in black the overall flight path, while 2-D projections are in blue. 
The wind profile and direction are plotted in green and a schematic Wandering Albatross is 
plotted in grey, at the corresponding dimension scale, such that it gives an idea of the attitude 
adopted along the cycle. Overall results match very well the simulation made by Sachs, as can 
be compared in Table IV-4. Some minor disparities can be explained by differences in the 
methodology between Sachs’ simulation and the present one. Indeed, some practical aspects, 
such as for instance the number of time discretization nodes N;, or the selected integration 
scheme, are not specified in Sachs’ publication and therefore probably differ between the two. 
Results obtained differ from Sachs’ by less than 5 %, hence validating the methodology set in 


the case of DS over a rather flat surface with a 3 cm roughness length. 


Table IV-4: Comparison with simulations obtained by Sachs [47] and Bower [67]. 








Variables Sachs [47] Bower [67] Current Disparity with 
Sachs [47] 

Cf (s) 7.1 7.0025 7.0102 2.8 % 

u, (cm.s‘) 60.7 60.42 60.555 0.2 % 

Max. height (m) 20.5 20.038 20.1 2.0 % 

Eastern net 9.37 9.38 9.39 0.1 % 


speed(m.s ’) 
Eastern period(m) 66.5 65.71 65.8 1.1 % 
Therefore, the validation case seems to validate our approach and the associated 


methodology. It entails a solution, in terms of trajectory, which appears both non-trivial as well 


as physically sound and corresponds to results obtained in the literature. 
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IV.4Summary of Chapter IV 





The observation of the physics of the vehicle and the equations that govern it are 
dependent on the point of observation. In the present case, it is either earth-based or air- 
relative. After the influence of this choice on the energetic consideration is discussed, the two 
corresponding sets of equations are expressed, which involve state and control variables. In the 
case of air relative equations, the inertial force, due to the acceleration of the point of view, 
leads to the expression of a term that is the driving force of the movement. This is more diffuse 
for the earth-relative point of view, but it is decided to go ahead with this set of equations, 


partly because range performances are earth-relative. 


The evolution of state variable over time depends on that of control variables and 
those need a guiding line in order to pilot the trajectory. This is achieved by setting up an 
optimization problem in order to provide the set of values over time that yields an optimum. 
The objective constraint, at first, is to lower the wind strength required to perform an energy- 
neutral trajectory. This is complemented by constraints that include periodicity, in order to be 
able to repeat the flight path, and operational limitations, so as to ensure the flight path can be 
reasonably flown for the vehicle. The trajectory optimization problem is then converted into a 
parameter optimization problem by discretizing the time interval and allocating discrete values 
by mean of adequate interpolation and collocation technique. Equations of motion are included 
as constraints of the subsequent non-linear constrained optimization problem, which is solved 


by using a modelling language AMPL coupled with SNOPT. 
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(CHAPTER VY 


FX}UNDAMENTALS OF [)S FLIGHT 


After the whole methodology is set up, it is time to step back and analyse the results 
obtained. This part aims at exposing those first simulation results and describing their features 
in order to build-up knowledge on DS out of those simulations. Those interpretations would 
start by simply describing the trajectory components and the evolution of variables along the 
path. The principles of DS energy extraction are somehow difficult to grasp, so a special care 
would be given to analysing those, as they are the real added value of DS flight and therefore 
questions its essence. Different trajectories would be used as case studies to support the 
analysis on energy-extraction mechanisms. The formulation of energy work and powers of 
forces would help to conceptualize the exchanges between the environment and the vehicle. 
The approach would be enriched by considering a simulation within a refined wavy 


environment. 


V.1The 3D Closed-Trajectory 


Among different potential trajectories, the closed loop is of particular interest as the 
vehicle is forced to return back to its initial position, so that the initial and the final state are 
identical. The optimization process is run to derive a closed trajectory which minimizes the 
wind strength required to fly effortlessly, that is to say without any power supplied by the 
vehicle. In the formulation of the optimization problem, periodicity constraints are imposed for 


each state variable. 


It may be possible that such a trajectory does not exist, whatever the wind friction 
velocity is, even very high. In such a case, there is no solution to the problem and DS cannot be 
considered as a viable way to loiter over a fixed location. It may require some adjustments over 
some design variables such as the mass of the vehicle for instance, or alterations of some 


environment variables, such as the surface roughness length. 


Therefore, this approach questions the feasibility for the vehicle to perform a closed 


loop within an environment of positive vertical wind shear gradient of variable strength, as 
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well as provide a support to study variation in total energy during the cycle. It is to mention 
that the energy ratio between the initial state and the final state is 1, therefore the energy 
extracted along the path exactly compensates drag losses and the vehicle can stay aloft by 


repeating the exact same trajectory without the need to provide extra power. 


In the case presented below, in Fig. V.1, the vehicle chosen is Mariner, as it was more 
representative of a mission scenario than the Wandering Albatross. Besides, a virtual payload 
has been arbitrarily added, such that the overall mass is 6.6 kg. Furthermore, contrary to the 
validation case displayed before, the lowest altitude is constrained by a wing-tip clearance limit 
of 50 cm and not net minimum altitude anymore. The load factor was chosen not to be limited 


in this case. The number of time steps is N;=171. 
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Figure V.1: Closed loop energy neutral trajectory obtained for a wing tip clearance of 50 cm, a surface 
roughness length of 3 cm and a wind friction velocity of 80 cm/s, which corresponds to a wind speed of 
11.34 m/s at 10 metres. The trajectory is completed in 20.58 seconds. The wind profile and direction is 
represented by the green arrow. The scale of the vehicle (Mariner) is augmented by a factor 1.5 for better 
visibility. 


The trajectory displayed in Fig. V.1 is the result of the optimization problem explained 
above. It is obtained for a wind friction velocity of 80 cm/s, which corresponds to a wind speed 
of 11.34 m/s at 10 metres. It represents a benchmark regarding the feasibility, for a vehicle, to 
remain effortlessly in the air over a fixed position by using DS. If the actual wind is less than 
the value obtained in this simulation, then some power must be provided by a 6.6 kg-Mariner, 


operating under conditions underlined before, so as to conserve the level of total energy over 
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one cycle. On the other hand, if the actual wind is more that the benchmark value, then the 
vehicle can enter the next cycle with some extra energy. It therefore allows a margin for 
additional manoeuvres, such as travelling in one direction, before repeating the same cycle. If 


the wind is at least equal to the benchmark value, then the endurance is virtually unlimited. 


The optimal closed trajectory consists in an 8-shaped path rather than a simple loop. It 
combines different legs which are characteristic to DS flight. An upwind climb where the 
vehicle climbs straight into the wind, wings level, is followed by a turn from upwind heading to 
downwind heading where the maximal height is reached with a belly exposed to the wind 
attitude and a high bank angle, up to 80 degrees. Then starts a downwind dive, where the 
vehicle returns to wings-level and builds up inertial speed quickly. That is followed by the last 
maneuver, a turn into the wind at almost constant height close to the surface. The two inner 
loops which compose the 8-shaped path slightly differ in amplitude but are identical in their 
construction; therefore we can limit our scope to one loop of the overall trajectory, which 
consists of the four legs mentioned above. A colour scale is applied, in Fig. V.2, to the trajectory 


obtained, to highlight components of the DS trajectory. 
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Figure V.2: Phases of flight are evidenced out using different colours. Blue for the upwind climb, magenta 
for the high turn, cyan for the downwind dive and green for the lower turn. 


Figure V.3 details the variation of different speeds along the path, in order to improve 


the understanding of the way those are associated to the four phases of flight. The wind 
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strength at the vehicle’s altitude is also pictured and gives a direct image of the height of the 


vehicle. 
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Figure V.3: Speeds along the path, correlated to the four phases of flight. 


On the upwind climb, the vehicle starts to pull-up upwind and to climb through a zone 
where the vertical wind gradient is the strongest, since it is the lower part of the path, close to 
the surface. An associated local increase in airspeed can be observed, which is however not 
sustained for long as the wind gradient is not strong enough to allow for a steady airspeed. 
Besides, the rate of decrease in inertial speed gets lower as soon as the vehicle pulls up. This 
seems somehow counter-intuitive as the negative contribution of the weight adds up to drag 
losses as soon as the flight path deflects upwards, creating an even greater negative work 
contribution to the variation in kinetic energy. Hence, it can be deduced that another force 
provides a positive contribution to overcome this influence. Given that only three forces act on 
the vehicle, it can be deduced that the lift gives a positive work contribution during the 


headwind climb. 


The vehicle then performs an upper turn from a windward heading to a leeward 
heading. The inertial speed sharply increases when the vehicle gets pulled by the wind before 
reaching its highest altitude of 29 metres. The rate of increase in inertial speed is significantly 


higher than the rate of increase in airspeed, suggesting that the lift provides a positive 
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contribution to the movement. The airspeed has a smoother variation and reaches a minimum 


at the top of the path, when the vehicle starts to head down. 


The vehicle follows with the downwind dive. The inertial speed keeps increasing for 
some time before reaching a maximum when the vehicle starts the turn against the wind. The 
airspeed sees the combined effects of gain in kinetic energy during the descent and of 
decreasing tailwind as the vehicle goes down leeward through the vertical wind shear gradient. 
The latter effect is strong enough to keep the airspeed increasing after the inertial speed starts 
to reduce. The airspeed maximum is reached when the vehicle stabilizes at a low altitude at the 


end of the dive. 


The last component of the trajectory is the lower turn, where the vehicle turns into 
the wind, from leeward to windward, close to the surface at a constant height, in a zone of 
weak wind. The inertial speed sharply decreases at a much higher rate than the airspeed, 


suggesting that the lift provides a negative contribution during the lower turn. 


Overall, the vehicle manages to perform a non-powered closed trajectory and to 
maintain its energy level between the initial and the final state. It takes advantage of a wind 
gradient that is not strong enough to sustain steady airspeed during the climb, which takes 
aback a common misbelief about DS. Besides, that energy neutral trajectory was achieved for a 
nominal wind of 11.34 m/s at a 10 metres height, which is the minimal wind strength that 


enables an energy neutral closed trajectory. 


From Albatross to Long Range UAV Flight by Dynamic Soaring 100 





t(s) 


Bank Angle(*) 





t(s) 


Lift over Drag ratio 
© 


17 | | | Il 
0 9 10 15 20 


(s) 


Figure V.4: Evolution of C,, @ and C,/Cp over time. 


It can be observed from Fig. V.4 that the bank angle reaches its maximum during the 
two higher turns, first to the right and then to the left. The two lower turns see sharp 
variations in bank angle which occur when the wing tip hits the ground clearance constraint 
limit. Three local maxima can be observed as for C;,. In the case pictured, chronologically, the 
first occur at the top of the higher turn, when the vehicles banks leeward at maximum angle. 


The second is at the beginning of the lower turn, when the vehicle pulls-up to get out of the 
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dive at maximum airspeed and start to change heading. The maximum load factor (n = 3.54) is 
reached at that moment. The third local maximum, which is also the lowest by a significant 
margin, is the pull up at the end of the lower turn, when the vehicle starts to head upwards. 


The evolution of the lift to drag ratio is directly dependent on that of the lift coefficient. 


V.2 Energy-Harvesting Mechanisms 


V.2.1 Contributions from Aerodynamic Forces 


Dynamic soaring manages to take advantage of local contributions of aerodynamic 
forces along the flight path by combining specific manoeuvres with respect to the wind field. In 
the case of static gliding flight in still air, airspeed and inertial speeds are the same and only the 
drag contributes to the variation of total energy, as the lift is orthogonal to the direction of 
motion. However, if the vehicle flies through a mass of moving air, variations in total energy, 
in the earth reference frame, will see a contribution from the lift as well. It is to be noted that 
the work done by forces varies with the point of view of the observer. From an earth-based 


point of view, Fig. V.5 shows how forces and speeds orientate during a windward climb. 
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Figure V.5: Orientation of speeds and forces for a 2D-simplified windward climb. The lift is tilted by an 
angle (y; — Yq) towards the direction of motion x; .This entails a positive working component which 
governs the energy extraction. 


It is assumed that wings are level and that airspeed, inertial speed and wind speed are 


in the same plane. The orientation of the inertial speed and of the wind speed induces that the 
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airspeed is not directed along the direction of motion, but forms an angle (7; — Yq;,) with it. 
Aerodynamic forces are therefore also tilted with respect to the frame of reference ; since the 


lift and the airspeed are orthogonal while the drag and the airspeed are collinear. 


The variation of total energy is only driven by contributions of non-conservative 
forces, that is to say only the lift and the drag. The calculation of the elementary variation in 
total energy takes into account the respective work done by those forces, as presented in Eqs. 


V.1 toV.3. The elementary displacement dl is along x}. 


AE'o¢ = AWnon conservative Forces = yy F.dl — (L a5 D) mal. i) (V.1) 
NCF 
dE ror = [L sini — Ya) — D cos(¥i — Ya)] al (V.2) 
B 
AFeocs = | IL sin(y;— Ya) — D costyi~ Ye)I-dl (vs) 
A 


Therefore, the condition to an increase in total energy is expressed in Eqs. V.4 and V.5. 


Equation V.5 is obtained through some calculations involving Eqs. IV.6 and IV.7. 
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Equation V.5 underlines how the local energy-extraction is sensible to the 
instantaneous lift to drag ratio, which should be as high as possible, and for a large range of C,, 
since the vehicle explores a full range of functioning points during the climb. Moreover, it 
shows the sensitivity of the energy-extraction to both the inertial flight path angle y; and to the 
local wind strength W,.. A steep climb into a strong wind represents a favorable case. It should 
be mentioned that for an inertial speed twice the value of the local wind and at a 30° climb 
angle, the lift to drag ratio has to be over 5.7 in order to extract energy, which is easily 


achievable. It can be concluded that any average glider can extract energy during an upwind 
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climb. The very same principles apply to any of the four phases of flight, as aerodynamic forces 
are tilted by the wind speed. Yet, it does not translate into the same conclusion regarding the 


energy-extraction, depending on the attitude of the vehicle with respect to the wind. 


The rate of variation in total energy is expressed in Eq. V.6. The contribution of the lift 
can be simplified to the scalar product between the lift and the wind, as the lift is orthogonal to 
the airspeed vector. The power due to lift is developed in Eq. V.8, where components of two 


distinct contributions can be outlined. 





on DPM = E+ DP) VX) ae) 
NCF 
L:Vi=L-‘G+W)=L-W (V7) 
> -—> 1 2 F , . 
L-W=->5 pS C, Va" W, (sing sing + cos @ cos Wq sing) (V.8 ) 


The first corresponds to the case seen before, with wings level (@ = 0). The lift works 
positively when either the vehicle is facing the wind and climbing (Wg = 0 andy, > 0), or 
when the vehicle flies leeward and the vehicle is descending (Ww, =mandy, <0 ). 
Consequently, not only the lift contributes positively to an increase in total energy when the 
vehicle is climbing into the wind, but it does also provide power to the vehicle when it is diving 


downwind. 


The other contribution corresponds to the case where the vehicle flies crosswind in 
such a way that the longitudinal axis of the vehicle is orthogonal to the wind direction, if it is 
assumed that there is no sideslip. Considering the case where the wind comes from the left of 
the vehicle (Wy, = 7/2), it can be seen that the lift works positively when the vehicle turns 
right (@ > 0), that is to say opposite to the wind. This corresponds to the “belly to the wind” 
attitude of albatrosses described by Pennycuick [41]. Conversely, if the vehicle turns left into 
the wind (¢ < 0), then the lift contributes negatively. The same applies symmetrically when 


the wind comes from the right of the vehicle. 


Those simplified calculations underline that during a closed loop, such as the one 
obtained in Fig. V.1, the lift contributes to an increase in total energy during the upwind climb, 


as well as during the higher turn and also through the downwind dive. The last phase which 
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appears during the loop is the turn into the wind, which sees conversely a negative 
contribution from the lift. It should be noted that those statements remains valid, whatever the 
wind profile W,,(z) is, even if the wind is uniform with no vertical gradients. However, the 
wind profile will determine how those respective gains and losses would weigh in the overall 


energy balance respectively to each other. 


V.2.2 Acceleration in the direction of the wind 


Another way to develop calculations leads to an interesting point of view, linking the 
power due to aerodynamic forces to the acceleration in the direction of the wind. From Eq. V.6, 


a further step of calculation gives Eqs. V.9 and V.10. 


dE > > —> —_? — —=_> —_? 

= (L+D):Y.%) =L-W+d.G+W) (V.9) 
dE a 
— = W, [(L + D).#] -—D.V% (V.10 ) 


The projection of aerodynamic forces on x, in Eq. V.10, corresponds to the acceleration 


of the vehicle along that same axis, such that it yields Eq. V.11. 


dE tot 
dt 





=mW,x-—D.V, (V.11) 


The above equation conveys that the instantaneous rate of total energy is a function of 
the acceleration of the vehicle in the direction of the wind, minus a term due to drag that is 
invariably a loss. Indeed, it should be noted that in Eg. V.10, a component of the drag may 
provide a positive power. Although quite unintuitive, it comes from the fact that the drag, 
which is always opposed to the airspeed vector, can actually have a component in the inertial 
direction of motion, which is in such a case providing positive work. It leaves D .V, as the only 
term invariably negative, therefore corresponding to an unavoidable loss. It should be 
mentioned that, as any instantaneous calculation, Eq. V.11 is not specific to DS, but rather 
applicable to any wind energy extraction technique. Albeit rather interesting from the 
understanding point of view, Eq. V.11 is also convenient as the instantaneous power extracted 
from the wind can be maximized in real time by monitoring the acceleration along the wind 


direction. 
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Equation V.11 can be further developed and combined with air relative EoM in order 
to evidence exact differentials. After some calculation steps, the expression the total energy can 


be integrated out of the power expression, it gives Eq.V.12. 


1 1 
Etot = 5mW," + MWicVg COS Yq + 5MVa" — mgz (v.12) 


It provides an air-based alternative to the calculation of the total energy. 


V.2.3 Overall cycle of total Energy 


If local manoeuvres can be performed in order to extract energy through the 
mechanisms that have been specified, a closed trajectory must however combine a serie of 
those manoeuvres such that the vehicle gets back to its initial position. The set of successive 
manoeuvres is a delicate solution of an optimization problem that compromises the objective 
function in order to satisfy periodicity constraints. For the closed trajectory presented in Fig. 
V.1, the evolution of total energy with time is depicted in Fig. V.6 , together with the source of 
this evolution as it superposes the variation of aerodynamic power input with time. The power 
input displayed on the lower part of the figure can be seen as the source of the variation in 


total energy depicted on the upper chart. 
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Figure V.6: Variation in total energy with time and in the associated rate of energy with time 


As it was anticipated, the lift contributes positively on the upwind climb. The overall 
contribution is also positive and some energy is therefore extracted from the wind during the 
climb. The biggest increase in total energy comes from the higher turn. The drag contribution 
is negligible as the airspeed gets quite low and the lift contribution sharply increases from 2.3 
seconds, when the vehicle gets “belly to the wind” and is being pulled by the wind. The vehicle 
still manages to gain some energy during the first part of the downwind dive but the drag 
quickly builds up as the airspeed increases and it can be seen from Fig. V.6 that the lift 
contribution turns negative as soon as the vehicle starts to bank into the wind. Finally, the 
lower turn is flown at a high airspeed while turning into the wind. The negative contribution 
brought by the lift is clearly noticeable from Fig. V.6 and it adds to significant drag losses to 
produce the highest loss in total energy along the path. 


Overall, it can be seen from Figs. V.6 and V.7 that the energy gained on the three 


segments upwind climb, higher turn and downwind dive is lost along the lower turn. Even 
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though the wind faced at that height is at its lowest value, the lower turn is the major hurdle to 


overcome in order to complete the closed loop while maintaining an objective energy ratio of 1. 
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Figure V.7: Overall energy-harvesting strategy. The energy neutral closed loop trajectory from Fig. IV.1 is 
coloured following a red to green scale as a function of the local power input. Total energy is extracted at 
green-coloured positions and is lost at red-coloured positions. 


It is apparent in Figs. V.6 and V.7 that the loss in total energy occurs during a rather 


limited amount of time and on a small spatial portion of the trajectory, but represents both the 


highest power output and the highest energy variation of any other segment of the trajectory . 


It can be concluded that parameters which may govern the energy loss during the lower turn 


should have a prevailing influence on the overall balance of total energy. 


Another way to breakdown the energy extraction is to consider Eq. V.11, where the 


terms would be called Leeward Acceleration for m W,. X and Air relative Losses for —D .V,. 
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Figure V.8: Variation of power input contributions, with respect to time. 


Figure V.8 globally shows the same evolution as Fig. V.6, with minor disparities, which 
tend to show that the positive contribution from the drag is somehow negligible. It confirms 
that the strongest contribution from acceleration terms comes during the lower turn , when the 


vehicle deflects its path towards the wind. 
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Figure V.9: Variation in power input contributions from the drag, with respect to time. 


Figure V.9 details contributions from power which are broke down into two parts. The 
first, in green is the contribution of the drag to the acceleration in the direction of the wind. 
The second, in red, is the parasitic and unavoidable loss due to the air relative motion of the 
vehicle. Even though, in the present case, the overall contribution from the drag is always 
negative, it can be seen that, for most of the trajectory, the drag contributes to the acceleration 

in the direction of the wind. In theory, nothing hamper the overall drag to be positive, even 


though it requires the wind speed to be higher than the airspeed. It can occur if the vehicle is 
flying backwards, kept aloft by a strong headwind. 


V.2.4 Exploiting Wind Power within the Wind Gradient 


As it was underlined above, the main energy input comes from the upper part of the 


curve, while the biggest losses occur in the lower part of it. It may seem ironical, as DS is 
sometimes referred to as gradient soaring that energy is actually gained where the gradient is 
the lowest and lost where the gradient is the strongest, which is particularly emphasized by Fig. 


V.10. Once again, the energy is gained out of the wind underlying power and the gradient helps 


to combine gains with limited losses in order to achieve a neutral energy performance. 
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Figure V.10: 2-D side view of the closed trajectory. The length scale is identical between the two axes and 
the wind speed scale is adapted for good visualisation. The colour scale is identical to Fig. V.7. 


The section view offered by Fig. V.10 displays the asymmetry that was mentioned for 
the two loops of the closed trajectory. Moreover, it appears that during the upwind climb, the 
inertial flight path angle increase progressively to reach values around 60°during the higher 


turn, which is achieved in a strong inclined plane. Inversely, the lower turn is carried out in an 


almost horizontal plane, as close as possible to the surface. 
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Figure V.11: Top view of one of the two loops, with remarkable points encircled. 
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The main interest of Fig. V.11 is to underline that the maximum power input does not 
occur at the highest point of the trajectory. Although the highest altitude provides the highest 
wind strength, the vehicle extracts energy at a higher rate moments later, when it carries on 
the turn to orientate the lift in the direction of the wind. This latter configuration, dubbed 
“belly to the wind” and detailed earlier, corresponds to the vehicle flying with its longitudinal 
axis perpendicular to the wind (jw, = 90°), which is visible in Fig. V.11, and banked to the 
maximum in the direction of the wind. The acceleration in the direction of the wind is then at 
its maximum. A more precise evolution of the variable at stake is displayed in Fig. V.12. The 
reason this configuration is not achieved at the highest point of the path is due to the 
periodicity constraint of the closed trajectory, which imposes that the vehicle recovers the 
ground it loses in the direction of the wind during the energy-extraction. Hence, the highest 


altitude is reached at the northbound extreme of the path. 
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Figure V.12: Evolution of —z ,@ and y,;, during the first half of the cycle plotted in Fig. V.1. The moment 
of maximum power input is highlighted in dashed orange lines. 


V.3 Opening the Loop 


In the case of the open loop, two constraints are relaxed, that is the east and north 
position. Other than those, the problem and its parameters remain the same as before. Because 
the optimization problem is less constrained, it enables the vehicle to explore a wider space of 


solutions and potentially to reach a refined objective function. 
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Figure V.13: Energy-neutral optimized open loop obtained for a 6.6 kg Mariner. The wind friction velocity 
obtained is 64.5 cm.s’, which corresponds to wind strength of 9.14 m.s’ at a height of 10 metres. The 
cycle takes 9.25 seconds. The wind profile and direction is represented by the green arrow. The path is 
coloured following a red to green scale as a function of the local power input. Total energy is extracted at 
green-coloured positions and is lost at red-coloured positions. The scale of the vehicle is augmented by a 


factor 1.5 for better visibility. 


The minimum wind friction velocity required to perform DS flight along an open path 
is 64.5 cm.s’. It corresponds to approximately a 20 % reduction compared to the open loop. The 
corresponding wind strength at 10 metres is 9.14 m.s’. The vehicle manages to find a solution 
that requires significantly lower wind strength than for the closed loop. Yet, the performance is 
accomplished at the cost of a displacement penalty in the direction of the wind. Over one cycle, 


pictured in Fig. V.13, the vehicle is pulled by 78.5 metres in the direction of the wind. 
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Figure V.14: Evolution of the inertial speed, the wind speed and the airspeed with time. 


Compared to the closed loop, the amplitude of the inertial speed is shallower, while 
the airspeed gets to lower values. The net travelling speed is 14.82 m/s, which would 


theoretically enable the vehicle to cover 427 kilometres within 8 hours. 
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Figure V.15: Variation in total energy along the path, with the associated evolution of power 
contributions over time. 


Although the energy-harvesting mechanisms as well as their combination along the 
path are very much identical to the closed-loop case, the overall power input shows a local 
minimum around 4.5 seconds into the cycle. It corresponds to the point of highest altitude and 
can be linked to the minimum in airspeed reached at the same instant, see Fig. V.14. Indeed, the 
aerodynamic forces, which govern the energy-extraction are limited in intensity at that point of 
the path. The effect is more pronounced than for the closed-loop case because the vehicle starts 
the higher turn well before it reaches its highest altitude, see Fig. V.16, such that the energy- 


extraction is already significant before that point. 
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Figure V.16: Evolution of —z ,f and w,;, during the cycle plotted in Fig. V.13. The point of maximum 
altitude is highlighted in dashed orange lines. 


Contrary to the closed loop, the configuration “belly to the wind” occurs, more or less, 
at the point of maximum altitude. At 4.47 seconds into the cycle, the altitude reaches 30.13 
metres, the bank angle hits 72° to the left (for a maximum of 72.5‘) and the air relative heading 
is 83.3", the wind coming almost perpendicularly to the right hand side of the vehicle. This 
analysis is of particular interest as it makes the higher turn very much similar, from the point 
of view of aerodynamic variables, to a simple turn within an inclined plane, in still air. This 


may prove a useful piece of information for the sake of DS control. 
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Figure V.17: Top and zoomed-in view of the open loop displayed in Fig. V.13, where the specific point of 
maximum altitude is encircled in orange. 


The point of maximum altitude is not anymore at the northern extreme, but further 
down along the path, as displayed by Fig. V.17. It illustrates that, for the open loop, the higher 


turn is anticipated significantly before the vehicle reaches its maximum height. 


V.4 Albatross Flight in the Ocean Boundary Layer 


First simulations have established that the classical theory of DS could be applied for 
albatrosses (see Validation Case). Yet, it has been underlined that further refinements from the 
wind model could be rather significant as for their impact on DS flight simulation. Besides, 
several sources report the use of waves by albatrosses [41, 42] out of observation or 
conceptualization of albatross flight. This part aims at assessing the way a refined wind model 


impact DS trajectories, as well as providing a step forward in albatross flight simulation. 
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All along this part, the vehicle model is that of the Wandering Albatross (see Vehicle 


Model) and the wind model refers to terms explained earlier (see Wind Model). 


V.4.1 Realistic Roughness 


A first case was run without wave, but with a surface roughness length correlated to 
wind conditions through Charnock’s model. Other than that, the optimization problem is 
almost identical to that of the validation case, the exception being about the ground clearance 
constraint, which is evaluated at the wing tip and limited to 5 cm. The solution gives a wind 
friction velocity 0.65 m/s, with a corresponding roughness length Zp of 0.48 mm. The associated 
reference wind speed, at 10 metres is 15.8 m/s. Those values strongly differ from the validation 
case, mainly because the roughness length is calculated using Charnock’s model and not fixed 
at 3 cm anymore. It can be argued that the roughness length could be higher than 0.48 mm, as a 
rather flat surface of a non-developed sea under recent winds has a higher roughness than old 


Waves. 


The wind strength required to maintain the Wandering Albatross in flight is quite 
higher when a supposedly realistic roughness length is taken into account. The literature 
mentions minimum wind speeds around 7-9 m/s rather than 15.8 m/s. This confirms the need 
for further improvements in the simulation in order to fully model, if necessary, the flight of 


albatrosses. 


V.4.2 Influence of Waves 


In order to take waves and the induced wind field into account, the amplitude a of 
waves was increased manually and given as an input fixed-parameter to the solver. It then 
worked towards minimizing u,, within an environment with travelling waves of amplitude a 
and angular frequency w,(u,), and surface roughness length Z)(u,).The amplitude a was then 
compared with the theoretical amplitude of the peak wave, see Eq. III.29, and adjusted by small 
increases. The process was eventually repeated until the two amplitudes were identical, hence 


simulating fully-developed waves under winds blowing at u,. 


The methodology converged to obtain the trajectory displayed in Fig. V.18. It is to be 
mentioned that a further periodicity standard was introduced with respect to the wave motion, 
with same aim to get a cycle identically repeatable. For the case displayed in Fig. V.18, the 
vehicle arbitrarily starts (t=0 s) and stops (t=6.70 s) above the crest of the wave, which is 


materialized by the yellow line. 
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Figure V.18: Evolution of the vehicle above a wave. u+= 0.57 m/s; a= 2.47m; A= 193 m; c= 17.5 m/s. 


It can be observed that the vehicle gains ground towards the wave during the 
downwind dive (t=2.61 s). Hence it begins the lower turn close to the surface (t=3.95 s) ahead of 
the crest, on the forward face of the moving wave, which is the leeward side from a wind point 


of view. Then, the same type of upwind climb as for the open loop is achieved. 


Although the net speed in the y-direction is imposed to be the same as the wave 
celerity c, the vehicle is “free” to manoeuvre between initial and final positions above the crest. 
Besides, it is observed during the convergence iteration process, when the amplitude is slightly 
increased step by step, that those changes are affecting both the trajectory and the minimal 
friction velocity required. The friction velocity varies from 65 cm/s without waves to 57 cm/s 
with waves. The vehicle is therefore getting benefits from the presence of the wave, by flying 
on the forward side which sees a local wind updraft, see Fig. III.15, during the lower turn close 
to the surface. Although this contribution is not sufficient to balance drag losses and therefore 
to gain energy, it can be compared as a superposition of wind-gradient soaring with slope 
soaring, where the slope would be a travelling wave. This result is in accordance with 
observations of birds exploiting waves by flying on the rising side or face [94], and avoiding the 
back side. Cases are even reported of birds exploiting a swell wave, without the presence of 
wind, the uplift from the wave being sufficient to propel the bird. If a higher ratio a/A is met, 


that would increase the latter effect, by providing a stronger slope on the wave side. 


Results from Fig. V.19 are obtained by displacing the initial/final position of the 
vehicle forward, relative to the wave, rather than at the crest. It allows the vehicle to spend a 


greater fraction of the trajectory in the location of local wind updraft. It shows that both the 
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required wind strength and the wind-related amplitude are lowered. Besides, as losses during 
the lower turn are decreased, the vehicle does not need to climb as much as before, and the 
maximal height reduces from 20 metres down to 15 metres and lower, in better accordance with 


on-the-field observations of albatrosses [41]. 
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Figure V.19: From top to bottom, change in periodicity with respect to the wave. 


{a (m); u,(m/s)} = {2.46; 0.57}; {2.25; 0.53}; {2.05; 0.51} 


Even though the required wind strength is lowered to about 12.8 m/s for the bottom 
case of Fig. V.19, this is still rather high compared to what can be expected, such that those 
albatross flight simulations still lack realism somehow. A combination of rougher wave surface 
and deeper waves is probably the link to that ideal. In particular, waves of smaller scale, but of 
slope stronger than the swell wave taken into account in the framework of this study, could be 
significant for albatross flight. Further aerodynamic effects, like flow separation on the leeward 
side of the wave, have been suggested to be exploited by albatrosses [41]. Such induced energy- 
harvesting strategies are quite far down the path of UAV capabilities and have not been 


considered in the framework of this thesis. Besides, ground effect could also have a beneficial 


From Albatross to Long Range UAV Flight by Dynamic Soaring 122 


influence energy-wise, since it would improve the gliding performance of the vehicle when it is 


close to the surface. 


To sum-up, environment model refinements were integrated into the DS simulation 
model and their influence has been analysed. Although the presence of wave was investigated 
for the sake of general understanding, exploiting those is beyond the current capabilities of 
UAVs. However, it is useful to have a model of the ocean roughness length at disposal and 
quite fundamental to keep in mind that sea-surface roughness lengths are well below the 


centimetre, which may hamper the feasibility to exploit DS over oceans. 
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V.5Summary of Chapter V 





The basic characteristics of DS flight are explicated by using a specific closed- 
trajectory as a support. It is laid out that every trajectory can be decomposed into four phases, 
although their amplitudes may vary, depending on the periodicity constraints that apply to the 
trajectory. From an earth point of view, energy is gained by the vehicle in all phases of flight, 


but the lower turn into the wind, close to the surface. 


The way energy transfers occur is detailed and the prevailing role of the lift is 
explained graphically and analysed analytically. It is responsible for the biggest increase in 
energy during the higher turn but is also accountable for the significant loss during the lower 
turn. By working on the expression of the power from aerodynamic forces, it is laid out that 
the drag can have a component that provides positive work, which is part of a wider resultant 


force oriented in the wind direction. 


Although the overall pattern varies with the trajectory, depending on periodicity 
constraints, it appears that the vehicle attitude towards the air remains roughly unchanged, in 
particular the attitude “belly to the wind” at the top of the curve, which has been described in 
the flight of albatrosses. 


Finally, the simulation of DS is enriched by a refined roughness length model and 
then by the presence of waves, where the wave amplitude is increased step by step until it 
corresponds to the prediction of the model. A particularity concerns periodicity constraints, 
which are established with respect to the wave and not to earth anymore. The roughness 
length predicted by the model considered is significantly lower than what is used in the DS 
literature. The required wind strength obtained for the Wandering Albatross is rather high 
compared to values that could be expected from biology input, even though the constraint of 
ground clearance is set to the minimum. The wave allows for DS flight at lower winds, since 
the vehicle takes advantage of the updraft created on the forward moving face, during its 
lower turn at the vicinity of the rising wave surface. By displacing the initial position of the 
vehicle, relative to the wave, from the crest to further down the wave slope, the vehicle 
benefits from a higher updraft and this lowers again the wind strength required. Yet, the wind 
strength obtained is still rather above values measured on the field, which suggests that 


albatrosses exploit mechanisms that are not represented by the current model. 
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CHAPTER VY | 


NON-[DDIMENSIONALIZATION OF [)S FLIGHT 


The chapter aims at presenting a different approach to gain understanding about DS 
principles, which is more mathematical and equation-focused. The understanding about DS laid 
out so far is based on simulations where the analysis of results is rather distant from that of 
EoM. This can be explained by the fact that multiple solutions exist and that therefore EoM are 
not sufficient to define the problem and its solution. Yet, the analysis of equations could permit 
to disclose some remarkable properties about DS. Besides, simulations obtained so far are 
dependent on the metric used for each variable, such that it is difficult to link ties between one 
simulation and another. There is hence an interest in considering equations and analysing 
simulations through a new system of dimensionless variables. Air relative equations are more 
adequate for this approach as the airspeed is directly a state variable. The associate set of EoM 
is dimensionless using specific reference variables. Simulations are then run, by using the 
unchanged methodology described earlier to support the theoretical analysis of a dimensionless 
set of different equations. Results validate both the non-dimensionalization and the simulation 


methodology, if necessary, as well as highlight interesting features about DS. 


VI.1 Linear Wind Profile 


VI.1.1 Theory 


A first objective is to find an adequate non-dimensionalization of EoM so as to 
highlight some dimensionless coefficients formed from environment and vehicle parameters. 
Eventually, such coefficients would govern dimensionless solutions of EoM as well as DS 


performance, hence shedding a new light on the understanding of DS. 


EoM are dimensionless by introducing some reference parameters, which could very 
well be arbitrary chosen, but would rather be selected such that equations can be simplified by 
gathering parameters into dimensionless coefficients. For the sake of simplicity, a linear wind 


gradient profile will be selected first, where 8 represents the wind gradient (s”), as introduced 
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in Eq. VI.1. It is to be reminded that in the present case, the wind is coming from x, f is positive 


and z is oriented downwards. 


W, = Bz (V1.1) 


In the particular case of a linear wind profile, both the local mean wind speed and the 
local wind gradient are determined by f. From the set of air relative EoM, given in Eqs. IV.35 to 


IV.40, the following non-dimensionalization can be defined [49]. 


_ £ 
Ya = Na ( VI.2 ) 

2 
(X,Y,Z) = rs (x,y, Z) (VL3 ) 
t=ft (VL4) 
oy (VLS5 ) 


For any given variable X, the accentuation X refers to the associated dimensionless 


variable, while X, refers to the associated reference value, such that: 


_ xX 
x= x. (VL6 ) 
Besides, the notation X’refers to the dimensionless-time derivative of variable X. 
, ax dX 
xX = de to ap (V1.7) 


EoM are thus dimensionless as presented in Eq. V1.8. 


Va =—p VaCp(C,) — SINYVq + Va sin Yq COS Yq COS Wa (VL8 ) 
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, _=—. sing 
Wa = PVaC, i. tan Yq SinWa (V1.9) 
a COS Yq 5 
Va = PVC, cos bd — 7 — COS Wg Sin’ Wa (V1.10 ) 
a 
X' = V, COS Yq COS Wa + Z (V1L.11) 
y= Va COS Yq sin Wq ( VI.12 ) 
zZ’ =—-V, sinVq (V1.13 ) 


The dimensionless parameter /p is defined in Eq. VI. 14 


p= pg V1L.14 
=f, “Tm > | 

2 (mg/S) B? —— 
It is to be mentioned that before the non-dimensionalization, control variables are 


already represented by dimensionless values, which are the lift coefficient C; and the bank 


angle @. 


Air relative standard EoM and reduced Eqs. VI.8 to VI.13 represent the same physics, 
but observed with different norms and therefore using differently weighted variables. It is to be 
noted that those equations are not specific to DS flight, but simply describe any in-flight 
evolution of a vehicle, within a linear wind profile. In Eqs. VI.8 to VI.13, the dimensionless 
parameter p is amongst the only parameters, together with Cpg, Cp1, Cp2, Cp3, Cp4 to influence 


equations. 


From Eqs. V1.8 to VI.13, it can be learnt that if two vehicles are in similar state in terms 
of dimensionless variables, at a given instant, if the p parameter is identical between the two 
systems and if control variables follow the same evolution over dimensionless time, then the 


two system would evolve from that instant with the same dimensionless states. 


Two sets of equations would be called equivalents if they can be dimensionless into 
the same form. Comparisons between coefficients, from the dimensionless equations, then 


serve as a standard to assert if the sets are equivalent or not. In the above case, two sets of air 
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relative EoM would be equivalent if, and only if, the two respective p parameters in Eqs. VI.8 to 
V1.13 are equals (assuming Cpo, Cp1, Cp2, Cp3, Cp4 are identical). In this regard, if m, p, S and B 


vary such that p is constant, then equations would be equivalent. 


Besides, two solutions would be called equivalent if the respective dimensionless state 
variables follow identical sequences over dimensionless time. In the scope of the present work, 
solutions are not entirely determined by EoM, but also show a dependency on control variables. 
Consequently, two equivalent sets of EoM don’t necessarily lead to two equivalent solutions 
since the evolution of control variables could differ between the two. Further conditions are 


required in order to yield equivalent solutions. 


VI.1.2 Dimensionless Optimization Problem 


Essentially, a non-dimensionalization process focuses on rewriting equations in order 
to highlight similarities between solutions without solving equations. Because in the present 
case solutions are not entirely determined by equations, broader aspects of the problem must be 


considered as part of the non-dimensionalization. 


Solutions consist in sets of control variables and correlated state variables. Those are 
determined by solving an optimization problem, which is formed by an objective function and a 
set of constraints, including EoM, dimensionless or not. State variables are correlated to control 
variables through EoM, control variables are piloted in order to satisfy best the objective 


function and both state and control variables abide by further constraints. 


In order to draw similarities between solutions, the focus should not be only on 
invariances among dimensionless coefficients of dimensionless EoM, but also within 
constraints and objective, as seen by dimensionless variables. For instance, the ground 


clearance is simplified to a minimum altitude, through the constraint expressed in Eq. VI.15. 


ZS Zmax (VIL.15 ) 


By introducing the reference length Z,, the constraint is tweaked to Eq. VI.16. 


Zmax 
Zc 





= (VL.16 ) 
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If the minimum altitude —Z,,,, is kept constant, the above constraint on the 
dimensionless height Z is different with every varying reference length Z,, leading to a different 
optimization problem and therefore to different solutions. Hence, for dimensionless solutions to 
be identical, the minimal altitude must be adapted such that it remains the same proportion of 


the reference length. 


If two optimization problems show identical dimensionless constraints, including EoM, 
and objective, then they will trivially lead to equivalent solutions. In the scope of the present 
part, the equations that are passed to the solver are earth-relative EoM, described in Eqs. IV.23 
to IV.28. Even though this set of equations is practically used to yield solutions, its parameters 
and constraints would be carefully adjusted so that the underlying optimization problem would 


be equivalent as if it was expressed through Eqs. V1.8 to VI.13. 


V1.1.3 Simulation with Identical Wing Loading 


The non-dimensionalization introduced above is tested using a linear wind profile 
[49], for different objective functions. The vehicle chosen for this part is Mariner, described in 
Table Ill-1, but different values of mass and wing area are applied. The aspect ratio is left 
unchanged and it is assumed, for the sake of this study, that aerodynamic coefficients 
Coo Cp1,Cp2,Cp3,Cp4 are unaltered by changes in mass and wing area. Besides, none of the 
associated rate constraints is applied in the following simulations, in order not to burden the 


optimization problem with varying dimensionless constraints. 


First, the objective is to minimize the wind gradient (6, which in this case is equivalent 
to minimizing the wind strength at a given height. The vehicle mass and wing area were varied 
by keeping an arbitrary constant wing loading of 141.6 N/m’, corresponding to a mass of 7 kg 


for Mariner. 


From Albatross to Long Range UAV Flight by Dynamic Soaring 130 


100 





-80 : 
“asp ) 40° 


Figure VI.1: Optimal open-loop trajectories, minimizing the required wind gradient f, obtained for four 
vehicle mass, at the same wing loading. For all cases, the wing loading is 141.6 N/m’, the wind gradient is 
B= 0.1076 s' and p= 35.93. 


Identical wind gradient P of 0.1076 s’ as well as identical solutions are obtained and it 
can be seen from Fig. VI.1 that all curves are superimposed. A remarkable result is that the 
corresponding wind strength at 10 metres, or more generally the required variation in 
horizontal wind strength over 10 metres height, is only 1.08 m.s’. Such conditions of wind 


shear do not seem very demanding and may widen the scope of potential DS application. 


In the present case, ( was set free for the solver to run on it which is tantamount to 
letting the solver choose p and alter the set of EoM it will also derive solutions from. 
Unsurprisingly, in accordance with what could be expected out of dimensionless Eqs. VI.8 to 
V1.13, the solver came out with the same value of p, and therefore in wind gradient f, since all 
wing loadings are equivalent. What can be concluded out of this first simulation is that, in the 
case of a linear wind profile, the wing loading is a parameter of equivalence, which means that 
if two vehicles, with the same aerodynamic polar, have identical wing loadings, then they 


would perform identically by DS. Equations and simulations predict this result in accordance. 
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VI.1.4 Simulation with Various Wing Loadings 


Different wing loadings are subsequently tested with the same objective function as 
before, which is to minimize the wind strength required to fly by DS. In results to follow, the 
vehicle has a fixed wing area S of 0.485 m° and the vehicle mass is changed, hence 
corresponding to a realistic operational case where the same vehicle planform is used to carry 


different loads. Solutions are expected to be different this time. 
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Figure VI.2: Optimal open-loop trajectories, minimizing the required wind gradient /, obtained for four 
vehicle mass, at the same wing area. For all cases, p = 35.93 +/- 0.02. 


It appears from Fig. VI.2 that flight paths are regularly spaced such that they seem to 
be results of a proportional transformation. The light blue curve corresponds to trajectories 
obtained before with constant wing loading, presented in Fig. VI.1. The higher mass goes with a 


bigger amplitude of flight. Further data are summed up in Table VI-1. 
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Table VI-1: Parameters about solutions displayed in Fig. VI.2. 
mg/S 7 _ —Z 

Mass (N/m? ) B(s”) Pp (m)- —Zmax Cf (s) 

6 kg 121.36 0.1163 35.903 9.521 0.0131 14.55 

7 kg 141.59 0.1076 35.929 11.384 0.0134 15.69 

8 kg 161.81 0.1007 35.938 12.843 0.0133 16.77 
9kg 182.04 0.0949 35.945 14.265 0.0129 17.79 


As expected, the wind gradient 6 was different for each wing loading and is actually 


getting higher for low wing loadings. Hence, for the same vehicle planform, although the 


energy required to maintain a heavier load in the air is higher by definition, the wind strength 


required to fly it by DS is actually lower. This should not conceal that the heavier vehicle still 


climbs higher to benefit from stronger winds. High wing loadings therefore enable the vehicle 


to take advantage of less favourable wind conditions. 


Another important consideration is that the solver comes up with values of 6 between 


the different cases, which entail identical values for 9. The maximum relative disparity for p is 


of the order of 0.1 %. Small variations like this can be explained by the use of a constant number 


of time discretization nodes N;=71, while the total time of cycle is increasing from 14.55 seconds 


to 17.79 seconds, hence inducing varying time step length and different integration error. 
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Figure VI.3: Three dimensional curves, plotted in dimensionless coordinates, obtained by minimizing the 
required wind gradient f/, for four vehicle mass, at the same wing area. For all cases, Pp = 35.93+/- 0.02 


Consequently, for a given vehicle with varying wing loading, the optimization selects 
Pp such that it sets equivalent dimensionless EoM for the vehicle. Indeed, Eqs. VI.8 to VI.13 are 
identical between cases run here, leading to similar dimensionless solutions, as plotted in Fig. 
V1.3. The overall parameter which is ultimately modified by the solver within that process is p, 
whom we know from the non-dimensionalization analysis that it is the only parameter that 
defines dimensionless EoM. In this regard, p can be considered as a benchmark of DS 
performance. The value of p obtained actually represents the maximal value that allows the 
vehicle to fly effortlessly by taking advantage of the wind. For lower values of p, conditions are 
more favourable and allow extra energy-harvesting. For higher values of p, conditions become 
adverse and some extra energy is required from the vehicle to complete the cycle. The critical 
value of this parameter is actually a function of the only other varying parameters of 
dimensionless EoM, which are the drag coefficients Cpo,Cp1,Cp2,Cp3,Cp,4 that defines the 
aerodynamic polar. A higher value of 0 would be obtained with a vehicle that drags less over 


the range of C,, travelled along the path, hence allowing DS flight in more adverse conditions. 
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It can be seen from Figs. VI.3 and VI.4, that the dimensionless variables match in such 
a way that the three dimensional curve of dimensionless coordinates are well superimposed. 
The order of magnitude of the maximum relative disparity is below 5% and it can also be 
attributed to significant variations in time step length. It can be fairly concluded that solutions 


are all equivalent. 
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Figure VI.4: Evolution of control variables over dimensionless time, obtained by minimizing the required 
wind gradient f, for four vehicle mass, at the same wing area. For all cases, p = 35.93 +/- 0.02. 


One can observe from Table VI-1 that even though the lowest curve is reached by the 
lower wing loading case, its minimum altitude is still 95 metres above ground, while the 
ground clearance was much less restrictive and constrained in this case to a fixed minimum 
altitude of 1.7 metres, for all cases. Consequently, even though the optimization problem 
includes non-equivalent constraints as seen from dimensionless variables, variables do not 
reach the limit value and solutions are still equivalent to each other. It shows that non- 
equivalent constraints do not matter on the equivalence of solutions if those are not reached 


during the optimization. 
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Figure VI.4 shows a good match between control variables. The flattened part of the 
C;, curve corresponds to the lift coefficient reaching the maximum as stated in the “no stall” 
constraint. This constraint is equivalent to every dimensionless optimization problem since the 


coefficient of lift is dimensionless. 


Consequently, it can first be learnt that, in the case of a linear wind profile, increasing 
the wing loading reduces the required wind gradient f. Besides, the evolution of 6 with the 
wing loading is such that the dimensionless parameter 9 remains the same. It can be 
interpreted as a benchmark of DS performance, depending only on the aerodynamic 
performances of the vehicle. Higher values of p can be associated with adverse conditions, 
more challenging for a vehicle to exploit. Therefore, refined aerodynamics for the vehicle 
would entail a higher value of p found by the solver for the problem of minimizing the wind 
gradient required. Besides, despite variations in /, the equivalence between dimensionless sets 
of EoM, representative of the different wing loading cases, provides equivalent sets of 
dimensionless solutions. It shows that, in the case of a linear wind profile, variations in the 
wind gradient required to sustain different wind loadings can be predicted by the theory. 
Furthermore, the non-dimensionalization approach is successful in the way that it highlights a 


dimensionless parameter which characterizes the equivalence between DS cases. 


VI.2 Logarithmic Wind Profile 


VI.2.1 Non-dimensionalization of EoM 


The EoM governing the vehicle evolution within a logarithmic wind profile depart 
from the linear wind profile as the local wind gradient becomes a function of altitude and the 
local wind strength a function of roughness length. For a logarithmic profile, the local increase 


in horizontal wind with altitude, or local wind gradient is expressed is reminded in Eq. VI.17. 


OW, u, 1 


Az — ( VI1.17) 





As it was underlined in Chapter II], an important remark is that the local wind 
gradient is independent from the roughness length. Air relative EoM are developed in Eas. 


V1.18 to VI.23. 
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1 U, 
mV, = — 5 PSCpVa" — mg sinvg —m 5 7 Sin Ya COS Vq COS Wa (VL.18 ) 
. 1 a Us Va, . 
MV, COS Vq Wa = 3 Stila sing + ge sin Wa (VL19 ) 
1 ; Uy Va = 

—MViVq = — 5 PSliVa cos Pp +Mg COS Yq — re COS Wg SiN“ Vq (VL.20 ) 

, Uy —Z 
x = V, COS Vg COS Wa — ce (=) (V1.21) 
y =Vacos Yq sin Wq ( VI.22 ) 
Z=—Vasinyg (VL23 ) 


A significant departure from the linear wind profile equations is that the airspeed 
evolution becomes sensible to the altitude, as seen in Eqg.VI1.18. It can be physically linked to the 
consideration that the airspeed increase due to the wind gradient is not uniform anymore, but 
depends indeed on the altitude. Besides, the altitude also alters to ability of the vehicle to 
deviate both its air relative flight path and heading. An important note is that none of the air 
relative variable is directly affected by the surface roughness length, such that the air relative 
kinematics is therefore unrelated to the surface state. Only the absolute horizontal earth 
coordinate directed along the wind direction sees a dependence on the roughness length, in Eq. 
V1.21, since the vehicle is carried along the wind direction at a speed that depends on surface 


interactions. 


The principle of establishing the non-dimensionalization based on the wind gradient 
proved successful for the linear wind profile. Therefore, a characteristic wind gradient of the 
wind profile would be sought to serve as an equivalent reference for the non- 
dimensionalization. As the local wind gradient depends on altitude, any given characteristic 
gradient could be associated with a local wind gradient at a certain height. The characteristic 
gradient f, is defined to be the local gradient at an altitude corresponding to the reference 


height Z,, as in Eq. VI1.24. 
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Be= > ( VI.24 ) 


Further properties about reference parameters are the same as the linear wind profile 


non-dimensionalization. Those are summed up in Eqs. VI.25 to VI.27. 


— 1 

gJ 

Vo = — 
C B. ( VI.26 ) 
Zo = Vote ( VI.27 ) 








GX 
bo = u, ( V1.28 ) 
i 
coy (VL29 ) 
u, 
y= — ( VL.30 ) 
Y 
1 /u,\? 
Ze = - (=) ( VI.31 ) 
GX 


It entails the following non-dimensionalization of air relative EoM, described in Eqs. VI.32 to 


V1.37. 





= = V, 
V, = —pV,Cp(C,) — sinYg - = sin Va COS Yq COS Wa ( V1.32 ) 
, _—, sing 1 
Wa PEE pean geen ( V1.33 ) 
Ree COSYq 1 2 
Ya =P Vac os Ces ast Wa (VL34) 


a 
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—) i —Z Z¢ 
x" = Vy COS Yq COS Pq — log ( ( VL35) 
0 
y’ — Va COS Yq sin Wq ( VI.36 ) 
z'=—V,sinYq ( VL37) 


The expression of p includes the expression of the characteristic gradient. 


pg’ 


"2 (ng/S) Be? 


( VL38 ) 


Sets of Eqs. VI.32 to VI.34 would be equivalent if parameters p are equal. Sets of Eqs. 
V1.36 and VI.37 would be equivalent in any case. However, for two equations Eq. VI.35 to be 


equivalent to each other, it requires the condition expressed in Eq. VI.39 to be fulfilled. 


a constant (VL39) 
Cc 


If the objective function of the optimization problem is to minimize the wind friction 
velocity u, , whatever the net azimuth direction is, then the evolution of x does not come into 
play regarding any constraint nor the objective. The relation in Eq. VI.39 can therefore be 
ignored, the wind friction velocity obtained would be independent from the surface roughness 
length, albeit the local wind speed would depend on it. Consequently, all dimensionless 
variables but x would be comparable. Inversely, if the optimization problem refers to an 
objective or a constraint involving the variable x, then Eq. VI.35 influences the solution that the 
solver comes out with. Hence, Eq. VI.35 must be adapted, by validating the condition expressed 
in Eq. VI.39, such that the whole set of Eqs. VI.32 to VI.37 could lead to invariance in 


dimensionless solutions. 


The same remark extents to the other constraints that would involve non 
dimensionless parameters, in particular the ground clearance constraint. Indeed, it has been 
observed that the vehicle tends to fly at the lowest possible altitude during the turn windward, 
in the case of logarithmic wind profiles. So on top of Eq. VI.39, another relation in Eq. VI.40 


must be validated to ensure equivalent solutions. 
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Zmax 
Zc 





= Zmax — constant (VL40 ) 


It is to be noted that Eq. VI.40 must be fulfilled in any case, while some optimization 


scenario can do without Eq. VI.39. 


VI.2.2 Application with Identical Wing Loading 


The sensitivity to mass is tested first, at a constant wing loading, for a problem which 
consists in minimizing the wind strength at a certain height, which is tantamount to 
minimizing the friction velocity. The vehicle mass and wing area were varied by keeping an 


arbitrary constant wing loading of 141.6 N/m’, corresponding to a mass of 7 kg for Mariner. 
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Figure VI.5: Optimal open-loop trajectories, minimizing the required friction velocity for various mass at 
a fixed wing loading of 141.6 N/m’. f =0.0113 for all cases, z) was unchanged at 3 cm. 


A first remark is that the flight path obtained in Fig. VI.5 is fundamentally different to 
that corresponding to a linear profile, displayed in Fig. VI.1. Most significantly, the maximum 
altitude reached by the vehicle is about four times lower and the leeward turn is flattened at 
heights below 5 metres, while it never went under 10 metres in the case of a linear profile. It is 
to be mentioned that the wind strength at 10 metres height is 9.4 m.s” in the present case while 
it was only 1.08 ms” in the case of linear wind profile. Those considerations confirm, if 
necessary, the predominant influence of the wind field on the overall DS trajectory and on the 


required wind strength. 


From Albatross to Long Range UAV Flight by Dynamic Soaring 140 


Equations VI.32 to VI.37 suggests that trajectories should be invariant, since 9 would 
have the same value and this is verified with a constant value of 0.0113 for all cases, a 
characteristic wind gradient Bc = 6.068 s” and a reference length z, of 0.2664 metres. The wind 


friction velocity obtained is 66.3 cm.s and the roughness length was left untouched at 3 cm. 


However, it can be observed that trajectories only match for 7 kg and 8 kg and that the 
other masses show significant altered trajectories. Those relative disparities would remain 
when comparing dimensionless coordinates, as the reference length is identical in the four 
cases. Those can be explained by the fact that state and control variables, which govern the 
evolution of the system over time, have sets that are equivalent to each other but with a time 
offset in some case. Indeed nothing constrains the solver to start from different parts of the 
cycle, as the overall solution is equivalent, with no influence on DS performances. The inner 
calculation process of optimization makes the solver choose a different starting point between 
cases. All variables, but horizontal coordinates, are governing the evolution of the system, as 
can be seen in Eqs. VI.32 to VI.37, and are therefore correlated by the same shift in time 
compared to another solution. Horizontal coordinates are set free to be piloted by equivalent 
sets of other variable, which leads to similar trajectories with a horizontal shift respectively to 


each other. 
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Figure VI.6: Evolution of control variables over dimensionless time, obtained by minimizing the required 
friction velocity, for various mass at a fixed wing loading of 141.6 N/m’. fp =0.0113 for all cases. 


Figure VI.6 shows the correlation between the evolution of the lift coefficient time and 
the evolution of the bank angle over time. The relative shift would be the same in time or in 
dimensionless time, given that the reference wind gradient is constant. Overall, DS within a 


logarithmic wind profile shows equivalent results for identical wing loadings. 


VI.2.3 Application with Various Wing Loadings 


For a fixed planform of Mariner, the mass is varied arbitrarily from 6 kg to 9 kg, which 
represents different wing loading cases. As it was underlined before, this indeed has an interest 
as it corresponds to an operational case of various loading for the same planform. Yet, this 
study is two-fold and also aims at highlighting a behaviour which could have been challenging 
to unlock without focusing on the non-dimensionalization of EoM. Thanks to this approach, 
the way the required wind strength relates to the wing loading can be anticipated theoretically, 
which it is being validated by means of simulations here. Besides, different wind strength are 
expected, leading to different dimensionless variables so the condition stipulated in Eq. V1.40 


about adapting the minimum altitude comes into play. It means that in theory, simply changing 
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the wind strength is not sufficient to establish equivalent solutions for various wing loadings. 
None of the constraints expressed involve x, so there is in theory no need to adapt the 
roughness length as well, which was then unchanged at 3 cm. The methodology used here is to 
start with a baseline case and then adapt the minimum altitude imposed. Baseline conditions 
were a mass of 7 kg and a minimum altitude of 1.70 metres. No attention was paid to ground 


clearance conditions, meaning that the bank angle was not limited by wing tip clearance. 
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Figure VI.7: Three dimensional trajectories optimized to minimize the wind strength required, for 
different vehicle mass. p =0.0113 for all cases. 


Some similarities can again be observed, in Fig. VI.7, between curves which seem to 
duplicate each other at a different scale. This is especially visible in the projected curves of 
trajectories in the North-Altitude plane. Just as in the case of a linear wind profile, the highest 
wing loadings correspond to greater amplitudes, reaching greater values in each coordinates. 


Table VI-2 gives some results about those solutions. 
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Table VI-2: Parameters about solutions displayed in Fig. VI.7. 








Mass mg/S = Vo = “a xX. = D 
ity BES) elm) VO I rau (m) 

6 kg 121.36 6.9543 0.228 1.497 1.457 0.011306 

7 kg 141.59 6.0681 0.266 1.617 1.700 0.011306 

8 kg 161.81 9.6762 0.304 1.728 1.943 0.011306 

9 kg 182.04 9.3916 0.343 1.833 2.186 0.011306 


A first observation concerns the evolution of the required wind friction velocity with 
the wing loading. It is reminded that this evolution does not depend on the surface roughness 
length Zg, which has however influences the wind speed required. When the wing loading 
increases, the required wind friction velocity increases with the square root of the wing loading 


(noted WL in the following expression), as detailed in the calculation detailed in Eq. VI.41. 


lu.lw1, y Wh 
Ra, WL, ( VI.41 ) 


This could be theoretically predicted by theory and can be verified from Table VI-2. In 
order to satisfy equivalence conditions, the minimum altitude is however raised when the mass 
increases such that the increase in wing loading alone cannot explain the increase in wind 
friction velocity. Identically to the case of a linear wind profile, the characteristic wind gradient 
P-. is adapted such that p gets a constant value which is necessary to get equivalent sets of 
EoM. Yet, in the case of the linear wind profile, the wind gradient was directly given by the 
physical model of the boundary layer that was taken into account. In the case of the log profile, 
it is a basically a parameter, directly function of u,, which varies to ensure that sets of EoM are 
equivalent to each other. The fact that the characteristic gradient decreases when the wing 
loading increases, as it would in a linear wind profile case, teaches very few things about the 


physical properties of the boundary layer profile. 
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Figure VI.8: Three dimensional curves of dimensionless coordinates, representative of the evolution 
displayed in Fig. VI.7. 


Figure VI.8 shows that despite the slight offset, curves in dimensionless coordinates 
match quite well. Along the x coordinate (Dimensionless North), the offset that can be observed 
shows that the non-dimensionalization does not lead to an equivalent evolution. Indeed, the 
evolution of X is a function of the roughness length, see Eq. VI.35, and the roughness length 


was not adapted for the dimensionless roughness length to be constant. 


Control variables are all equivalent to evolutions displayed in Fig. VI.6. As a matter of 
fact, optimization problems were rendered equivalent, between cases, through the adjustment 
of constant p by the solver. Hence, sets of control variables were identical to the baseline case 
of m=7 kg, time shift aside. This result has a significant impact on the way DS trajectories could 
be piloted, since control variables could be identical for a wide range of cases corresponding to 


p constant. 


VI.2.4 Non-dimensionalization of Thrust-Powered DS 


The vehicle is able to propel itself via the thrust T, which adds to the control variables 
and modifies Eq. VI.32 in Eq. VI.42. 
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; 1 U, 
mV, =T- 5 PSC Va" —mg sinYg —m 5 7 Sin Ya COS Vq COS Wq (VL42) 


It can be dimensionless in the form specified by Eq. VI.43. 


2 = V 
V'’ =T —pVC)(C,) — sing + 5 5in Va COS Vq COS Wa ( VI1.43 ) 


The on board energetics is modelled through the battery state variable Bat, which 


varies due to the power expense from the propulsive system, which is governed by Eq. VI1.44. 


a 


Bat = — 





Norop ( V1.4 ) 


It can be reduced into the form expressed by Eq. VI.45, after the thrust is dimensionless by the 


weight mg. 





Bat’ = — ( V1.45 ) 


Another objective is tested for different wing loadings, but at constant p, in order to 
find solutions that minimize the consumption per unit length travelled. In the case of DS flight, 
the length that is referred to is the horizontal net distance between the start and the end of the 


path. The consumption is expressed and dimensionless under Co, , in Eq. VI.46. 


Baty Baty a 
Co, = —- ——=—_. = —- mg = mg Co, 


a —_ V1.46 
pr +97 Jr + UF — 


For a given mass, minimizing Co, amounts to a minimum in dimensionless 
consumption Cos. The objective function is dependent on the final northbound position x; 
which itself is a function of evolution of x. The surface roughness length must be adapted, in 


Eq. VI.39, so that the dimensionless roughness length is equivalent in each case. 


In terms of methodology, the same baseline case is used, with a mass of 7 kg, a 


minimum altitude imposed at 1.70 metres and a surface roughness length of 3 cm. From results 
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obtained by minimizing the required friction velocity, u, is then reduced. The lower wind 
strength requires some thrust from the vehicle to manage the full energy-neutral cycle, and the 
wind friction velocity is decreased step by step until Co, reaches an arbitrary value of 0.015, 
without imposing any net heading. Then the same is applied for different masses, ranging from 
6kg to 9kg. For each case, the minimum altitude and surface roughness length are adapted 
such that their respective dimensionless values are the same. Besides, u, is manually adapted 
until Co, hits the 0.015 mark. The non-dimensionalization actually helps to predict what the 
friction velocity should be, by maintaining p the same, in order for optimization problems to be 


equivalent to each other. 


— m=6kg 
— m=7kg 
— m=8kg 
— m=9kg 
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Figure VI.9: Three dimensional trajectories optimized to minimize Co,, for different vehicle mass. Co, = 
0.015 and p =0.00952 for all cases. 


Similarities between curves are of the same nature as observed before. Further results 


are summed up in Table VI-3. 
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Table VI-3: Parameters about solutions displayed in Fig. VI.9. 








Mass mg/s Coc vo “4 —Zmax (Mm) Zo (m) p 
(Nim!) ©05 gg gt) mas ' P 

6 kg 121.36 0.015 1.373 1.457 0.0257 0.009518 

7 kg 141.59 0.015 1.483 1.700 0.0300 0.009518 

8 kg 161.81 0.015 1.586 1.943 0.0343 0.009518 

9kg 182.04 0.015 1.682 2.186 0.0386 0.009518 
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Figure VI.10: Evolution of dimensionless control variables over dimensionless time, for trajectories 
optimized to minimize Co, and for varying vehicle mass. Co, = 0.015 and p =0.00952 for all cases. 








The evolution of control variables is displayed in Fig. VI.10. The dimensionless thrust 
is pictured in order to highlight equivalences between solutions. For all cases, the contribution 
from the thrust is only an impulse, which is due to rate and double continuous constraints 


being ignored. Control variables are well correlated with a time shift between cases. 
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Figure VI.11: Three dimensional curves of dimensionless coordinates, corresponding to Fig. VI.9. 


Higher wing loadings require a higher friction velocity in order to perform at a 
constant specific consumption, which is consistent with results obtained with a non-powered 
vehicle. Friction velocities obtained to get to the same Co, are also responsible for equal p 
between the four results, which validates the non-dimensionalization approach. In the present 
situation, the roughness length has also to be adapted in order to yield equivalent optimization 
problems. Simulation results highlight the equivalence predicted from the theory with great 
accuracy, as the four trajectories are all identical when seen from their dimensionless 


perspectives. 


This final and refined comparison between simulations validates the approach 
undertaken to normalize the equations. Several test cases have been simulated, corresponding 
to different optimization problems. For each of those, simulations validated the similarities 
anticipated by theory, which is a strong assessment regarding the reliability of our 


methodology. 


VI.2.5 Condition to equivalence 


It is assumed that parameters such as the air density or the earth gravitational 
constant are not varying. Rather, following equations sum up the requirements on parameters 
for DS dimensionless solutions to be equivalent. Two separate cases are labelled “1” and “2” in 


the following equations. 
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Let’s assume different wing loadings, whatever variations in mass and in area they 


could relate to. 


(5), - (5), (VL47 ) 


The two cases would lead to equivalent dimensionless solutions only if the condition 


expressed in Eq. VI.48 is met. 


(uo [C"/s), 














= (VL48 ) 
m 
(u.)4 ( /s), 
Besides, the minimum altitude must be adapted as detailed in Eq. VI.49. 
m 
(Zmin)2 4 ( /s), 
~~ V1.49 
(Zmin)1 (""/s), 
And the same follows with the respective roughness lengths in Eq. VI.50. 
m 
(Zo)2 = ( /s), 
(VI50 ) 








(Zo)1 7 ("/c), 


If above conditions are met, then the induced wind strength Wy at any given height, 


between the two cases, would compare as described in Eq. VI.51. 


: ers), ers), cud, (Ys), 
(Wy )2 = (7/5), (Wy) + x 9 omy (VI51) 
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VI.3 Summary of Chapter VI 





This chapter consists in looking at DS flight from a dimensionless point of view, in 
the aim of investigating remarkable properties. Reference values are introduced and related to 
each other to form dimensionless EoM. The approach should not only concern EoM, but the 


overall optimization problem in order to yield consistent solutions. 


Non-dimensionalization is first considered in the case of a linear wind profile. 
Dimensionless EoM display a sensibility to a single parameter Pp. It is verified that equivalent 
dimensionless solutions are obtained when the wind gradient 6 and the wing loading (WL) are 
adapted such that pis constant. In particular, the WL is shown to be a parameter of 
equivalence regarding the required f. Indeed, if two vehicles, with identical aerodynamic 
polars, have the same WL, they would behave identically as far as DS is concerned. It means 
that the required f is identical and that trajectories can be superimposed. Besides, p can be 
associated to a benchmark of DS performances. There is a maximal value of p which allows for 
gliding DS flight. For values above, the vehicle must provide some power, for values below, 
energy expenditure margins are allowed. An increase in WL leads to a lower required f, as can 
be anticipated by the theory. This result is independent from any aerodynamic consideration, 
but deduced out of dimensionless EoM. Heavier vehicles perform DS flight paths of larger 
amplitude, with higher altitudes to reach stronger winds. Finally, simulations show that the 


required wind shear for DS is rather weak, approximately 1 m.s’ over a height of 10 metres. 


Simulations within a logarithmic profile display flight paths that are much more 
flattened at lower altitudes and require a wind strength close to 10 m.s’ at 10 metres height. 
An adequate non-dimensionalization is found and verified. Yet the adjustment of the minimal 
altitude and of Zp is required to yield equivalent solutions. This has hence little practical use, 
since those parameters relate to operational and environment conditions. Besides, the 
interpretation of 0 as a benchmark of DS performance is no longer accurate as the 
characteristic wind gradient does not bear any particular physical meaning. Still, the 
correspondence between theory and simulations provides a valuable support for validation. 
Besides, it can be anticipated that Z) only influences the required u, if the variable x is 
involved in either the objective function or operational constraints. Hence, as far as air relative 
motion is concerned, Zg does not come into play. Finally, an increase in WL requires a stronger 


u, and therefore stronger winds, which again departs from the linear wind profile case. 
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CHAPTER VY [| 


LDS FLIGHT PERFORMANCES 


In order to investigate the potential scope of DS flight for UAV-typed vehicles, the 
feasibility of DS flight and the related enhancements in performance should be assessed with 
respect to a vast range of variables. It has been established that, under certain conditions, it is 
possible to sustain gliding flight by extracting energy from the boundary layer. However, the 
range of mission scenario that offers this opportunity still has to be determined. The aim of this 
chapter is to investigate variations in DS performance with respect to parameters related to the 
vehicle, the flight path constraints and the environment. First, the study focuses on vehicles, 
with the three UAV designs compared in terms of minimum required wind strength. Because 
variations in mass were indirectly performed in the previous chapter, it is chosen to focus 
rather on the way the ground clearance Gzjeg, and the surface roughness Zg impact 
performances. It is to be noted that aerodynamic ground effects are not at stake here, since 
those are not taken into account. Then, new perspectives in DS flight are investigated by 


considering thrust-augmented trajectories. 


VII.1 Comparisons between UAVs 


Without getting into the detail of varying design variables, as those would alter the 
overall performances of each vehicle, it was rather chosen to apply the optimization 
methodology to each vehicle. Indeed, each vehicle embodies the feasible gliding performance at 
their scale. For this sake, similar conditions for all three cases were obviously adopted. It 
consists in a logarithmic boundary layer profile, with a surface roughness Zg of 1 cm, and a 
ground clearance Gejeg, of 50 cm. Those values are chosen arbitrarily, but are estimated to be 
sound and conservative values that would be adequate to represent the evolution of a UAV 
over an open flat terrain, covered with low grass. The vehicle design variables, as well as 
operating constraints adopted, are those specified in Table III-1, masses are not increased from 
their baseline. Indeed, simulations show that an increase in wing loading, within the 


logarithmic boundary layer, entails a stronger required wind friction velocity, even without 
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equivalence considerations raised by chapter VI, where the minimum altitude is adapted 


Results are presented first and then subsequently discussed. 


VII.1.1 Delair-Tech DT-18 


The first vehicle considered is the DT-18. It provides an interesting support for an 
insight into DS flight with rather small UAVs. The compromise between a reduced span at an 


expense of hampered gliding performances is of particular interest. 


Altitudet im) 





Norty, eS x 
my -20 ° > 
300, wee BD 


Figure VII.1: Energy-neutral open loop obtained for the DT-18. It takes 6.90 seconds to complete the cycle 
that is achieved for a minimum required friction velocity of 76 cm.s’, which corresponds to a wind 
strength of 12.80 m.s’ at a height of 10 metres. The vehicle is pictured at a 1.5:1 scale. 


The minimum wind friction velocity required for the DT-18 is 76 cm.s’, which 
represents significant windy conditions, to the order of magnitude of 13 m.s’ at 10 metres 
height. The lowest altitude (1.24 metres) is reached at the beginning of the trajectory, where the 


vehicle is banked at 55° to the left. The overall travelling directions corresponding to this 


trajectory is 142.6°. 


From Albatross to Long Range UAV Flight by Dynamic Soaring 153 


100 
ce. 50} 
@ 
= 
x 0 
at 
o 
m -50 


= 
=) 
=, 


fe 


—- 


Load Factor 





t(s) 


Figure VII.2: Evolution of the bank angle, the lift coefficient and the load factor with respect to time, for 
the DT-18, for the case pictured in Fig. VII.1. 


The evolution of control variables along the path, together with the load factor, is 
presented in Fig. VII.2. It can be observed that apart from barely hitting the load factor limit of 
3 and the limit of C; around 1.2, the DT-18 does not seemed hampered in its evolution by any 
structural or control rate limits. To eliminate any doubt, the maximum rolling rate was 
augmented from 60°/s to 90°/s, in order to assess the influence of this parameter on the 


minimum wind speed required. No significant improvement was observed. 
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Figure VII.3: Evolution of the inertial speed, the airspeed and the local wind speed encountered by the 
DT-18 along its optimized path, pictured in Fig. VII.1. 


As displayed in Fig. VII.3, the airspeed along the path is less than 20 m/s, which should 
not pose any problem for the DT-18. 
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VII.1.2 Mariner 


The second vehicle is Mariner, which was already used earlier. Still, it is used this time 


with its baseline mass of 2.0 kg and under the same operating constraints mentioned above. 


Altitude( im) 





Figure VII.4: Energy-neutral open loop obtained for the Mariner. It takes 4.90 seconds to complete the 
cycle that is achieved for a minimum required friction velocity of 56.7 cm.s', which corresponds to a 
wind strength of 9.55 m.s” at a height of 10 metres. The vehicle is pictured at a 1:1 scale. 


The minimum wind friction velocity required for Mariner is 56.7 cm.s’, which 
represents a wind speed of 9.55 m.s’ at 10 metres height. The lowest altitude (1.24 metres) is 
reached at the beginning of the trajectory, where the vehicle is banked at 36.4° against the 


wind. The overall net direction of travel is 140.6°. 


Compared to the DT-18, Mariner offers a significant improvement in the objective 
function. Indeed, it requires lighter wind conditions for Mariner to be powered by DS than the 
DT-18. Overall, the basic cycle obtained with Mariner has much lower amplitude than that of 
the DT-18. Another interesting aspect concerns the minimum altitude reached by Mariner along 
the cycle. It is very much comparable to the DT-18, while the span of Mariner is around 40 % 
larger, which indicates that Mariner, or the point mass that represents it, manages to get closer 


to the surface, relatively to its span. 
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Figure VII.5: Evolution of the bank angle, the lift coefficient and the load factor with respect to time, for 
Mariner, for the case pictured in Fig. VII.4. 


Mariner hits the maximum load factor of 3 during the lower turn. It can be deduced 
that a stronger airframe, which can allow for a higher load factor, has the potential to reduce 
further the minimum wind speed required. Besides, the limit in C i» of 0.5 s’ is reached, which 
suggests that enlarging pitch control surfaces could also play a positive role. The rolling rate is 


already at its maximum for Mariner and no further computation is made. 
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Figure VII.6: Evolution of the inertial speed, the airspeed and the local wind speed encountered by 
Mariner along its optimized path, pictured in Fig. VIL4. 


Compared to the case computed earlier for Mariner and displayed in Fig. V.13, the 
vehicle has less inertia and therefore the evolution of the airspeed and the inertial speed are 
further out of phase. It is to be noticed that speeds, either air or inertial-based, are 


comparatively lower than that of the DT- 18. 


VII.1.3 Cloud Swift 


The third vehicle considered is Cloud Swift, which is of significantly bigger scale 
compared to the two previous UAVs. Although the gliding performances of Cloud Swift stand 
out and would obviously play in its favour, it may be hampered precisely by its large span, 
during the lower turn. Beyond the comparison with the two other UAVs, simulations with 
Cloud Swift would provide an insight into DS for UAVs of the upper range. Indeed, at 4.32 
metres of span, the analogy with the albatross may start to fade and it is of particular interest 


to assess whether or not DS can be scaled-up to bigger geometries. 
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Figure VII.7: Energy-neutral open loop obtained for Cloud Swift. It takes 8.12 seconds to complete the 
cycle that is achieved for a minimum required friction velocity of 62.4 cm.s', which corresponds to a 
wind strength of 10.52 m.s’ at a height of 10 metres. The vehicle is pictured at a 1:1 scale. 


The minimum wind friction velocity required for Cloud Swift is 62.4 cm.s’, which 
represents a wind speed of 10.52 m.s’ at 10 metres height. The lowest altitude (2.06 metres) is 
reached at the end of the trajectory, where the vehicle is banked at 46.2° to the left. The overall 
net direction of travel is 133.6°. The flight path amplitude is the greatest so far, and so is the 


maximum altitude reached. 
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Figure VII.8: Evolution of the bank angle, the lift coefficient and the load factor with respect to time, for 
Cloud Swift, for the case pictured in Fig. VII.7. 


Yet, it can be observed in Fig. VII.8 that both the evolution of the bank angle and of the 
coefficient of lift show piecewise linear patterns. The bank angle hits the maximum rolling rate 
limit, which is quite low at 30°/s for Cloud Swift. Hence, the wind strength required for Cloud 
Swift could be lowered if the maximum rolling rate was increased, all the more as the vehicle is 
hampered by the constraint during most of the cycle. Besides, the coefficient of lift also reaches 


its maximum rate limit. 
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Figure VII.9: Evolution of the inertial speed, the airspeed and the local wind speed encountered by Cloud 
Swift along its optimized path, for the case pictured in Fig. VII.7. 


Since Cloud Swift appears to be rather restricted in its DS evolution, another run was 
performed by increasing the limit maximum rolling rate, first to 60°/s and then to 90°/s. The 
vehicle can reasonably achieve a higher rate of roll, by enlarging aileron surfaces, without 
sacrificing aerodynamic performances or adding too much weight. Besides, the 30°/s limit is 
somehow a conservative value, obtained from the literature [70]. Therefore, the limit in rate of 
roll is increased without modifying any other parameter, in order to unlock the real DS 


potential of Cloud Swift. 
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Figure VII.10: Energy-neutral open loop obtained for Cloud Swift for |p| < 60°/s. It takes 6.01 seconds to 
complete the cycle that is achieved for a minimum required friction velocity of 52.2 cm.s’, which 
corresponds to a wind strength of 8.80 m.s’ at a height of 10 metres. The vehicle is pictured at a 1:1 scale. 


The resulting trajectory is much closer, in its overall aspect, to that obtained for 
Mariner than before. The minimum wind friction velocity is quite lowered at 52.2 cm.s’, which 
represents a 16.35 % improvement from the previous case. The subsequent wind speed at 10 
metres height is 8.80 m.s . The minimum altitude is also lowered to 1.86 metres, down from 2.06 
metres. It is reached at the end of the trajectory, where the vehicle is banked at 39.2° to the left. 
Hence, relaxing the constraint on the rate of roll has enabled Cloud Swift to turn at a lower 
altitude, reached for a reduced bank angle. The overall net direction of travel is 132°. The 
overall flight path amplitude is lowered compared to the previous run and so is the time of 
flight, down to 6.01 seconds from 8.12 seconds. More aggressive banking capabilities translate 


into a narrower evolution. 


A further improvement can be achieved by increasing the maximum rolling rate to 
90°/s. Yet, it is quite marginal as the corresponding wind friction velocity is then 51.1 cm.s’, 
only 2 % better than the 60°/s case. All variables were quite similar in their evolutions and it 


was chosen to only display results relative to the 60°/s case. 
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Figure VII.11: Evolution of the bank angle, the lift coefficient and the load factor with respect to time, for 
Cloud Swift, with a maximum rolling rate of 60°/s, corresponding to the flight path pictured in Fig. VII.10. 


Another limitation arises, as can be observed from the evolution of variables, plotted 
in Fig. VII.11. Indeed, the maximum rate in coefficient of lift is reached again and C;, also hits 
the maximum value of 1 for a significant fraction of time, during most of the higher part of the 
curve. This suggests another way of improvement from the baseline design of Cloud Swift, 
which would focus on increasing the maximum C,. Yet, a redesign of the wing section may 
hamper the best lift to drag ratio, such that a first approach could consist in controlling flaps 
deployment, when a high C; is required. This usually occurs at low airspeeds, such that gains 


from the higher C; should easily overcome the penalty induced by the increased drag. This 
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would add a single control variable and would allow to play with different aerodynamic polar 
while keeping a conventional design. It would constitute a promising study case which would 


approach the concept of dynamic flaps deployment. 
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Figure VII.12: Evolution of the inertial speed, the airspeed and the local wind speed encountered by Cloud 
Swift along its optimized path, for a maximum rolling rate of 60°/s, corresponding to Fig. VII.10. 


Compared to the case of restricted roll rate, the new trajectory is flown at a lower 
average airspeed. Compared to other vehicles, Cloud Swift achieves its energy neutral trajectory 


for less favourable wind conditions, which is a significant result that is discussed below. 


VII.1.4 Discussion on Vehicle Performances 


Different results regarding the energy-neutral open loops obtained for the three vehicles are 


summed up in Table VII-1. 


Since Mariner was designed in order to minimize the wind strength required [67], it 
can serve as a baseline case for comparison. A first observation is that the DT-18 is not 
especially suited for DS flight. Indeed, the wind strength required is significantly higher than 


all other cases and variables weren't observed to be limited by any specific rate constraint. 


The maximum altitude reached by Mariner is significantly lower than that of the DT- 
18, 11.8 metres against 17.5 metres. Because of the lower wind strength and the lower altitude, 
Mariner therefore manages an energy-neutral trajectory which requires a lower wind power 
density than the DT-18. It may be correlated to the fact that Mariner needs a lower minimal 


power to be maintained aloft, see Table III-1. 
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Table VII-1: Summary of results regarding the DS trajectory obtained for minimum 
required wind friction velocity. 





DT-18 Ni rive Cloud Swift Cloud Swift 





Variable (30°/s) (60°/s) 
u, (cm/s) 76.0 56.7 62.4 52.2 
|Wyelmax (m/s) 13.8 9.8 11.7 9.6 
ty (s) 6.90 4.90 8.12 6.01 
Zmin (m) 1.24 1.24 2.06 1.86 
—), 70 % 50 % 48 % 43 % 
Wace 3 3 3 3 
dmax (°) 74.8 72.9 60.0 74.4 
Wnet (°) 142.6 140.6 133.6 132.0 


The lowest altitude reached by the DT-18 accounts for 70 % of the span, while Mariner 
manages to reach the same lowest altitude which is only 50 % of its span. The DT-18 is banked 
at a higher angle (55°) at its lowest altitude. It may be interpreted as a way to reduce the radius 
and the amplitude of the lower turn and therefore to limit the parasitic drag losses that occur at 
high airspeed. Our interpretation is that, in the framework of DS, the DT-18 sacrifices too much 
gliding performances for the sake of a reduced size. It has to fly at a rather high airspeed in 
order to be able to reach favourable wind strength at high heights and hence has to bank 
ageressively during the lower turn to avoid losing momentum. This manoeuvre induces some 
performance penalty, due to the windward acceleration during the lower turn, on top of the 


parasitic drag. 


Although different designs could be investigated for small fixed-wing UAVs, the 
deterioration of aerodynamic performances does not shed a promising light on the exploitation 
of DS on rather flat surfaces by very small UAVs. Yet, other environments, where the wind 
shear is restricted to limited areas, such as the leeward side of buildings could inversely favour 


small designs. 


However, at the other extremity of the range, results are rather promising. Once the 
flight path limitations due to rolling rate constraints are fixed, Cloud Swift can actually exploit 
DS for a wind friction velocity 13 % lower than Mariner. It must be nuanced by the fact that 


Cloud Swift reaches a maximum altitude of around 18 metres, while Mariner does not exceed 12 
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metres. Hence, the maximum wind strength encountered is not that different, but still at the 
advantage of Cloud Swift, which encounters a maximum wind speed |W,.| max of ~9.6 m/s at the 
top of the path, while Mariner sees winds of ~9.8 m/s. The wind power density required by 


Cloud Swift is hence lower. 


It does not put the design of Mariner into question as the authors voluntarily limited 
its span to 2.5 metres and imposed that the vehicle could be hand-launched [67]. Still, the 
interpretation of the comparison with Cloud Swift is not especially intuitive since the tentative 
explanation of the lower in-flight required power is not verified in this case. Quite the contrary 
actually, since Cloud Swift requires, for static flight, a minimum power that is more than three 
times that of Mariner, see Table II-1. Therefore, Cloud Swift exploits lighter winds even though 
it requires more power to balance drag losses. So the minimum required power isn't the driving 
parameter regarding the feasibility to exploit DS. Yet, the right consideration may have to be 
with minimum sink rate, as it is for thermal soaring or slope soaring [95], rather than minimum 
required power. Even though the two quantities are trivially related, the feasibility to exploit a 
thermal is theoretically assessed by comparing the vertical thermal velocity to the minimum 
sink rate of the vehicle. In this respect, Cloud Swift still has an edge over Mariner, with a 


minimum sink rate of 0.39 m.s”’ (see Table III-1), against 0.42 m.s” for Mariner. 


On the lower part of the flight path, the minimum altitude reached by Cloud Swift is 
higher than for Mariner, and accounts for 43 % of its span, compared to 50 % for Mariner. This 
proportion is to be put on the fact that the minimum altitude is the sum of the wing tip 
clearance, imposed at 50 cm, and of the vertical projection of the span. Therefore, an increase in 
span can lead to a decrease in height-to-span ratio, even though the vehicle is banked at a 
higher angle. It happens in the present case, with Cloud Swift banked at 39.2° at its lowest 
point, compared to 36.4° for Mariner. It would be interesting, in further studies, to assess the 
influence of the wing tip clearance on the DS operating advantage of Cloud Swift. Anyway, the 
absolute height, encountered by Cloud Swift during the lower turn, is higher than for Mariner. 
In itself, it represents adverse conditions to DS flight, even though the increased span 


contributes to increase the lift-to-drag ratio. 


To sum up, compared to Mariner, Cloud Swift exploits more adverse conditions (ie 
lower wind friction velocity) along a path that sees lower wind strength at the top of the curve. 
Hence, the increased mass, due to the bigger scale, as well as the minimum height penalty, 
induced by the higher span, can be overcome by refined aerodynamics (higher (L/D) max, lower 


VZmin), Which enables Cloud Swift to maximize the energy extraction at minor parasitic expense. 
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Our results suggest that the DS scope could be enlarged by using UAVs of larger size 
with improved aerodynamic performances. The upper edge of that trend will be reached when 
the aerodynamic benefits of increasing the span are cancelled by the minimum height penalty. 
Besides, since the version of Cloud Swift used is not especially lightly built, using large 
lightweight planforms of the size of Cloud Swift could enable to lift higher payloads by DS. 


VII.2 Variation in DS performance 


VII.2.1 Sensitivity to Surface Roughness Length 


The way the surface roughness length Zp, influences the required wind strength is of 
particular interest as Zp is representative of irregularities of the surface and therefore of the 
type of terrain where the vehicle could possibly exploit DS. Besides, the relationship between 
the net travel heading w,,.;, with respect to the wind, and the wind strength required is also of 


particular interest. 


Therefore, from the baseline case study of Mariner, presented in VII.1, the roughness 
length is altered from its value of 1 cm. A 0.4 mm value is chosen to approximately represents 
the roughness length of the open sea, while 25 cm is representative of high crops. Besides, the 
net travelling direction W,,.; is not set free anymore, but imposed through an added constraint. 
The ground clearance was fixed at 50cm regardless of the chosen roughness length. Figure 
VII.13 displays results under the form of a polar plot. The radius is representative of u,, while 


the angle is given by Wye. 


Some discontinuities can be observed on the polar plot, especially marked on the red 
curve. Those happen because the optimization methodology, from one calculation point to the 
following one, identifies a better family of solutions and suddenly changes many of its 
variables. Such a discontinuity in the objective function, u, in the present case, occurs due to 
the local nature of optimums yielded by the solver. If only global optimums were found, the 
transition between two calculation points could still lead to differences in nature between flight 
paths, but the objective function should be continuous as long as an optimum exists. In the 
current case, variables of the next calculation step are initiated by using results of the previous 
one and then passed to the solver with a 0.5°.variation in the constrained Wy ,.This 
methodology entails solutions that therefore depend on the set of initiated variables, due to the 
local nature of the optimum point, and that hence have the potential to be improved. This 


happens suddenly for a given Wyet, when the solver switches from one family of solution to 
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another. The fundamental difference between trajectories of different families is illustrated in 


Fig. VII.14, where three types of flight path are displayed, each being optimal for a certain 


range of Wnet. 
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Figure VII.13: Evolution of the required friction velocity u, (radius in m/s), with the net travelling 
direction W,,o¢, for different surface roughness Z. In blue, Z)=0.4 mm, in purple Z )=1 cm (baseline case) 
and in red, Z)=25 cm. Green dots correspond to simulation cases of unconstrained in W,.;, which yield, 
for each value of Zp) , the lowest w,. 


All three curves of Fig. VII.13 are obtained by starting the iteration in W,,; from 
respective simulation cases of unconstrained w,,.;, marked by green dots. Solutions are then 
obtained by either increasing or decreasing Wye, through the additional constraint. It can be 
observed that all discontinuities correspond to a sudden improvement of the objective function 


in the direction of the iteration in W,.e,. Going through w,,.; calculations an opposite way 


From Albatross to Long Range UAV Flight by Dynamic Soaring 168 


would therefore yield different curves at the neighbourhood of the observed discontinuities, 


due to the same effect. 


To sum up on the matter of discontinuities in the objective function, each calculation 
point corresponds to a consistent trajectory that represents a local solution. In the worst case, a 
better optimum than the one actually found is available. Before those discontinuities, relative to 
the iteration direction, the true optimum could belong to the family of curves corresponding to 
the other side of the discontinuity. Each discontinuity can therefore be interpreted as a region 


of uncertainty regarding the true optimum. 


As it was determined by the non-dimensionalization study in Chapter VI, the 
minimum required friction velocity does not vary with Zo. It indeed remains at 56.7 cm.s-, 
regardless of the roughness length, because air relative kinematics remains the same when only 
the roughness length is varied. Yet, in the earth-relative point of view, those trajectories 
translate differently. It can be seen for instance, that the net heading angle which corresponds 
to the minimum wind friction velocity, is shifted leeward when the roughness length decreases. 
Besides, it is all the more difficult for the vehicle to progress against the wind when Zo 
decreases, meaning that the range of feasible w,,.; shrinks leeward when Zp decreases. In Fig. 
VIL13, for a roughness length of 25cm, Mariner can fly in any heading. The required wind 
friction velocity is then minimal for W,,.; around 120° and shows a maximum around 23°. Then, 
a new family of optimal trajectories is found and the vehicle manages to progress against the 
wind, for a friction velocity that remains around 69 cm.s”, regardless of Wyer. It should be 
mentioned that if the required net heading is around 23°, the destination can still be reached by 


combining cycles of higher and lower net heading angles that requires lower wind strength. 


As it was mentioned above, the following Fig. VII.14 illustrates the three families of 
optimal trajectories found by the solver. They each correspond to the three continuous regions 
u, = f(Wnet) seen on Fig. VII.13, for Z9= 25 cm. The significant disparities in shape between 
the three trajectories explain the discontinuities that can be observed in Fig. VII.13. When the 
vehicle is imposed to travel directly headwind, the trajectory obtained resembles that of the 
closed loop presented in Chapter V, except that there is a small distance gained against the 
wind. It is then feasible to progress directly against the wind, cycle after cycle, at the price of a 


rather small net speed of 0.4 m/s. 
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Figure VII.14: Solution cases corresponding to Fig. VII.13, for Z)=25 cm. Top: net flight path heading 0°, 
directed windward, u,=69.2 cm/s, Vne¢=0.4 m/s. Middle: net flight path heading 120°, minimal u,=56.7 
cm/s, Vne¢=8.0 m/s. Bottom: net flight path heading 180°, directed leeward, u,=58.2 cm/s, Vnep=4.8 m/s. 
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The main result is to highlight that there exists a roughness length which permits the 
vehicle to progress against the wind. Inversely, the vehicle ability to progress against the wind, 
and therefore to fly in any heading on the long term, is highly dependent on the surface 


roughness length. This ability is hampered when the terrain has a lower roughness length. 


VII.2.2 Sensitivity to Wing Tip Clearance 


The issue of minimum ground clearance is of particular importance because it calls 
attention to the antinomic tendency between safe operations and favourable DS conditions. 
Therefore, investigating the influence of the ground constraint on DS performances may help 
to determine a compromise. The same methodology as before is adopted so that from the 
baseline case study of Mariner, presented in VII.1, the ground clearance is altered from its value 


of 50 cm. The roughness length was fixed at 1 cm, regardless of the chosen ground clearance. 


Results are presented in Fig. VII.15. Each of the three curves shows, at their extremum, 
an asymptotic behaviour around certain values of Wyet, where the required u, sharply 
increases to suggest a diverging behaviour. Those values of p,,.; correspond to the extremum 
in feasible w,,.; that Mariner can exploit by DS, respectively to each ground clearance imposed. 
The vehicle cannot orientate its net heading further windward of those values. It can be 
observed that the vehicle covers a wider W,.; range when the ground clearance is lower. 
Besides, for each available w,,.;, the required friction velocity is reduced when the ground 
clearance is lower. Furthermore, the net travelling angle that sees the minimum required u,, 


marked by green dots on Fig. VII.15, moves windward when the ground clearance reduces. 
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Figure VII.15: Evolution of the required friction velocity u, with the net travelling direction p,,.;, for 
different ground clearance G,jeq,. In blue, Gejeg7=0 cm, in purple Gejeg,-=50 cm (baseline case) and in red, 
Gciear=1 m. Green dots corresponds to the simulation case unconstrained in W,,.;, which yield, for each 
value of Geiegr , the lowest u,. 


This indicates overall that when the vehicle is allowed to fly closer to the surface, it 
can exploit DS at lower wind strength and for a wider range of net travelling angles. Still, even 
when the ground clearance is set to zero, virtually allowing wingtips to touch the surface, 
Mariner is not able to make any headway against the wind. Indeed, the net travelling angle does 


not go below 111°. 


VII.3  Thrust-Augmented Dynamic Soaring 


The study case approached by varying the surface roughness and the ground clearance 


underlines that flying against the wind by pure DS flight requires a combination of high surface 
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roughness and low ground clearance. On top of that, requirements in terms of wind strength 
are sometimes demanding and the likelihood to meet favourable conditions can be assumed to 
be rather low. Rather than expecting a complete provision of the vehicle energy needs by the 
wind; those could be only partially covered by DS flight, while the vehicle would provide the 
rest on its own power. Such a configuration would widen the scope of DS application, by 
lowering requirements related to environmental conditions, while still benefiting from the 
same energy-harvesting principles. A thrust-augmented DS study case is presented before 


exposing some range-improvement charts. 


VII.3.1 Basic features 


The principle of adding thrust (T) as a third control variable is introduced in Chapter 
VI. In the following study, inspired by the baseline case obtained in VII.1 for Mariner in pure 
gliding flight, u, is arbitrarily decreased from the minimum required, 56.7 cm.s’, down to 47.5 
cm.s’, so the wind strength is 8.00 m.s’ at 10 metres height The wind friction velocity is not the 


objective function anymore and is determined as an input to the optimization problem. 
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Figure VII.16: Thrust-augmented-energy-neutral open loop obtained for Mariner, achieved within 4.86 


seconds, for u, = 47.5 cm.s’. The pink-coloured dots are proportional, in size, to the intensity of the 
thrust provided by the vehicle. The vehicle is pictured at a 1:1 scale. 


The objective function is to minimize the energy consumption per unit distance 


travelled (Co,), that is defined in Eq.VI.46. The corresponding trajectory is shown in Fig. VII.16, 
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with the variation of the thrust also pictured along the path. The flight path of minimal Co, is 
performed with thrust applied during the climb, with a peak reached at the steepest climb 
slope. The maximal thrust intensity accounts to just under 5 % of the weight. The Co, obtained 
represents only 45 % of Conso,,jn, suggesting that, under flight conditions specified by Fig. 
VII.16, the range could be more than doubled. 


Subsequent variables, relative to Fig. VII.16, are plotted in Figs. VII.17 and VII.18. It 
shows the evolution of the thrust with respect to time and relative to other control and state 
variables. It can be observed that the peak in thrust occurs when wings are close to level, in the 


steepest part of the climb. 
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Figure VII.17: Evolution of @, C; and T with time, for Mariner, along the flight path that minimizes Co,. 
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Figure VII.18: Evolution of the altitude and y; with time, for Mariner, along the flight path that 


minimizes Co,. 


From Albatross to Long Range UAV Flight by Dynamic Soaring 175 


VII.3.2 Range Sensitivity to the Wind 


Since DS flight is investigated as a way to enhance the vehicle range and endurance 
through the exploitation of wind, it is necessary to compare DS simulations with appropriate 
baseline cases. The approach undertaken in this part is to compare DS-induced performances 
with the nominal range of the vehicle in the presence of wind. Basic nominal performances 
calculations are introduced in Eq. III.4 and Eq. IIL8 for the endurance and the range 
respectively. However, those are valid in still air and one may wonder how those vary with 
wind. Besides, as DS flight requires the presence of wind anyway, it is only consistent to 
compare DS-induced performances with wind-altered performances, before concluding on the 


eventual advantages of DS flight. The vehicle considered here is Mariner. 


It has been presented earlier that endurance is directly related to the minimum sink 
rate of the vehicle. That latter parameter will not be influenced by the presence of horizontal 
wind, which basically means that the baseline maximum endurance of a vehicle would remain 
identical regardless of the presence of horizontal wind. However, the vehicle will be carried 
away by the wind during its flight. So if the endurance performance is required in association 
with the coverage of a specific area on the ground, the conclusion that the endurance does not 
depend on the wind strength is not so obvious. Still, let’s try to address this question without 
calculations, but by conceptualizing a vehicle gliding in the wind. The endurance of the vehicle 
won t be affected if the vehicle manages to sustain its minimal sink rate. This functioning point 
is associated, among other parameters, to a specific airspeed, which would be called the 
airspeed of minimum sink rate. As long as the wind strength encountered is lower than the 
airspeed of minimum sink rate, the vehicle can make headway against the wind while 
maximizing its endurance. Hence, a closed trajectory, with respect to earth, can be flown 
without sacrificing the endurance. The extreme case of this later consideration happens if the 
wind speed is equal to the airspeed of minimum sink rate. In that case, the vehicle can face the 
wind, being motionless with respect to earth. It can be seen from Fig. II.9 that, in any case, the 
speed of minimum sink rate is just above 8 m.s’ for Mariner, which gives an order of magnitude 
of the maximal wind strength the vehicle can handle without compromising on endurance. It 
can then be concluded that, unless specific cases, the endurance does not depend on the wind 


strength. 


The case of the range is different. It refers indeed to a distance, while the vehicle flies 
with respect to the surrounding mass of air. The following lines detail a methodology that 


provides the functioning point of maximal range, for a vehicle in static flight within a windy 
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environment. A top view of that configuration case is provided by Fig. VII.19, the vehicle is 


flying in a straight line. 


| Wind 





y 


Figure VII.19: Top schematic view of the triangle of speed, between inertial speed, wind speed and 
airspeed, for a given inertial speed heading and a given wind speed. 


The ratio between the distance flown, relative to earth, and the theoretical loss of 


height in gliding flight is expressed in Eq. VII.1. 


Z 2 
Dist ab a iy ( VIL.1 ) 


h Viz 
The vectorial relation between speeds is reminded in Eg. VIL2. 


+W ( VIL.2 ) 


SN 


V;, = 
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It breaks down into three scalar equalities, detailed in Eq. VII.3. 


Vix = Vax — Wy 
Viy = Vay ( VIL3 ) 
Viz = Vaz 


By combining Eq. VII.1 and Eq. VII.3, the ratio between the distance flown and the 


corresponding loss of height is expressed in Eq. VII.4. 


17% 
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Besides, the equilibrium state is translated into Eq. VII.5, regarding forces involved. 


1 

3-5. C,.V,* = mg cos Yq 

1 ( VIL5 ) 

5P-S. Cy.Vy* = —mg sinyg 

Hence the lift to drag ratio is expressed in Eq. VIL.6. 
Cr 1 
Cc, tay, ( VIL6 ) 
The airspeed components relate, by definition, as detailed in Eq. VII.7. 
Vaz = —Va SIN Yq 
( VIL7 ) 


es — Vig = Va COS Yq 


By combining Eq. VII.6 and Eq. VII.7, it directly leads to Eq.VIL8. 


ge = | A 
C, _ aX ocd ( VIL8 ) 
Cp Vaz 


Hence, be replacing Eq. VII.8 into Eq.VIL4, the ratio between distance flown and loss 


of height is expressed in Eq. VIL.9. 








Dist [ey Wy* 2VaxWy 
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Equation VII.9 underlines that, with the presence of wind, the ratio between distance 


travelled and height lost is no longer equal to the lift to drag ratio. Rather, it involves, among 


From Albatross to Long Range UAV Flight by Dynamic Soaring 178 


others, the wind speed. Yet, the ratio detailed in Eq. VII.9 is directly related to the in-flight 


energy-expenditure, as it is reminded in Eq. VII.10. 





h 
Fexpenditure = Mg * h = mg * Di * Dist ( VII.10 ) 


st 


The Eq. VII.11 is therefore obtained, specifying the minimum energy consumption per 


unit length travelled, which guarantees the maximal range. 


mg 


CONSOmin = (ist/h) nox ( VII.11 ) 


By replacing Eq. VII.9 in Eq. VII.11, the minimum consumption writes down as in Eq. VII.12. 


( VIL12 ) 





Max 


In order to maximize the range in the presence of wind, the vehicle must not fly at the 
maximum lift to drag ratio anymore, but rather maximize the expression into brackets detailed 
in Eq. VII.12. Note that if the wind Wy is zero, Eq. VII.12 does well merge into Eq. III.7. The 
result highlighted by Eq. VII.12 is well known by glider pilots, which must adapt their airspeed 
to the wind intensity in order to optimize their range. This also applies when the motion of air 
is vertical, when meeting updrafts or downdrafts, albeit speed components obviously right 
down differently. An application is given by Chakrabarty et al. [96], which focuses on 
exploiting an a-priori map of vertical components of the air using heuristic search. Adapting 
the airspeed to in-flight conditions is part of the problem. In the present case, conditions in 
terms of flight path heading and wind speed are known, the objective consists in maximizing 


the range and is formalized in the Eq. VII.13. 





ieee ((2] Wn’ mt) 
AxXlMIze — a VIL.13 
Cp Var” Var ( ) 


It is a relatively straightforward optimization problem, as the airspeed is the only 
variable at stake. A simple methodology to find the extremum is derived in the following lines. 


First, an expression for the x-component V,, of the airspeed is sought. 
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By definition, components of the horizontal inertial speed relate to the inertial flight 


heading angle as given in Eq. VII.14. 


Viy 
tan Wi = V.. ( VII.14 ) 
UX 


Equation VII.15 is obtained by substituting Eq. VIL3 into Eq. VII.14. 


tan Wi = ( VIL15 ) 


Equation VII.16 is given by definition. 
Vig ay Vag = (Va cos Ya)’ ( VII.16 ) 


Equation VII.15 and Equation VII.16 form a system with two unknowns, Vj, and Voy, 


summed up in Eq. VII.17. 


(Vax — Wy) tan Yi= Vay 
( VIL17) 


| a ir Ve — (Va COS Vi)" 


The x-component of the airspeed can be found by solving the quadratic Eq. VII.18, 
obtained from Eq. VII.17. 


Vax (1 + tan? hi) — Vax 2 Wy tan* p; + [Wy* tan? yp; — (Va cos ¥q)?]=0 (vmis) 


Finally, the process to maximize the expression stated in Eq. VII.13 is to iterate 
through discrete values of the airspeed V,. Steps are enumerated as follow. First, for a given 
value of the airspeed V,, the corresponding aerodynamic coefficients at equilibrium are trivially 


computed through Eq. VII.19. 


Cp = Cpo Coin + Cpa +O p30, PCpal, 


m 
IC? + Cy? = a ( VIL.19 ) 
[9 p.S.Vq 
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Then, the corresponding air relative glide angle is computed out of the aerodynamic 


coefficients Cp and C,, as follow in Eq. VII.20. 


D 
Ya > cer ( VIL.20 ) 
ip 


So it directly gives Eq. VII.21, providing the air relative sink rate. 


Vaz = Va SINYq ( VIL21 ) 


The final step is to find V,, by solving Eq. VII.18. Therefore, by knowing V,, and V,,, 
the expression defined in Eq. VII.13 can be estimated. And so on so forth for various iterations 


on the input value of the airspeed V,, until a maximum is reached. 


Therefore, for any given travel heading wy; with respect to the wind direction, and for 
any given wind strength Wy, the airspeed corresponding to the minimal consumption per unit 


length travelled can be found. And so is the related consumption. 


This methodology is applied, for Mariner, in a specific case, in order to illustrate the 
influence of Wy and yw; on the achievable range in straight flight. Since the vehicle’s range 
depends on the capacity of its battery, which is undetermined, the ratio between the range with 
wind and the nominal range enables to draw performance comparisons. It can be expressed in 


Eq. VIL. 22. 
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Figure VII.20: Evolution of the range ratio versus the flight heading, for a given wind. Chart obtained for 
Mariner, flying in straight line, under a wind strength of 8 ms’. 


The influence of the wind on the achievable range is clearly depicted on Fig. VII.20. It 
displays that if Mariner flies straight into 8 m.s’-strong headwind, its maximum range would be 
reduced by more than half compared to its nominal range without wind. On the opposite, if 
Mariner flies completely leeward, its range can almost be doubled. The overall “egg-shaped” 
pattern obtained is not a circle, as it would be if the vehicle was flying at a constant airspeed in 
all directions. An interesting aspect of the alteration of range performance can be observed. It 
concerns the case where the vehicle needs to follow a heading on the ground which is oriented 
crosswind, in this case, w;=90° (or 270°). It can be observed that the associated range is 
significantly reduced, by more than 25% compared to the nominal case. The proper explanation 
is that the vehicle is actually facing the wind to some extent in order to maintain an inertial 
crosswind heading. The airspeed has a component against the direction of the wind, which 


comes at a direct range penalty. A true air relative crosswind flight happens when the airspeed 
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is orthogonal to the wind. In this latter case, the range along the direction orthogonal to the 
wind is unaffected, but the vehicle is also carried away by the wind in the process, which 
means that the overall range is augmented. It happens, in Fig. VII.20, for W;=132° (or 228°), 
where it can be observed that the crosswind component of the range ratio is equal to 1. Stated 
otherwise, it is the only inertial heading that gives the maximum range in the crosswind 


direction. 


Now that the methodology to find the maximum range of a vehicle in straight static 
flight, under windy conditions, is laid down, it can be applied for the sake of comparison with 


DS-enabled range. 


VII.3.3 Range enhancement by DS 


For this last study case, the range in straight line is compared to the range obtained by 
DS, for various heading angles and for several wind speeds. The objective function is still to 
minimize Co;. However, operational parameters were chosen carefully to match potential 
realistic operational conditions. Indeed, the aim is to get to a consistent quantification of the 


improvement DS could provide to the long range flight of a UAV. 


The vehicle is Mariner, with a virtual payload of 2 kg, such that the overall mass is 4 
kg. The surface roughness length Zg=2 cm is chosen in order to match that of an open terrain, 
covered with low grass [76]. The altitude is limited directly through a minimal altitude 
constraint at the centre of gravity, as it is assumed it corresponds to a more realistic operational 
case than monitoring the wing tip clearance. A conservative value of 2 metres was chosen, such 
that even banked to the maximum, Mariner would clear the ground, at its wing tips, by a 
margin of at least 75 cm. Besides, since DS results would be compared to a straight line case, it 
is also necessary to limit the maximal altitude, otherwise the vehicle could opt for unlimited 
heights to seek for stronger winds and the problem would be unconstrained. We decided that 
1000 metres was a fair order of magnitude, for both the upper validity bound of our log-wind 
profile model and the operational altitude of a small UAV. Other than that, all other parameters 


were kept identical. 


The metric plotted consists in the range ratio, defined in Eq. VII.22. Straight line 
results are obtained by applying the methodology applied developed in VII.3.2, with a further 
degree of freedom since the altitude can vary between 2 and 1000 metres. A simple comparison 
loop is used to determine the altitude which optimizes the straight-line range. As for DS results, 


the net heading angle w,,¢ is incremented step by step by intervals of 0.2". Variables of the next 
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calculation point are initiated by using results from the previous calculation point, leading to 
potential discontinuities, in the objective function, related the local nature of the optimum 


found, as discussed earlier. 
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Figure VII.21: Variation of the range ratio with respect to the net heading angle, compared to the straight 
line case, for u,=66 cm/s, which corresponds to a wind strength of 10 m/s at an height of 10 metres. In 
green, results from straight line, in blue, results from DS. 


A first simulation is run for a wind friction velocity just below the minimum wind 
strength required for DS gliding flight. To begin with, the straight line case, seen in Fig. VII.21, 
displays the leeward shift that was described earlier. Yet, a variation in curvature can be 
noticed for Wyez ~ £135". It corresponds to the net heading angle at which the vehicle swap 
from a straight line at the lowest altitude to a straight line at the highest altitude. On the 
windward part of the curve, the vehicle flies as close as possible to the surface to get the lowest 


headwind as possible. On the contrary, on the leeward side, the vehicle takes advantage of the 
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strongest possible tailwind to increase its range. Regarding DS results, the range shows a 
significant maximum for Wy,e¢ around 135°, which corresponds to values of the heading angle 
that were minimizing the required wind strength. It hence reinforces the assertion that DS 
flight largely favours flight headings in this area. Besides, it is also the zone that was identified 
by biologists as the predominant travel heading of albatrosses under strong wind conditions 
[29]. At the maximum, the range provided by DS is more than a 1000 times the straight line 
range. It almost diverges because the wind strength is close to that which allows virtually 
perpetual flight. An important aspect is that for windward flight headings, the DS curve merges 
with the straight line curve, suggesting that when heading +45” into the wind, the best option 
is to fly straight, at the minimum altitude. DS proves more efficient than the straight line for 
some flight paths that have a windward facing component. This underlines that DS can be 
beneficial, even in some cases when the flight path has a windward component. The advantage 
of using DS versus straight line in those configurations is not to be underestimated by the log- 
scale applied, as the range offered by DS can be up to 30 % higher. For leeward flight headings, 
even though the vehicle flies at the maximum altitude in straight line, the DS flight provides a 


significant range advantage all the way until w,,.;=+180°, where the two curves join. 
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ENDURANCE RATIO VERSUS NOMINAL CASE 


North 


West East 





South 


Figure VII.22: Variation of the range ratio with respect to the net heading angle, compared to the straight 
line case, for u,=59.4 cm/s, which corresponds to a wind strength of 9 m/s at an height of 10 metres. In 
green, results from straight line, in blue, results from DS. 


When the wind strength decreases to 9 m/s at 10 metres, as displayed in Fig. VII.22, the 
maximum of the DS range shrinks to only 5 times the nominal zero wind range, which is also 
almost 5 times the straight line range. All other aspects of the curves mentioned earlier are 
repeated here, to the exception of flight paths directly down wind, where the DS range is lower 
than the straight line range. This illustrates very well the fact that the optimum found by the 
optimization routine can be local and can therefore be outperformed by a trajectory which is 
also a solution of the optimization problem. Even though DS trajectories merge into a straight 


line, when the flight path orientates windward, it does not happen identically leeward. 
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ENDURANCE RATIO VERSUS NOMINAL CASE 
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Figure VII.23: Variation of the range ratio with respect to the net heading angle, compared to the straight 
line case, for u,=54.7 cm/s, which corresponds to a wind strength of 8 m/s at an height of 10 metres. In 
green, results from straight line, in blue, results from DS. 


For a wind strength of 8 m/s at 10 metres, as presented in Fig. VII.23, the maximum in 
DS range shrinks to around 2 times the range of the straight line and the heading angle of that 
maximum is shifted windward. Overall, DS becomes advantageous over the straight line for 
only ~50 % of the flight headings. Besides, the gap, leeward, between straight line and DS, 


increases. 
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ENDURANCE RATIO VERSUS NOMINAL CASE 
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Figure VII.24: Variation of the range ratio with respect to the net heading angle, compared to the straight 
line case, for u,=47.9 cm/s, which corresponds to a wind strength of 7 m/s at an height of 10 metres. In 
green, results from straight line, in blue, results from DS. 


For a wind strength of 7 m/s at 10 metres, as displayed in Fig. VII.24, the improvements 
in range do not seem high enough, on most of the compass rose, to justify the complexity of DS 
flight. To the exception of flight path near wW,,.;=+120°, where DS trajectories still manage 


around 50 % better range compared to straight line. 
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ENDURANCE RATIO VERSUS NOMINAL CASE 
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Figure VII.25: Variation of the range ratio with respect to the net heading angle, compared to the straight 
line case, for u,=41.06 cm/s, which corresponds to a wind strength of 6 m/s at an height of 10 metres. In 
green, results from straight line, in blue, results from DS. 


For wind strength of 6 m/s at 10 metres, as seen in Fig. VII.25, the difference in range 
between straight line and DS is marginal and do not justify the use of DS on any heading of the 
compass rose. An interesting aspect at leeward headings is that the DS curve finally merges 
with the straight line curve. The DS optimization process indeed yields a straight line at the 
maximum altitude for those headings. The subsequent discontinuity in the DS range curve 


confirms the local nature of the optimum before it merges with the straight line curve. 
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VII.4 § Summary on DS Performances 





It is first demonstrated that DS is potentially feasible for a wide array of vehicle size, 
with yet significant disparities in required wind strength. It is established that larger vehicles 
could benefit from less favourable conditions. Even though no specific driving aerodynamic 
parameter is outlined, aerodynamic refinements of larger vehicle enable to overbalance their 
induced mass and wingspan penalties. Ideally, lightweight architectures of small wing span, 
low sink rate and high lift to drag ratio would favour DS flight. Practically, some of those 
properties are antinomic and the compromise appears to favour larger vehicles. This result is 
somehow encouraging from an application’s perspective, yet there must be an upper scale to 
that trend and limitations of the point mass model certainly arise when the scale of the 


wingspan becomes comparable to that of the wind gradient. 


As for variations in DS performance, a specific focus is given to two governing 
variables: the ground clearance and the surface roughness length Zy. The ground clearance 
directly affects the required wind strength and the array of feasible net travelling angle Wy 4 
towards the wind. A reduced ground clearance lowers the required u, and widens the range of 
feasible w,,;. Yet, for Zg=1 cm and zero tip clearance, flight headings are still limited such that 
making headway against the wind is not possible. As for Zg, in confirmation with the theory 
from Chapter VI, it does not affect the minimum required u,. Yet it changes the earth-relative 
motion of the vehicle: a rougher surface enables to exploit DS for lower wind speeds and for a 
wider scope of Wnez, eventually leading to DS opportunities all around the compass rose. 
Therefore, DS performances are intrinsically linked to surface characteristics and sea-surface 


conditions are challenging to exploit. 


In order to widen the range of favourable DS conditions, thrust was added as an extra 
control variable. The wind hence only contributes to a fraction of the vehicle’s energy needs. 
When thrust is applied during the upwind climb, the vehicle can take the most out of its 
propelling work by reaching stronger winds. The advantage of DS flight is assessed by 
comparing DS-enabled-range and _ straight-line-range. Significant improvements can be 
achieved by DS, even when the wind is 30 % lighter than previously required, but there are 
significant variations with Wye. Range enhancements are possible even when heading against 
the wind, but when u, decreases, the advantage of DS reduces and becomes limited to values 


of Wet Where DS is most efficient, around 135°, in accordance with albatross observations 
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(CONCLUSION 


The present work has focused on providing an in-depth analysis of DS flight, by 
detailing the methodology, the understanding and the performances associated with DS 
simulations. It completes past studies on DS by investigating further many multidisciplinary 
aspects associated with the topic and by its intention to give shape to a consistent and 


extensive overview of DS challenges. 


Summary of findings 


An extensive review of the engineering inputs from albatross biology studies is 
undertaken in the first Chapter. Although many publications on DS mention some aspects 
related to the albatross flight, this part aims at gathering in a single document all the pieces of 
information that can raise interest and guidance out of the observation of albatrosses. Then this 


literature review is extended to publications in the domain of in-flight energy extraction. 


A methodology is set up in order to adequately model the flying system and its 
environment. In particular, the choice of a point mass model for the vehicle is explained and is 
quantified for one of the three geometries chosen, through a wind tunnel testing campaign. 
Besides, a methodology to relate wind and ocean surface state is extensively detailed, which 
constitutes a first step towards conciliating wave soaring with dynamic soaring. The main 
contribution from this part is to underline that realistic ocean surface roughness lengths are an 
order of magnitude lower than what has been considered in DS literature so far. Only shallow 
wave trains are considered, representative of a well-developed swell and the hypothesis behind 


the model are developped. 


Equations of motion are derived by beginning to underline the significance of the 
point of view considered. Both earth-relative and air relative systems of EoM are derived before 
selecting inertial equations for the remaining of the methodology. An optimization problem is 
set up, laid out in its structure before a particular focus is given to numerical integration 
techniques required to convert the underlying problem. Collocation methods are presented step 
by step, by highlighting the underlying mathematical hypothesis. The induced parameter 
optimization problem is detailed and numerical values for operational constraints are given for 


each vehicle. The solving methodology and the associated tools are presented with a brief 
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description of their mathematical background. The problematic of variable initiation and of 
local optimizer is introduced, to be reminded for results obtained. Eventually, a trajectory of 
similar conditions as in the literature is run before validating the methodology thanks to the 


analogy obtained. 


A specific closed-trajectory is produced to highlight the loitering potential of DS flight, 
as well as to serve as a support for developing an understanding of energy-harvesting 
mechanisms. Local power input expressions are established for various contributions and 
explained in their variations, before integrating those along the whole flight path into evolution 
in total energy. A particular focus is given to wind energy exchange within the boundary layer. 
Periodicity constraints are partly relaxed to get to an open loop which combines the same 
phases of flight as the closed loop. Then a specific study case that optimizes DS within the 
wavy ocean boundary layer model is established. It shows that the methodology of DS flight 
simulation can be applied to a three dimensional moving wind field to produce trajectories that 
associates classical DS with a kind of slope soaring on the forward moving face of wave. The 
positive contribution from the wave updraft is detailed and changes in wave phase periodicity 


are performed. 


Fundamentals of DS flight are then approached by non-dimensionalization of the air 
relative EoM. First with a linear wind profile, as inspired from the literature, then by achieving 
for the first time a non-dimensionalization with a logarithmic wind profile. Several objective 
functions are tested and a thrust-augmented case is introduced. Conditions to equivalences in 
dimensionless solutions are given. On top of providing a strong support for validating the 
simulation methodology, the approach permits to highlight certain remarkable behaviour about 
DS, such as the sensitivity to wing loading and the invariance of the minimum required wind 


friction velocity with the roughness length. 


Then, the focus was given to flight performances by DS. The three different vehicle 
architectures are tested under identical operational constraints, before concluding that the 
planform of the biggest scale is the most promising for maximizing the likelihood of DS flight. 
Then, the influence of the net heading angle, the ground clearance and the surface roughness 
length was established for DS gliding flight. It leads to the conclusion that favourable 
conditions for DS flight over a wide range of heading angles are difficult to meet. Hence an 
alternative is presented by assisting thrust-powered flight with DS. The evolution of that third 
control variable is detailed before DS range charts are produced and compared against the case 


of straight line case. 


From Albatross to Long Range UAV Flight by Dynamic Soaring 193 


With the benefit of hindsight, this project managed to outline a methodology that 
successfully simulates DS flight. It shows to be rather robust and can accommodate varying 
constraints and objectives. One of the strongest assumptions is the choice of a point mass 
model, which imposes that the operational constraints linked with the vehicle dynamics are 
artificially imposed rather than modelled. Besides, this model reaches its limit when the length 
scale of the wind gradient gets similar to that of the vehicle. This may happen especially during 
the low turn, when the vehicle is banked at low altitude within the zone of strong shear. 
Furthermore, the initiation of variables, before passing the problem to the solver, is also critical 
and may lead to different solutions. It requires a careful approach and eventually a guess of 


what the solution may look like. 


Then, mechanisms of energy-harvesting are properly understood and thoroughly 
detailed. The work of aerodynamic forces is the main parameter at stake and the DS flight path 
is a cycle that orientates those forces successively such that gains overbalance losses. The 
upper turn is the main contributor to the energy gain, through the attitude belly-to-the-wind- 
attitude that sees a strong acceleration of the vehicle in the direction of the wind. The lower 
turn is all the more critical as it concentrates the biggest losses. As such, the minimum altitude 
allowed is a key parameter since it directly impacts the wind strength opposed to the vehicle. 
The wind gradient does not alter the local work of aerodynamic forces but plays a role when 
the vehicle kinematics is integrated over the whole flight path, as it introduces disparities in 


wind strength between the top and the bottom of the trajectory. 


Finally, variations in DS performances were quantified against variations in vehicle 
design, ground clearance and surface roughness length, before drawing DS-induced range 
performance charts. A first conclusion is that practical flight applications are favourable for a 
lightweight planform of bigger scale than the albatross, with refined gliding performances. A 
second result is that unless very rough terrain is met, with close ground clearance margins, the 
scope of DS is limited to certain flight headings with respect to the wind. Hence, one approach 
to cope with such limitations is to introduce thrust-augmented DS, where the vehicle provides 
only a fraction of the energy it requires to stay aloft. Results show that the range can be 
improved for a wide range of flight path headings, for winds that are lighter than those 
required for pure gliding DS flight. 
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Perspectives 


Given that even thrust-augmented DS requires rather demanding windy conditions in 
order to stand out performance-wise, the challenge of future DS research is to find solutions 
that lower the required wind strength and hence widen the scope of exploitable environment 


conditions. As such, several direction of research can be followed. 


e Investigate the vehicle planform optimized for lowering the required wind 
strength to exploit DS, without a-priori limitations regarding the vehicle 
design, in order to yield the absolute lowest wind strength exploitable, given 
state of the art designs. Orientate the design towards a planform of bigger 


scale than the albatross. 


e Integrate further refined control behaviours, by investigating the potential 
advantage of active flap deflection. Since the vehicle is defined by an 
aerodynamic polar, morphing the polar along the flight path has obvious 
promising prospective, and a simple and known way to do so is to introduce 
flap deflection. Besides, ground effect would also influence the aerodynamic 
polar at low altitudes and it would be interesting to weigh the influence it has 
on DS trajectories by implementing a simple model of ground effect. Also, the 
theme of regenerative soaring can be approached by first investigating the 
influence of a thrust which can be negative, in order to model in-flight energy 


recuperation. 


e In the case of DS within a turbulent boundary layer, develop an objective 
function which integrates DS inputs of various frequencies due to different 
wind contributions. Hence, augment classical DS within the mean wind profile 
of the boundary layer with gust soaring contributions that benefit from the 
stochastic turbulent behaviour of the boundary layer. It would be essential to 
model the influence of a local span-wise variation in wind strength on the 
flight dynamics and energetics and hence move to a six-degree-of freedom- 


model to fully investigate its influence. 
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e Approach DS within various wind shear regions to benefit from stronger wind 
gradients or from a linear spatial variation. As for the first case, DS on the 
leeward side of a hill, or a building, may benefit from a stronger wind shear, 
without the inconvenience of very close ground proximity. As for the second 
case, it is shown in Chapter VI that a linear wind profile could be exploited for 
much lower wind speeds. An environment where this happens and is rather 


predictable is the so-called jet stream. 


Another area requires further research in order to conclude on the feasibility to 
perform DS for a UAV. The demonstration would be indeed completed when a proper vehicle 
manages to autonomously exploit DS trajectories and when the subsequent energy-harvesting 
mechanisms are measured and validated. To that extent, several directions of research can be 


outlined. 


e Propose a flight control strategy that uses only limited a-priori knowledge of 
the wind field, together with in-flight wind measurements and ground 


proximity sensors in order to autonomously pilot DS trajectories. 


e Apply the state of the art DS methodology within a stochastic turbulent 
boundary layer to assess the influence of gusts on the flight path for 
unchanged flight controls and quantify the influence on the energy- 
harvesting. Conclude on the feasibility to apply the current DS strategy to the 


real-world. 


e Finally, set-up an experiment using a well-quantified platform over an open 
field covered with regular low grass, such as an airfield. The vehicle has to be 
equipped with the appropriate airspeed and GPS sensors to be able to 
reconstruct the local wind speed. The experiment can be guided by a human- 
pilot at first, that approximately reproduce DS flight path, and then the 
guidance should be left to an auto-pilot in order to validate the flight path 
strategy. A robust design should be chosen, as the close ground proximity 
would likely induce accidental collisions until the ground clearance constraint 


is well integrated into the flight path management. 
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Abstract. We investigated the relationship between body mass and the ralc and pattern 
ot feather growth of the four outermost primaries of Wandering Albatross (Diomedea ext- 
fans) chicks. Maximum growth rafes were similar (4.5 mm day ') for alt feathers and be- 
tween sexes, although primaries of males were significantly longer than those of females. 
There was a distinctive pattern to primary growth with p10 grown last, reaching ils asymp- 
tote just prior to fledging. Primaries growing did so al different maximum ratcs; thus p7 
reached its asymptote at an earlicr age than p8 or p9, bul maximum yvrowth rates were the 
same for all primarics. Maximum growth rates of p? and p& were significantly correlated 
with chick mass at the start of the period of primary growth, and chick mass also was 
correlated with age at fledging. The heavier the chick, the earlier it grew its primaries and 
the younger it fledged. Fledging periods tor Wandering Albatross chicks may be constrained 
by the time required to grow a full set of primaries. We suggest that the observed paticm 
of feather growth is a mechanism to minimize potential wear of the outer primaries pricr 


to fledging. 


Key words: 
growth, 


INTRODUCTION 


For adult seabirds, molt is probably one of the 
most significant activities outside the brecding 
season (Prince et al, 1993, 1997, Langston and 
Rohwer 1996). In annually breeding albatrosses, 
the time available for moit may be as little as 
100 days, so feather replacement is spread over 
two or more years (Prince et al. 1993, Langston 
and Rohwer 1995). Breeding status and body 
condition also influence the extent of annual 
molt (Weimerskirch 1991, Langston and Hill- 
garth 1995, Cobley and Prince 1998), and such 
are the potential interactions between brecding 
and molt that some species of albatross may 
need to skip a year of breeding to replace ac- 
cumulated worn primaries (Langston and Roh- 
wer 1996). 

Chicks face potentially even more substantial 
challenges. They need to grow all their flight 
feathers within one-half of their complete fledg- 
ing period (Ricketts and Prince 1981). For bi- 
ennial brecding species such as the Gray-headed 
(Diomedea chrysostoma) and Wandering Alba- 
trosses (D, exulans), chicks must grow twice the 
number of primaries in half the time available 
to adults. Existing studies suggest that comple- 
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ion of wing growth, as measured by length of 
longest primary, by petrels (Procellariiformes) 
prior to fledging is accorded high prionty. Harris 
(1966) showed that both light and heavy Manx 
Shearwaters (Puffinus puffinus) had similarly de- 
veloped wings on fledging, demonstrating that 
there was a premium on being able to fly even 
if poorly fed during development. Development 
of albatross chicks also suggests that wing 
growth is largely independent of body mass. 
Wing growth in Gray-headed and Black-browed 
Albatrosses (D. melanophris) was less affected 
by body mass than growth of internal organs 
(Reid et al., in press), and also was less suscep- 
tible to changes in diet quality (Prince and Rick- 
etts 1981). Lequette and Weimerskirch (1990) 
showed that despite differences in growth and 
provisioning rates, all Wandering Albatross 
chicks, of both sexes, have similar sized wings 
on fledging. 

Petrels are amongst the most pelagic of all 
seabirds, covering vast areas of ocean in search 
of food (Warham 1996), so fully developed 
wings are important to forage successfully. After 
fledging, at the start of the austral summer after 
a chick rearing period of around 277 days, Wan- 
dering Albatross chicks do not start to replace 
flight feathers until their second winter, and it 
may be five years for males and six years for 
females before all juvenile primarics are re- 
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placed for the first time (Prince ct al. 1997). 
Thus, the feathers they fledge with must sustain 
them through the cnilical period after Hedging. 

In this study we cxaminc the rate and pattern 
of growth of individual primaries in the Wan- 
dering Albatross. We also investigale the inter 
action between the rate of feather growth and 
the fledging period and mass change over this 
period. 


METHODS 


This study was carried out at Bird [sland, South 
Georgia (54°01'S, 38°03'W) during the 1996 
austral winter. Wing Icngth from carpal joint to 
the tip of the flattened, straightened wing (Spen- 
cer 1984} was measured, using a stopped rule 
(one which had a raised perpendicular end to 
which the carpal could be abutted), every 15 
days, from when the chick was 10 days old until 
it fledged. The study nests were visited every 
day throughout chick rearing, and hatching and 
fledging dates recorded to the nearest day. A re- 
cord was kept when wings were measured of 
when emerging primaries were first observed. 
Fach of the four outer primaries (p 7-10) was 
measured from the tip to the carpal joint, to the 
nearest mm, every 5 days from 205 days of age 
until fledging. Overall wing length 1s defined as 
the length of the bent wing using the carpe] bend 
to the tip of the longest primary. To provide a 
potential index of body condition, chicks were 
weighed at 145 and 250 days of age using an 
electronic weighing platform accurate to + 10 
g. Chicks were sexed by discriminant function 
analysis, using bill length (mm) and bill depth 
(mm) at 260 days old, with the regression: 0.205 
length + 0,609 depth — 54.610. If the result was 
positive, the sex was male, if negative, female. 
This regression gave 100% discrimination as 
verified by all birds returning to breed on the 
study ridge (Huin, unpubl. data, 7 > 500). Of 
the 64 chicks in the study, 29 were male and 31 
female; 4 were of unknown sex as they had 
fledged before bill dimensions were measured. 


GROWTH MODELS 


Various growth models from the Richards family 
of growth models (Brown and Rothery 1993) 
were fitted to the data on feather growth from 
individual chicks. Gompertz and von Bertalanffy 
growth models fitted the data fairly well, ac- 
counting for over 90% of the variance. However, 
these curves gave unrealistic estimates of max- 
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imum primary Icngth because the growth of p10 
und to a lesser extent p9 nght up to fledging 
meant that, for some chicks, asymptotic growth 
was not attained within the time span of the cb- 
servations. We obtained better results using the 
generalized model of Schnute (1981) which in- 
cludes the Richards family, the Gompertz and 
the von Bertalanffy as special cases. This model 
is non-negative and a non-decreasing function of 
length, y, with time, t, and is given by: 


Ly 


—e* 


l 
y = (yi? + Cy” — yD) oe, 
l-e 








where t, = 205 days, t, = age at last measure- 
ment, y, = fitted feather length at t,, y, = fitted 
leather length at (,. and a and b are parameters 
which describe the shape of the curve and are 
estimated by maximum likelihood from the data. 
This model fitted the primary growth data better 
than any of the alternatives we cxplored, ac- 
counting for more than 98% of variance for all 
feathers. For these wing growth data the fitted 
curves all had parameter estimates where a > 0 
and b < 0, implying that the model approxi- 
mates to a Richards curve (Keen 1993). 

Wing length was measured from 205 days of 
age which corresponded to a penod of rapid, 
approximately linear growth (Fig. 1) from an 
initial length of around 75% of the final wing 
length. However, there was large variation be- 
tween individual chicks in their position along 
the growth curve at this age, with some of the 
more advanced chicks having already reached 
asymptotic lengths for primaries 7 and 8. In or 
der to mode] the pattern of feather growth more 
accurately, a subsample of 28 chicks was se- 
lected (13 males, 15 females} whose growth 
curves were measured sufficiently carly to in- 
clude the period of rapid lincar growth. These 
chicks did not differ significantly in mass at 145 
or 250 days of age from those chicks omitted 
from this subsample (two-sample “tests, fj5,.4, = 
1.50-0.27. P > 0.10) and therefore we consider 
them to be representative of the sampled popu- 
lation. 

Growth models and correlation analyses were 
carried out using the statistical package GEN- 
STAT, and ail means are presented + SE unless 
otherwise stated. All t-tests were two-tailed, and 
a Mann-Whitney U-test was uscd on nonpara- 
metric data. Two-way ANOVAs were used to 
test for differences between sexes. 
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RESULTS 


WING GROWTH AND EMERGENCE OF 
PRIMARIES 

Wings grew slowly during the first 145 days at 
a mean rate of 1.4 mm day ', increasing to 3.9 
mm day"! between 145 and 250 days of age, 
after which the growth rate declined to 0,9 mm 
day~'’ to fledging at around 280 days (Fig. 1). 
Primaries first emerged at 127 days, and there 
was no significant difference between male and 
female chicks (Mann-Whitney U-test, U = 789, 
P > 0.7) (Table 1). There was a wide range in 
age at fledging, with an average of 277 days. 
Male chicks fledged at 277 days old and female 
chicks at 281 days (Table 1), but these dilfer- 
ences were not significant (Mann-Whitney U- 
test, U = 779, P = 0.10). The average time from 
emergence of primaries to fledging was very 
similar between sexes (Table 1). Age at emer- 
gence of primaries is only accurate to within 15 
days. Fifteen of the 62 chicks monitored (24%) 


grew their full set of wings in under 150 days 
from the start of primary growth, but only 1 
chick in under 135 days. 


GROWTH RATES OF PRIMARIES 


Primaries were already increasing rapidly in 
length when measurements began; later growth 
rate declined and feathers eventually reached 
their maximum length (Fig. 2). The asymptote 
was reached first by p? and was reached pro- 
gressively later by each primary with p10 con- 
tinuing to grow right up to fledging. At fledging, 
p10 was the longest primary followed by p9, 
then p&, and finally p? which reached around 
83% of the asymptotic length of plO (Fig. 2A 
and 2B). 

The age at which the maximum growth rate 
was attained increased from p7 to p10, indicat- 
ing that feathers were not all growing at their 
maximum rate simultaneously (Table 2). There 
was no significant difference in the age that 


TABLE 1. Mean + SD (range) age im days ut first emergence of primaries and at fledging for male and female 


Wandering Albatross chicks. 


Males FF 

Age when emergence 126.7 + 13.5 27 
first recorded + 15 days (115-60) 

Age at ficdging + | day 276.8 + 12.4 29 
(262-301) 

Time for primary growth 150.8 + 14.5 af 
(days) (129-186) 


Females 


All 


i 


128.1 = 16.1 31 126.6 + 14.9 62 
(115-175) (115-175) 

280.7 + 11.) 3] 277i = 120 64 
(265-309) (255-309) 

152.6 = 11.7 31 ISi.6-2 11 62 
(133-185) (129-186) 
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FIGURE 2, Typical feather growth curves for (A) male ¢ 


B) female Wandering Albatross chicks using the 
Schnute model. (x = plO, 0 = p9, + — p8, * = p7), 


maximum growth rate was attained, maximum nor between sexes (Fi 5, = 0.45-0.98, P > 0.20). 
growth rate or relative growth rate between each Primary nine had the greatest maximum growth 
adjacent primary for either sex (male: F,, = rate of any feather (4.7-4.8 mm day~'), which 
0.21-1.67, female: F,,, = 0.21-1.45, P > 0.05), may be explained by its relatively long length at 
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& ficdging (96% of p10) but the shorter time dur- 
3 ae ing which this feather is growing. The maximum 
= aZ Ss growth rale of p? may have been greater prior 
s ma - a OS to the beginning of the measurements as p7 was 
© e il = i “ ie i closer to its asymptotic value than the other pri- 
a! nm 2 = 5 maries when measurements began (Fig. 2A and 
4 —s 2B). Relative growth rate was consistent across 
§ primaries (males: 0.009 day-', females: 0.010 
a day~') and the differences between sexes were 
. not significant (Fig. 3; Table 3). Primary 10 was 
a om the longest primary on fledging and was still 
— - 2 88 growing at 0.5 mm day~' with a relative growth 
3 : ori ee [| of 0.0008 day |. Primaries seven and eight had 
2 _ mt On tH stopped growing and po was growing slowly at 
a3 aa ey Ss 0.1 mm day~'. Each primary was significantly 
$5 SS smaller (F\ >, = 5.08—7.60 P < 0.05) for female 
oe chicks on fledging when compared to male 
v 5 chicks, but growth rates (F,,, = 0.01-0.76, P > 
38 0.30) and relative growth rates (F,,, = 0.02- 
Py Oa 0.69, P > 0.40) were not. 

= 39 88 

. . . min OG OS PATTERN OF GROWTH 

. = . eae Bs 7 . im The pattern of growth of primaries 7-10 is 
35 ae yo 3 = shown in Figure 4. For each age, measurements 
E'S of were categorized according to the length of each 
—S primary. There is a clear trend in the pattern of 
2 ¢ growth of primaries from 8 > 7 > 9 > 10 (35% 
z B of birds) or 8 > 9 > 7 > 10 (29%) at 205 days 
zy ae ae of age through 9 > 8 > 10 > 7 (34%) or 9 > 
Eh e. 8a SS 10 > 8 > 7 (37%) at around 245 days of age 
E =e Weta Hat 4 before reaching 10 > 9 > 8 > 7 at least 10 days 
Ee ex tx se before fledging (Fig. 4). There was some varia- 
E = NN +t RS tion between individual chicks in how this pat- 
28 oo tern was attained, but for 61 of the 64 chicks 
a the growth followed the sequence: 7 > 8 > 9 > 
ZY Wt 8 >7>9>10t08 >9>7> 10, then 
33 predominately either 9 > 8 > 7 > 10 (25%) or 
23 i, oe eo a 8 > 9 > 10 > 7 (34%), all chicks then continu- 
ec @lee as 35 ing9>8>10>709>10>8>7 and 
ae 24 26 20 finally 10 > 9 > 8 > 7. From this two main 
‘bb growth patterns are apparent: (1) p9 > p8 before 
i pl0Q > p? or (2) plO > p7 before p9 > p8. 
“a5 

23 2 MASS OF CHICKS AND GROWTH OF 

= 2 - PRIMARIES 

ob z EP Correlations between the linear scaling measure, 
"ae i | b 2 cube root of chick mass at 145 days and 250 
S§2¢ |8|/& & 5 days, and growth rates and timing were calcu- 
22 | 3 & & 2 lated and corrected for multiple comparisons us- 
cio Ss ¢ & ing the Bonferroni method (Sokal and Rohlf 
ul © zs & & 1995). There was a significant correlation be- 
r £ v & = tween mass at 145 days and maxnnum growth 
<< <¢ a, pe 


rate [or p7 (* = 0.27, P < 0.01) but not for p8, 
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FIGURE 3. Typical relative growth rates (day ') for primarics p7—p10 for (A) male and (B) female Wandering 
Albatross chicks. (x — p10, 0 = p9, + = ps, * = p7). 
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TABLE 3. Mean + SD asymptotic icngth (Y,; mm), growth rate (mm day '), and relative growth rate X 10° 
(day ') of primaries al fledging for male (2 = 13) and female (n = 15) Wandering Albatross chicks, 


Parameter Sex plo 
Y; Male §92.2 + 16.7 
Female Gris = Lid 
Growth rate Male 0.52 + 0.54 
Female 0.59 — 0.60 
Relative growth rate Male 0.76 = 0.79 
Female 0.87 ~ 0.89 


p9, or plQ. There was no such correlation be- 
tween mass and relative growth rate or for any 
other parameter and mass at 250 days of age. 
There also was a significant correlation betwecn 
chick mass at 145 days and age at fledging (7° 
= 0.19, P < 0.01) but not between mass at 250 
days old and age at fledging (@- = 0.03, P > 
0.05). Maximum growth rates of p7 and p8 also 
were significantly correlated to mass at 145 
days, but not p9 or plO. As p? and p8 reach 
Inaximum growth rates at a younger age than p9 
and p10, the mass at 145 days is more likely to 
influence these rates compared with primaries 
which grow later. The heavier the chick at 145 
days old, the younger the chick will tend to 
fledge. The correlation between age at which 
primaries emerged and mass al 145 days also 
was significant (7? = 0.19, P < 0.01), with pri- 
marics cMerging at a younger age in heavier 
chicks. 


DISCUSSION 


There was considerable variation in the age, be- 
tween 115 and 175 days old, when primaries 









p? ps p? 
668.0 + 15.5 627.0 + 144 $75.9 + 12.7 
653.5 + 14.2 612.9 + 14.4 561.8 + 14.2 
0.10 + 0.12 0.02 + 0.02 0,009 + 0,010 
O11 + 0.12 0.02 + 0.03 0,006 + 0.010 
0.15 + 0.18 0.03 + 0.03 0.016 + 0.006 
0.16 + 0.19 0.03 + 0.05 9.011 + 0,000 


first emerged, which was correlated with the 
mass of the chick at the start of this period. Once 
primaries started to grow, it took an average of 
151-153 days for full development, around one 
half of the chick rearing period, and chicks did 
not ficdge until the growth of plO had reached 
its asymptote. Lighter chicks, whose primaries 
emerged later than heavier chicks, fledged pro- 
portionalely later, but not before complcting the 
full period required for wing and primary 
growth. Feather emergence dala were collected 
only every [5 days and only one bird fledged 
less than 150 + 15 days after emergence of pri- 
maries. The actual date of ficdging will depend 
on a number of other factors such as wind con- 
ditions and bady mass, but most chicks fledged 
as soon as their wings were fully developed, 
suggesting this may have been an important lim- 
iting factor. There was, however, no significant 
difference in fledging age between male and fe- 
male chicks, although the maximum length of 
female primarics was significantly smaller than 
males. Feathers emerged at the same age, and 
growth rates were similar. [f time needed to fully 
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FIGURE 4. Pattern of primary growth of Wandering Albatross chicks. (7 > 8 > 9 means p7 is longer than 


ps8, which is longer than p9). 


develop wing feathers limits the age at which 
chicks can fledge, we would have expected fe- 
male chicks to fledge around 4 days younger 
(using mean wing length and maximum growth 
raies) than males as they have less feather to 
grow. In fact, they fledged when 4 days older. 
Growth curves were simular lor both sexes, 
which suggests that other factors, operating dif- 
ferentially on cach sex, may influcnce fledging 
date. 


GROWTH RATES 


Wing and primary growth in albatross chicks 
followed a sigmoid curve, typical of logistic- 
type growth curves (Brown and Rothery 1993). 
The rate of wing growth was slow at the begin- 
ning of chick development reaching a linear in- 
crease about half way (145 days) through the 
chick rearing period (~ 280 days). Reid et al. 
{in press) suggested that this is when albatross 
chicks preferentially allocate incoming resources 
to growing flight feathers. Proce}lariiformes 
have relatively long wings and flight feathers for 
birds (Adams et al. 1996), but Wandering Al- 
batross growth rates (4.4-4.8 mm day ! for a 
wing of overall length 655-718 mm) are rela- 
tively low compared to birds of comparable size, 
for example: Mute Swan (Cygnus olor) with a 
wing length of 535-628 mm have growth rates 
$.3-9.0 mm day '; Siberian Crane (Grus leu- 
cogeranus) wing length 538-625 mm, growth 
rate 9.0 mm day~'’; Lammergcier (Gypaetus bar- 
batus) wing length 695-756 mm, growth rate 
6.6 mm day '; Wattled Crane (Bugeranus ca- 
runcilatus) wing length 613—717 mm, growth 
rate 9.0—13.0 mm day~! (Prevost 1983). The rel- 
atively low growth rates for Wandering Alba- 
trosses arc most likely duc to chicks recciving 
only infrequent meals (about one every 3 days, 
with occasional fasts of [0 or more days; Tickell 
1968) of squid and some fish (Rodhousc ct al. 
1987, Croxall et al. 1988) which are of relatively 
low energy content. 

There was no difference in the maximum 
growth rate of each primary which grew at 
around 4.5 mm day~'. This rate falls in the mid- 
dle of the ranges presented by Prevost (1983) 
who reviewed growth rates in a variety of spe- 
cics. Species with long wings rarely had growth 
rates much above 10 mm day ', whereas small 
specics such as the White Wagtail (Motacilla 
alba) have growth rates of between 4 and 5 mm 
day '. Langston and Rohwer (1996) consistently 
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recorded growth rates in molting adult Laysan 
Albatrosses (Diomedea tmmutabilis) of around 5 
mm day ' which, in their opimion, suggested that 
feather growth was limited physiologically. This 
conclusion was based on the examination of 
growth bands laid down in growing feathers, 
wilh the assumption that two light-dark bands 
were laid down each 24-hr period. Growth rates 
reported here are consistent with their findings, 
as maximum growth rates of all four primaries 
measured were similar despite differences in as- 
ympltotic feather length. Feather growth rates of 
developing albatross chicks also are of the same 
magnitude as molting adults, which further sug- 
gests physiological limitations and supports the 
assumptions of Langston and Rohwer (1996) re- 
garding growth bands. 

Wandering Albatrosses have 10 primaries and 
32 secondarics in each wing. Using mean feather 
lengths presented here and assuming lengths of 
500 mim for the six inner primaries and 150 mm 
for secondaries, chicks have to grow around 
20.6 m of feathers before fledging. This growth 
is completed in about 150 days. Thus albatross 
chicks are growing between 137 mm and 138 
mm of flight feathers per day during the second 
half of development. 


PATTERN OF GROWTH 


This study shows that primarics do not all grow 
at the same rate at the same time. They start 
growing in a particular sequence, have overlap- 
ping periods of linear growth, and eventually 
reach their asymptote at different times. The in- 
nemmost primarics grow first, while the outer- 
most continue to grow right up to Hedging. The 
pattern of growth is very consistent and on 
fledging all feather lengths were in the order 10 
> 9 > & > 7. In some cases, this order was not 
achieved until 10 days before fledging, and 
around 50% of birds were only 30 days olf 
fledging before p10 became the longest primary. 
Wandering Albatrosses, unlike the smaller al- 
batross species which stay in their nest through- 
out development, are highly mobile at an early 
aye, an adaptation to breeding during the winter 
in order to avoid drifting snow which may cover 
their nests (Tickell and Pinder 1972). Young al- 
batrosses also spend considcrable time cxercis- 
ing their wings to develop their pectoral muscles 
pnior to their first flight, During these penods of 
travel and pre-fledging exercise, chicks may 
abrade their wing feathers on contact with the 
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ground. The outermost primaries receive more 
wear than other flight feathers (Langston and 
Rohwer 1995) and all albatrosses so far studied 
pnorilize the replacement of the outermost pri- 
maries (Harris 1973, Furness 1988, Prince et al. 
1993). Given that Wandering Albatrosses retain 
their three outer primaries for 3 years and the 
other prumarics for 4 years after fledging (Prince 
et al, 1997), the observed pattern of growth 
seems likely to be an adaptation to protect thesc 
outermost primaries from damage prior to fledg- 
ng. 
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Anatomy and Histochemistry of Spread-Wing Posture in 
Birds. 3. Immunohistochemistry of Flight Muscles and 
the “Shoulder Lock” in Albatrosses 


Ron A. Meyers* and Eric F. Stakebake 


Department of Zoology, Weber State University, Ogden, Utah 84408-2505 


ABSTRACT Asa postural behavior, gliding and soaring 
flight in birds requires less energy than flapping flight. 
Slow tonic and slow twitch muscle fibers are specialized 
for sustained contraction with high fatigue resistance and 
are typically found in muscles associated with posture. 
Albatrosses are the elite of avian gliders; as such, we 
wanted to learn how their musculoskeletal system enables 
them to maintain spread-wing posture for prolonged glid- 
ing bouts. We used dissection and immunohistochemistry 
to evaluate muscle function for gliding flight in Laysan 
and Black-footed Albatrosses. Albatrosses possess a lock- 
ing mechanism at the shoulder composed of a tendinous 
sheet that extends from origin to insertion throughout the 
length of the deep layer of the pectoralis muscle. This 
fascial “strut” passively maintains horizontal wing orien- 
tation during gliding and soaring flight. A number of mus- 
cles, which likely facilitate gliding posture, are composed 
exclusively of slow fibers. These include Mm. coracobra- 
chialis cranialis, extensor metacarpi radialis dorsalis, and 
deep pectoralis. In addition, a number of other muscles, 
including triceps scapularis, triceps humeralis, supracora- 
coideus, and extensor metacarpi radialis ventralis, were 
found to have populations of slow fibers. We believe that 
this extensive suite of uniformly slow muscles is associ- 
ated with sustained gliding and is unique to birds that 
glide and soar for extended periods. These findings sug- 
gest that albatrosses utilize a combination of slow muscle 
fibers and a rigid limiting tendon for maintaining a pro- 
longed, gliding posture. J. Morphol. 263:12—29, 2005. 
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Compared to flapping flight, gliding and soaring 
flight has been shown to be more efficient in a number 
of ways. First, measurements of oxygen consumption 
have shown that gliding is only about twice as costly as 
resting (Baudinette and Schmidt-Nielsen, 1974), 
whereas flapping flight incurs a cost of about seven 
times that of resting (Tucker, 1972). Second, the heart 
rate of gliding pelicans was found to be about 150 beats 
per minute (BPM), compared with 190 BPM for birds 
flapping at 50 m (Weimerskirch et al., 2001). Third, 
Goldspink et al. (1978) and Meyers (1993) showed that 
gliding flight in Herring Gulls (Larus argentatus) and 
American Kestrels (Falco sparverius), respectively, re- 
quires less muscle activity than flapping flight as de- 
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termined by electromyography. Gliding is a more 
energy-efficient form of locomotion than flapping flight 
because fewer muscle fibers are required to be active. 

Gliding and soaring flight are static forms of 
locomotion in which the wings are held stationary 
in a horizontal position, while movement of the 
atmosphere provides much of the necessary en- 
ergy (Norberg, 1985; Pennycuick, 1989). The glid- 
ing or soaring bird must be able to maintain its 
outstretched wings and resist the force of air from 
below and in front, as well as support its body 
mass. The use of static, isometric contractions are 
better suited for such postural roles, due to the 
longer actin—myosin interaction and the reduced 
number of ATP molecules needed per second for 
repriming the cross-bridges. Thus, slow muscle 
fibers are more efficient for such postural activi- 
ties (Goldspink, 1980, 1981). 

A correlation between avian muscle fiber histo- 
chemistry and muscle function has been described 
previously (e.g., Simpson, 1979; Maier, 1988; 
Rosser and George, 1985; Welsford et al., 1991; 
Meyers, 1992a, 1993; Sokoloff et al., 1998), in that 
muscles thought to be used for activities such as 
locomotion have greater proportions of fast-twitch 
fibers, and muscles thought to have a postural role 
have higher proportions of slow fibers. Extensive 
studies of various birds (Rosser and George, 
1986a,b; Rosser et al., 1994), indicated that a 
number of taxa possess slow-contracting muscle 
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fibers presumed to function in posture, gliding 
flight, or underwater swimming. 

Avian slow muscle fibers can be classified by mor- 
phological, physiological, or biochemical criteria as 
slow tonic (ST) or slow-twitch oxidative (SO; see 
Goldspink, 1981; Hikida, 1987; Rosser et al., 1987; 
Williams and Dhoot, 1992). With respect to other 
physiological and morphological characteristics, ST 
fibers fail to respond to single nerve impulses, slowly 
shorten when stimulated repetitively (Morgan and 
Proske, 1984), and have multiple “en grappe” nerve 
terminals (Hess, 1967), whereas slow-twitch (and 
fast-twitch) muscle fibers generate an action poten- 
tial and exhibit a rapid rise in tension after nerve 
stimulation (Morgan and Proske, 1984; Torrella et 
al., 1993). Histochemically, avian SO fibers react 
like mammalian SO fibers, but avian SO fibers, like 
slow tonic fibers, are multiply innervated (Baier and 
Gatesy, 2000). Recent work on the biochemistry of 
avian myosin (Bandman and Rosser, 2000) has 
shown that birds possess nine myosin heavy chains 
(MyHC): five fast and four slow/cardiac. Two slow 
MyHCs, MyHC1 and MyHC2, have been found in 
avian slow muscles (typically studied are the 
chicken M. latissimus dorsi pars anterior and the 
“red-strip”’ of the pectoralis), with most mature fi- 
bers expressing MyHC2 only (Bandman and Rosser, 
2000). The deep layer of the pelican’s pectoralis ex- 
hibits a slow myosin isoform (SM) similar to that of 
the chicken latissimus dorsi (Rosser et al., 1994; 
Bandman and Rosser, 2000). These slow myosins do 
not appear to be homologous with those in mammals 
(Bandman and Rosser, 2000). 

Albatrosses are elite avian gliders. These birds, 
some with wing spans of up to 12 feet, spend a great 
amount of time gliding on the winds of the oceans, 
presumably for months (Goldspink et al., 1978). 
They are considered to be among the most econom- 
ical energy users among flying birds (Costa and 
Prince, 1987), possessing a metabolic cost during 
soaring (plus take-off and landing) of only about 
three times their basal metabolic rate (BMR). In 
comparison, Sooty Terns and Ring-billed Gulls show 
increases of 4.8 and 7.5 times their BMR when flying 
(Costa and Prince, 1987). Clearly, this relates to the 
amount of time albatrosses spend soaring in com- 
parison to the flapping flight of these other species. 
Further, Adams et al. (1986) found that the energy 
used by Wandering Albatrosses was 1.83 times 
BMR, the lowest value measured among breeding 
birds. This economy is related to the soaring lifestyle 
of albatrosses and to the anatomical specializations 
they possess. 


'The “red strip” is a well-studied zone of slow tonic muscle fibers in 
the deep part of the pectoralis of chickens. It represents a remnant of 
slow myosins that are more widespread during early pectoralis devel- 
opment (see Matsuda et al., 1983). 


A wide variety of avian taxa (including alba- 
trosses, petrels, pelicans, frigatebirds, cormorants, 
storks, cranes, cathartid vultures, and various oth- 
ers) possess a deep layer of the pectoralis muscle 
(the deep pectoralis; see Meyers and Mathias, 1997, 
and references therein). It has been suggested that 
the deep layer of the pectoralis should be a slow 
tonic muscle to aid in gliding (Pennycuick, 1972; 
Meyers, 1993), and Pennycuick (1972) calculated 
that the energy used by the deep pectoralis of vul- 
tures is 5.86 kcal/h. The deep pectoralis muscle of 
vultures and pelicans was studied histochemically 
by Rosser and George (1986a) and Rosser et al. 
(1994), respectively, and was found to be composed 
of slow muscle fibers. Rosser and co-workers indi- 
cated that the deep pectoralis is specialized for glid- 
ing and soaring flight and the superficial pectoralis 
is specialized for flapping flight. However, very little 
is known about the fiber types in the pectoralis 
muscles of most soaring birds (Meyers, 1993), and 
nothing is known about the fiber types of their other 
flight muscles. 

Pennycuick (1982) discovered a shoulder locking 
mechanism in albatrosses and giant petrels and pro- 
posed that the deep pectoralis in these birds need 
not be composed of slow fibers. This was contrary to 
his previous prediction (1972; see above) regarding 
the fiber type of the deep pectoralis. He described 
the lock as being made up of a tendinous sheet 
within the superficial pectoralis and suggested that 
because of the lock, the deep pectoralis in alba- 
trosses and giant petrels should be a “sprint” muscle 
(FG/fast-fatigable/white) and presumed it to func- 
tion during high-frequency wing movements ob- 
served during landing. According to Pennycuick, the 
lock restricted movement above the horizontal when 
the wing is moved into a fully protracted position, 
but releases when retracted a few degrees. In addi- 
tion, an elbow lock was reported by Hector (1894). 
This was rejected and replaced by an elbow “fixing” 
mechanism (Joudine, 1955; Yudin, 1957). 

We had two principal goals in this study. First, we 
wanted to quantify the presence of slow muscle fi- 
bers within muscles that we believe are associated 
with gliding and soaring behavior in albatrosses 
using immunohistochemical techniques. Meyers and 
Mathias (1997) predicted that larger gliding birds, 
such as albatrosses, pelicans, and vultures, would 
have more muscle fibers devoted to posture and dis- 
tinct muscles relegated to a postural role. Second, 
we sought to describe the locking mechanism of the 
shoulder by anatomical investigation. Preliminary 
results of this work were presented in Stakebake 
and Meyers (2000). 


The articles cited as Joudine (1955) and Yudin (1957), written in 
French and Russian, respectively, are in fact by the same person. Both 
articles share identical figures and were written at the Zoological 
Institute in Leningrad. 
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Fig. 1. Diomedea immutabilis. Serial sections of M. extensor 
metacarpi radialis pars ventralis to illustrate ALD 58 and MY32 
antibody reactions in the Laysan Albatross. A positive reaction 
for the anti-slow ALD58 antibody (top) results in red-stained 
fibers (asterisk), whereas a negative reaction for a homologous 
fiber results in no color with the anti-fast MY32 antibody (bot- 
tom). Note that there is no overlap of reaction with these two 
antibodies. 


MATERIALS AND METHODS 


Two Laysan (Diomedea immutabilis) and two Black-footed (D. 
nigripes) Albatrosses were dissected to study muscle attach- 
ments, muscle fiber types, and biomechanical functions. Fresh 
specimens salvaged by the United States Fish and Wildlife Ser- 
vice (USFWS), were received and stored in freezers prior to im- 
munohistochemical and anatomical examination. Muscle fiber 
types were assayed using the histochemical protocols of Herman- 
son and Cobb (1992). After thawing, muscles on one side of the 
birds were removed, cut perpendicularly to fascicle orientation at 
mid-belly, and mounted on cork blocks with 5% gum tragacanth. 
The muscles were then frozen in 2-methylbutane, cooled with 
liquid nitrogen to approximately —150°C, and stored at —70°C 
prior to sectioning at 12 wm in a cryostat (Tissue-Tek II, 
Microtome/Cryostat, models 4551 and 45538, respectively) set at 
—20° C. Slow fibers were analyzed by reaction with ALD 58 
(University of lowa, Hybridoma Bank), an antibody that labels 
both slow-twitch and slow-tonic fibers in mammals (Han et al., 
1999) as well as birds (Meyers, 1997). Because ALD 58 reacts 
with both slow-twitch and slow-tonic bird muscle fibers, no dis- 
crimination between the two was possible. Fast fibers were iden- 
tified with MY 32 (Sigma Chemical, St. Louis, MO), an antibody 
that labels fast-twitch fibers in mammals and birds. Figure 1 
illustrates the typical reactivity observed with both antibodies for 
fast and slow fibers. Samples were reacted against the antibodies 
in a humidified chamber at 4°C for 16-18 h (ALD 58) or at 25°C 
for 2 h (MY 382). 





After rinsing in phosphate-buffered saline, samples were incu- 
bated in goat antimouse antibody and stained with streptavidin 
peroxidase system (SPS kit; Zymed Labs, San Francisco, CA) 
(Meyers and Mathias, 1997). Overlapping black and white pho- 
tographs (Olympus PM 10AD microphotography system or Nikon 
Coolpix 995 Digital camera with ocular adapter on an Olympus 
BH-2 compound microscope at 40 or 100) of prepared muscle 
cross-sections were taped together to create a montage of whole 
muscles or whole sample sections of muscles, and fiber types 
within each muscle or muscle sample were counted. The percent- 
age of fiber types (fast vs. slow) of whole muscle sections (six 
muscles) or samples of larger muscles (four muscles), and the 
standard deviations were calculated (see Table 1). Muscle cross- 
sectional areas were estimated by tracing the muscle shapes from 
glass slides and transferring the outline to graph paper. Muscle 
outlines were cut out and weighed and compared to the mass of a 
square cm of the graph paper. Simpson’s Rule (Thomas and 
Finney, 1996) was used to verify the calculations. Muscle areas 
(see Table 2) are provided to illustrate the range of muscle size 
and to account for the resulting variability of muscle fiber num- 
bers. Since Laysan and Black-footed Albatrosses are closely re- 
lated (Kuroda et al., 1990; Robertson and Nunn, 1998), we pooled 
fiber numbers from both species when calculating means. 

Muscles were studied and examined with regard to their anat- 
omy and actions and divided into the following functional cate- 
gories. 


Body Support 


This refers to postural musculature that prevents undesirable 
wing elevation and also functions to keep the body from “falling 
through the wings” (Meyers, 1993). Here, M. pectoralis, superficial 
layer (SP), and deep layer (DP), as well as the locking mechanism 
associated with the pectoralis, were examined from four individuals. 
Because of the size of DP and SP, samples were taken from through- 
out the bellies of these muscles. The locking mechanism of the 
shoulder was studied by wing manipulation of fresh-frozen-thawed 
specimens and by dissection. The object was to identify the compo- 
nents of the lock, the sites of muscle and tendon attachment, and 
joint mechanics associated with the shoulder. 


Wing Elevation 


Wing elevation is critical to flight. Like the wings of an aircraft, 
bird wings also need to be held horizontally during gliding flight. 
In birds, this position is primarily maintained by atmospheric 
movement and muscles that elevate and hold the wings at the 
horizontal. Meyers (1993) reported activity of M. supracoracoi- 
deus (SC) during gliding flight in kestrels. The present study 
examined Mm. supracoracoideus (SC) and deltoideus major (DM) 
as the principle wing elevators from three birds. Whole cross- 
sections were taken of both heads of DM, in addition to samples 
from all three heads of SC. 


Wing Protraction 


These are muscles responsible for pulling the wing cranially 
and resisting forces caused by the air pushing against the leading 
edge of the wing. We studied Mm. coracobrachialis cranialis 
(CBC) and the cranial edge fascicles of the superficial layer of M. 
pectoralis (PCr). Meyers (1993) reported activity of the cranial 
edge of the pectoralis during gliding flight in kestrels. Due to its 
mechanically advantageous position, it is a protractor of the wing 
(Fisher, 1946; Stegmann, 1964; Meyers, 1993). Whole cross- 
sections of CBC from four individuals were used; tissue blocks 
averaging 48 mm” were removed from the cranial border of the 
superficial layer of the pectoralis from three birds (see Table 2). 


Wing Extension 


These muscles extend or straighten the wing at the elbow and 
wrist joints, and also participate in the automatic extension- 
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TABLE 1. Total fiber numbers, mean number of slow fibers, and relative slow fiber percentages, for nine muscles from Laysan and 
Black-footed Albatrosses, Diomedea immutabilis and D. nigripes 


# of Fibers 
—_—  Mean#slow %Slowfibers Mean % Slow fibers 
Muscle Bird* slow fast fibers + SD% fibers + SD% + SD% 
M. pectoralis, deep layer 1 N/A 0 — 100 100 
2 N/A 0 — 100 
3 N/A 0 — 100 
4 N/A 0 — 100 
M. supracoracoideus, coracoid head 2 33829 22123 2985 + 691 13 14+1 
3 2020 11152 15 
4 3605 22887 14 
M. deltoideus major, pars cranialis: 2 6386 13253 4511 + 4332 5 18 = 11 
3 2339 24570 9 
4 10559 + =30742 26 
M. pectoralis, cranial fascicles 2 5688 10735 £13806 + 7189 30 38 + 10 
3 23165 23926 49 
4 12564 30107 29 
M. triceps humeralis 4 5966 N/A 
M. triceps scapularis 1 4538 N/A 
4 21552 N/A 
M. extensor metacarpi radialis, pars ventralis: 1 7939 11624 T4427+ 3479 41 40 + 13 
2 6111 23585 21 
3 3061 2938 51 
4 12657 14906 46 
M. extensor metacarpi radialis, pars dorsalis: 1 3614 0 2937 + 971 100 99 + 2.0 
2 3135 97 98 
3 3702 0 100 
4 1298 0 100 


*Birds 1 and 2 are D. immutabilis, 3 and 4 are D. nigripes. 


flexion mechanism of the avian wing (Fisher, 1957; Vazquez, 
1994). We examined whole cross-sections of the two heads of M. 
triceps brachii, the triceps humeralis (TH) and triceps scapularis 
(TS) from three birds. Meyers (1993) reported activity of these 
elbow extensor muscles during gliding flight in kestrels. 

In addition, whole cross-sections of the two wrist extensors, 
Mm. extensor metacarpi radialis pars dorsalis (EMRd) and ex- 
tensor metacarpi radialis pars ventralis (EMRv), were examined 
from four individuals. 


RESULTS 
Body Support 

Anatomy. The superficial layer of the pectoralis 
(SP) originates from the keel and body of the ster- 
num, the furcula, and the sterno-coraco-clavicular 
membrane. A narrow Fascia pectoralis (Meyers, 
1992b) extends along the length of the origin, ad- 
jacent to the keel, and provides additional surface 


for fiber attachment (Fig. 2). The middle one-third 
of the muscle has a combination of tendinous and 
muscular fibers on its surface. The varied fiber 
orientation is well known (see Dial et al., 1988), 
with craniodorsal fascicles oriented transversely 
(to protract the wing), cranioventral fascicles ori- 
ented dorsoventrally (to depress the wing), and 
caudoventral fascicles oriented from caudoven- 
trally to craniodorsally (to retract the wing) 
(Boggs and Dial, 19983). 

The SP inserts by a thick, dense tendon onto the 
base of the deltopectoral crest (Fig. 2), just cranial to 
the insertion of the deep layer (Fig. 3). It also at- 
taches onto the biceps brachii tendon where the 
latter attaches onto the bicipital crest of the hu- 
merus (Figs. 3, 4). Such an attachment of SP to the 


TABLE 2. Muscle areas, for eight muscles studied from Laysan and Black-footed Albatrosses, 
Diomedea immutabilis and D. nigripes 


Muscle 1 


Supracoracoideus, coracoid head 41 
Deltoideus major, pars cranialis N/A 
Pect thoracicus, cranial fascicles N/A 


Coracobrachialis cranialis 7 


Triceps humeralis N/A 
Triceps scapularis 116 
Extensor metacarpi radialis, pars ventralis vd. 


Bird* 

2 38 4 Mean (mm?) + SD 
28 30 33 33.0 + 5.7 
14 Al 59 38.0 + 22.6 
28 60 55 AT.7 = 17.2 

9 10 13 9.8 + 2.5 
56 70 58 61.3 + 7.6 
66 109 68 89.8 + 26.4 
26 10 25 205 22°7.3 
10 10 10 9.8+ 0.5 


Extensor metacarpi radialis, pars dorsalis 9 


*Birds 1 and 2 are D. immutabilis, 3 and 4 are D. nigripes. 
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Fig. 2. Diomedea immutabilis. Lateral view of the right thoracic wall and shoulder girdle of the Laysan Albatross, showing the 
superficial pectoralis and coracobrachialis cranialis muscles. Propatagialis muscles have been removed. The fine dotted line on the 
pectoralis about one-third from the origin represents a transition from muscle fibers to a mixture of muscle and tendon. The line about 
two-thirds from the origin indicates a transition to a fully tendinous region. Similar patterns are also visible on other figures. c, 
coracoid; CBC, M. coracobrachialis cranialis; DMa, M. deltoideus major; DMi, M. deltoideus minor; dpc, deltopectoral crest; F, Fascia 
pectoralis; h, humerus; s, sternum; sc, scapula; SP, M. pectoralis thoracicus, superficial layer, sternobrachialis portion; TB, thoraco- 


brachialis portion of superficial pectoralis. 


biceps tendon is not uncommon in birds (see George 
and Berger, 1966; Meyers, 1992c, 1993). 

The deep layer of the pectoralis (DP) lies be- 
tween SP and the supracoracoideus and is a com- 
plex muscle, divided into two unequal parts by the 
blood vessels passing from the subclavian artery 
to the superficial layer through a foramen in the 
muscle (Fig. 4). The cranial part makes up most of 
the deep layer and originates from the furcula, 
sternal keel, and sterno-coraco-clavicular mem- 
brane. Most of the muscle fascicles of this part 
converge and form a flat tendon that passes deeply 
to the SP and inserts onto the base of the deltopec- 
toral crest, just caudal to the insertion of SP (Figs. 
3, 4). 

The caudal part is oval in shape and arises from 
the caudal aspect of the sternum (Fig. 4). It has a 
tendinous surface and inserts with the biceps ten- 
don onto the bicipital crest of the humerus. At this 


location it is joined by fascicles of the SP that also 
insert onto the biceps tendon. The tendinous inser- 
tions of the two parts of DP are continuous with each 
other (Fig. 4). 

The superficial fascia of DP is relatively thick and 
extends from origin to insertion across the entire 
surface of the muscle. Relatively short muscle fasci- 
cles insert at an angle on an internal tendon that 
parallels the superficial fascia throughout the 
length of the muscle from origin to insertion (Fig. 3). 
This internal tendon and thickened fascia have a 
ligamentous function and make up the “shoulder 
lock,” which passively limits wing elevation above 
the horizontal. 

Immunohistochemistry. All samples of DP 
from four birds reacted against ALD 58 revealed a 
uniform pattern of slow muscle fibers (Fig. 5). In 
contrast, all samples of SP revealed uniform fast 
fibers after reaction against both antibodies (Fig. 5), 
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Fig. 3. Diomedea immutabilis. Ventral surface of right hu- 
merus of the Laysan Albatross, showing muscle attachments onto 
the proximal end. In the avian anatomical position, cranial is to 
the left and lateral to the bottom of the page. bc, bicipital crest; 
CBC, insertion of M. coracobrachialis cranialis; dpc, deltopectoral 
crest; h, humeral head; SC, insertion of M. supracoracoideus; DP, 
insertion of M. pectoralis, deep layer; SP, insertion of M. pecto- 
ralis thoracicus, superficial layer; td, tuberculum dorsale. 


with the exception of the cranial border of SP, dis- 
cussed below. 


Wing Elevation 

Anatomy. The M. supracoracoideus (SC) has 
three distinct heads originating from the sternum, 
coracoid, and furcula (Fig. 6). All three heads con- 
verge to form a thick tendon that inserts onto the 
tuberculum dorsalis along the deltopectoral crest of 
the humerus (Fig. 3). A fourth “deep” part of SC was 
described and illustrated in Diomedea nigripes by 
Kuroda (1960), who also cited its existence from 
Forbes (1882). Subsequent articles (Kuroda, 1961; 
George and Berger, 1966; McKitrick, 1991) refer toa 


deep layer or four-part SC in albatrosses, citing each 
other or Kuroda (1960). Vanden Berge and Zweers 
(1993) identified this deep layer, which arises from 
the sterno-coraco-clavicular membrane, as being the 
ventral head of M. deltoideus minor (see Fig. 6). This 
was supported by Kovacs and Meyers (2000), who 
noted that this deep muscle in the puffin Fratercula 
arctica is innervated by the axillary nerve and is 
more properly considered a part of M. deltoideus 
minor. 

The M. deltoideus major (DM) extends from the 
cranial scapula to the dorsal surface of the pectoral 
crest and adjacent proximal shaft of the humerus 
(Fig. 7). A larger cranial head makes up most of the 
muscle; the caudal head lies along the caudal border 
of the latter. A typical scapular anchor attaches to 
the DM (Meyers, 1992c; Fig. 7). Both SC and DM act 
to elevate the wing. 

Immunohistochemistry. All three heads of SC 
as well as both of DM possessed numerous, evenly 
distributed populations of slow fibers throughout 
these muscles (Fig. 8). The coracoid head was arbi- 
trarily selected for fiber quantification and was 
found to possess 14 + 1% slow fibers (Table 1). The 
cranial head of DM was chosen likewise and pos- 
sessed 13 + 11% slow fibers. This is the first known, 
documented description of slow fibers in SC or DM 
in any bird. 


Wing Protraction 

Anatomy. The cranial edge of the superficial 
layer of the pectoralis (PCr) has fibers that cross the 
cranial aspect of the shoulder joint and are posi- 
tioned to protract the wing (Fig. 2). Species with 
highly bowed furculae (e.g., alcids) typically utilize 
these cranial fibers for wing protraction during flap- 
ping flight (see Stegmann, 1964). 

The M. coracobrachialis cranialis (CBC) lies at the 
deep aspect of the cranial shoulder joint and extends 
from the coracoid to the cranial surface of the hu- 
merus. It is a thin, flat muscle, arising from the 
processus acrocoracoideus of the coracoid, and 
crosses over the cranial surface of the shoulder ar- 
ticulation (Figs. 4, 6). CBC inserts onto the Impres- 
sio coracobrachialis of the humerus, deep to M. pec- 
toralis (Figs. 2, 3). The CBC is overlain by a dense 
sheet of collagenous tissue (Lig. acrocoracohume- 
rale) that extends from the muscle’s origin to the 
distal part of the deltopectoral crest (beneath the 
pectoralis) and also to the bicipital crest (Fig. 6). 

Immunohistochemistry. The PCr had 38 + 10% 
slow fibers. In a muscle sample of 47.7 +17 mm?” 
there were an average of 13,806 slow fibers (Table 
1). These slow fibers were distributed more densely 
along the cranial edge of the superficial pectoralis 
and became more diffuse and less numerous cau- 
doventrally. All samples from birds studied (n = 3) 
revealed a similar pattern of slow fibers (Fig. 8). No 
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Fig.4. Diomedea immutabilis. Lateral view of the right thoracic wall and shoulder girdle of the Laysan Albatross, showing the deep 
pectoralis and coracobrachialis cranialis muscles. Superficial pectoralis and deltoid muscles have been removed. A window cut into the 
deep pectoralis shows its fascicle orientation and the presence of an internal tendinous sheet, which forms the shoulder lock. The SC 
can be seen deep to the deep pectoralis through the window. a, opening through which blood vessels pass from supplying the superficial 
pectoralis—it separates the deep pectoralis into cranial and caudal parts; c, coracoid; CBC, M. coracobrachialis cranialis; DP, M. 
pectoralis, deep layer; dpc, deltopectoral crest; f, furcula; h, humerus; i, area of insertion of the deep pectoralis onto the bicipital 
crest—the superficial pectoralis inserts here as well; s, sternum; SC, M. supracoracoideus, t, internal tendon of deep pectoralis, which 


forms the wing lock. 


other area of the superficial pectoralis possessed 
slow fibers. 

The CBC was found to be uniformly slow (x = 
6,576 + 2,920 fibers, n = 4) (Fig. 8; Table 1). 


Elbow Extension 

Anatomy. The two parts of M. triceps brachii 
extend the wing at the elbow in birds. The humeral 
head (triceps humeralis; TH) originates on the hu- 
merus near the Fossa pneumotricipitalis. It can be 
roughly divided into two parts, with different fasci- 
cle orientations. The lateral part has fascicles that 
extend obliquely from the humeral shaft to the ten- 
don, whereas the medial part extends more in par- 
allel with the tendon. The triceps humeralis inserts 
distally on the olecranon of the ulna (Fig. 9). 

The scapular head (triceps scapularis; TS) arises 
by a narrow, round area on the lateral surface of the 


scapular shaft. An additional attachment, via a 
“scapular anchor” (Retinaculum m. scapulotricipi- 
tis) anchors the cranial edge of the muscle to the 
proximal humeral shaft. At its insertion, TS is 
joined to the antebrachial fascia, and inserts onto 
the Processus cotylaris dorsalis of the ulna, adjacent 
to the olecranon (Fig. 9). 

The tendons of TH and TS are bound together by 
loose connective tissue for most of their lengths. The 
insertions, although distinct, are connected length- 
wise. 

Immunohistochemistry. Both heads of M. tri- 
ceps were composed of both fast and slow fibers (Fig. 
10). In TH the slow fibers were evenly distributed 
throughout the muscle. In TS, the concentration of 
slow fibers was greatest along the caudal end and 
the ventral edge, then diminished to almost no slow 
fibers at the cranialmost end. Due to the large size of 


ALBATROSS SHOULDER LOCK 1g 





pel 


Ste 
[ - al 
3 ie a i | 
ii ‘ i oa ra i wy | r 
kh 1 sss ite 
a Yl ea) Pe 
* ie % as 
“well 7 i f : Fi 
: oT kh —_— 7 - J = —— i 
4 7 ‘ . a . 
i . = = f : 
7 a Se ee ae 
* L 4 7 u 
a 
. Gi 
la ‘ Fs 
‘ & 
= 
f 


Fig. 5. Diomedea immutabilis. ALD 58 antibody reactions in superficial pectoralis (SP) and 
deep pectoralis (DP) muscles of the Laysan Albatross. Note that the superficial pectoralis has a 
negative reaction to the slow antibody and is entirely fast, whereas the deep pectoralis has a 
uniformly positive reaction and is entirely slow. Scale bar = 100 wm. 


the muscle cross-sections (TH: 61 + 7 mm?; TS:90 + 
26 mm”; see Table 2) slow fibers only were quanti- 
fied from three muscles. In one bird (D. nigripes), 
there were 5,966 slow fibers in TS, and 21,552 slow 
fibers in TH. In another (D. immutabilis), TS had 
4,538 slow fibers. 


Wrist Extension 

Anatomy. Wrist extension is facilitated by the 
automatic extension-flexion mechanism of the avian 
wing (Fisher, 1957; Vazquez, 1994) and also through 
action of Mm. extensor metacarpi radialis dorsalis 
(EMRd) and extensor metacarpi radialis ventralis 
(EMRv). Vazquez (1994) suggested that as a two- 
joint muscle, EMR can extend both wrist and elbow. 
EMRd lies dorsal to the pars ventralis, with a fleshy 
origin from the distal surface of the patagial ossicle 
(Fig. 9). It tapers quickly to a long tendon, which 
becomes fused with the tendon of EMRv by about 
half-way down the length of the ulna. 

The EMRv lies deep to the pars dorsalis and orig- 
inates by a robust, round tendon from the ventral 
surface of distal humerus, ventral to the Processus 


supracondylaris dorsalis (Fig. 9). Both EMR muscles 
insert by a common tendon onto the Processus ex- 
tensorius of the proximal carpometacarpus. 

Immunohistochemistry. The EMRv had a 
mean of 40 + 13% slow fibers for four birds (Fig. 10; 
Table 1). 

The EMRd was composed exclusively of slow fi- 
bers in three birds (Fig. 10), whereas one D. immu- 
tabilis had 97% slow fibers (see Table 1). 


DISCUSSION 
Slow Muscle Fibers and Posture 


Slow muscles have been well studied with respect 
to their involvement in postural activities across 
many taxa and musculoskeletal regions (e.g., see 
Armstrong, 1980; Putnam et al., 1980; Mendiola et 
al., 1991; Hermanson and Cobb, 1992; Meyers, 
1992a; Hermanson et al., 1993; Rosser et al., 1994; 
Meyers and Mathias, 1997; Gellman et al., 2002). 
Goldspink (1980, 1981) has indicated that isometric 
contractions are best suited for postural roles and 
that slow muscle fibers are known to be more effi- 
cient in such roles. 
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Fig.6. Diomedea immutabilis. Lateral view of the right pectoral region and shoulder girdle of the Laysan Albatross, showing Mm. 
supracoracoideus, coracobrachialis cranialis, and sternocoracoideus. Deep pectoralis muscle has been removed. ac, acrocoracohumeral 
ligament; i, area of insertion of the deep pectoralis onto the bicipital crest—the superficial pectoralis inserts here as well; c, coracoid; 
CBC, M. coracobrachialis cranialis; DMi, M. deltoideus minor, deep layer; f, furcula; h, humerus; SC, M. supracoracoideus (1 = 
coracoid head; 2 = furcular head; 3 = sternal head); sc, sternocoracoid joint and associated ligaments; scc, sterno-coraco-clavicular 


membrane; s, sternum; StC, M. sternocoracoideus. 


Body Support 


The pectoralis muscle is active during gliding 
flight as shown by electromyography in gulls (Gold- 
spink et al., 1978) and kestrels (Meyers, 1993). The 
predominant fast-twitch fibers of the superficial pec- 
toralis muscle would not be suitable for endurant, 
gliding behavior (but see Meyers and Mathias, 
1997). 

A deep layer of the pectoralis is common to gliding 
and soaring birds. Although it can depress the wing, 
the deep pectoralis is believed to act during gliding 
to support the body between the wings via isometric 
muscular contraction. 

The diversity of avian species with superficial and 
deep layers of the pectoralis has been described by 
Meyers and Mathias (1997; see references therein). 
Knowledge of this division of the pectoralis goes 
back at least to Forbes (1882), who said it was char- 
acteristic of albatrosses, storks, vultures, Phaethon, 





Fig. 7. Diomedea immutabilis. Dorsal view of the right shoul- 
der of the Laysan Albatross, showing Mm. deltoideus and triceps 
brachii. Propatagial musculature has been removed. DMCa, M. 
deltoideus major, caudal head; DMCr, M. deltoideus major, cra- 
nial head; DMi, M. deltoideus minor; dpc, deltopectoral crest; h, 
humerus; sa, scapular anchor; TS, M. triceps scapularis. 
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Fig. 8. Diomedea immutabi- 
lis. ALD 58 antibody reactions in 
wing elevator and _ protractor 
muscles of the Laysan Albatross. 
SC, supracoracoideus; DM, del- 
toideus major, pars cranialis; 
CBC, coracobrachialis cranialis; 
PCr, Cranial region of the super- 
ficial pectoralis. Positive reac- 
tion for slow muscle fibers re- 
sults in a dark staining profile. 
Note that CBC is uniformly slow 
and that the other three muscles 
have various populations of slow 
muscle fibers. Scale bar = 160 et here a pa hare | . 
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Fregata, Plotus [=Anhinga], Sula, and Pelecanus. flight (and was thus absent from birds that did not 
Fisher (1946) indicated that its main function (in _ soar), that it acted to hold the wing motionless de- 
vultures) was to hold the wing down against the air spite wind action from below, and that its fibers 
moving upwards. Kuroda (1960, 1961) suggested should be “white” for this purpose. In 1972 Penny- 
that the deep layer was an adaptation to soaring cuick indicated that the deep layer makes up 8—10% 
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Fig. 9. Diomedea immutabilis. Dorsal view of the right elbow 
region of the Laysan Albatross, showing the insertions of Mm. 
triceps and the origins of Mm. extensor metacarpi radialis. AF, 
antebrachial fascia; EMRd, M. extensor metacarpi radialis pars 
dorsalis; EMRv, M. extensor metacarpi radialis pars ventralis; h, 
humerus; 0, patagial ossicle; oc, olecranon area of ulna; pc, Proc. 
Cotylaris dorsalis of ulna; Pr, Lig. propatagiale; TH, M. triceps 
humeralis tendon; TS, M. triceps scapularis tendon. 


of the entire pectoralis mass in vultures and storks, 
that it was most probably a tonic muscle, and also 
calculated its power consumption. In 1982, Penny- 
cuick described the shoulder lock in albatrosses, and 
suggested that the deep layer should be a twitch, 
rather than tonic, muscle (see below). 

Rosser and George’s (1986a) thorough study on 
the pectoralis of 43 species of birds revealed only a 
few species with slow fibers in this muscle. Subse- 
quent examination of Turkey Vultures (Rosser and 
George, 1986a) and White Pelicans (Rosser et al., 
1994) revealed that their deep pectoralis muscles 
were uniformly slow. The pelican was found to ex- 
press the SM2 myosin, also found in the tonic ante- 
rior latissimus dorsi (Rosser et al., 1994; Bandman 
and Rosser, 2000), suggesting the deep pectoralis is 
tonic as well. A small cluster of slow-twitch fibers 
(~20) was found in the deep region of the undivided 
pectoralis of only one California Gull (Meyers and 
Mathias, 1997). No slow fibers were found in various 
regions of the pectoralis of two Herring Gulls or one 
Ring-billed Gull studied by Rosser and George 
(1986a). Likewise, only fast fibers were found in the 
pectoralis of skuas and Herring Gulls (Caldow and 
Furness, 1993). Meyers and Mathias (1997) sug- 
gested that fast fibers may function during gliding 
and soaring in California Gulls, which lack the deep 
layer to the pectoralis and any functional quantity of 
slow fibers. The endurant properties of gull pectora- 
lis muscle during isometric contractions need to be 


tested experimentally. Other gliding and soaring 
taxa that lack the deep layer (e.g., eagles, hawks) 
need to be investigated as well. 


Shoulder Lock 


Pennycuick (1982) discovered the locking mecha- 
nism in the albatross shoulder (although Rayner 
first mentions Pennycuick’s finding in his 1981 ar- 
ticle), and described it as being made up of “a sheet 
of tendon forming part of the superficial pectoralis 
muscle.” In three species of albatrosses examined 
(Wandering, Black-browed, and Light-mantled: D. 
exulans, D. melanophris, and Phoebetria palpebrata, 
respectively) he found the lock at a “superficial level” 
in the pectoralis, with a small amount of muscle 
lying over it, whereas in the Southern Giant Petrel 
(Macronectes giganteus) he described the lock as be- 
ing on the deep surface of the superficial pectoralis. 
Due to the presence of the lock, and the pale color of 
the deep pectoralis muscle, Pennycuick (1982) sug- 
gested that the deep layer should be a fast muscle 
(not slow tonic), specialized to function during land- 
ing when albatrosses rotate their wings about the 
long axis at frequencies twice that of flapping (see 
Scholey, 1982). Bannasch and Lundberg (1984) ex- 
amined Macronectes and provided more detail on the 
anatomy of the lock. They reported (as we do) that 
the locking tendon is found within the deep pectora- 
lis, and also suggested that the muscle should be 
tonic—even though they agreed with Pennycuick 
about the unusual wing rotation in albatross land- 
ing. Bannasch and Lundberg suggested that the su- 
perficial pectoralis can contribute to the rotation 
described by Scholey and Pennycuick. Our own ob- 
servations of Macronectes support the suggestions of 
Bannasch and Lundberg. The lock in the giant pe- 
trel as well as two additional albatross species we 
examined (D. exulans and melanophris) was quali- 
tatively identical to that in D. immutabilis and D. 
nigripes, and was clearly located within the deep 
pectoralis muscle. Furthermore, the pale color Pen- 
nycuick observed is typical for the deep pectoralis 
and avian slow tonic muscles in general (Meyers, 
pers. obs.). 

Pennycuick (1982) described the albatross locking 
mechanism as a functional consequence of full wing 
protraction, and reported that the lock could be “re- 
leased” by retracting the wing a few degrees. Con- 
trary to Pennycuick’s findings, we could not get the 
lock to release using the manipulations he de- 
scribed. The wing lifted easily only when the inter- 
nal tendon of the deep pectoralis was cut (in one 
specimen excessive manipulation tore the deep pec- 
toralis off of the sternum). We suggest that since the 
lock is made up of collagenous tissue extending from 
origin to insertion, elevation would require some 
degree of stretching or a changed skeletal configu- 
ration to provide slack in the tendon. 
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Fig. 10. Diomedea immutabi- 
lis. ALD 58 antibody reactions in 
elbow and wrist extensor muscles 
of the Laysan Albatross. TS, tri- 
ceps scapularis; TH, triceps hu- 
meralis; EMRd, extensor meta- 
carpi radialis pars. dorsalis; 
EMRv, extensor metacarpi radia- 
lis pars ventralis. Positive reac- 
tion for slow muscle fibers results 
in a dark staining profile. Note 
that EMRd is uniformly slow and 
that the other three muscles have | e 
populations of slow muscle fibers. ; 
Scale bar = 160 pm. 
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As is well known, the coracoid forms a saddle-like 
synovial joint within the coracoid sulcus of the ster- 
num (Baumel and Raikow, 1998; see Fig. 6), and is 
able to slide or glide within this joint through the 
action of the M. sternocoracoideus. The functional 
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significance of this sliding movement is not known. 
It is not believed to function in respiration (Vanden 
Berge and Zweers, 1993) as previously suggested 
(Raikow, 1985); Dial et al. (1991) found that it is 
active at the upstroke—downstroke transition, just 
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prior to the beginning of downstroke. They indicated 
that this activity pulls the coracoid caudolaterally, 
deflecting the acrocoracoid process laterally and 
spreading the shoulders. This movement was also 
reported by Atkins (1977) who showed the effect on 
coracoid translation on linkage ligaments in the pi- 
geon pectoral girdle. He calculated a coracoid sliding 
angle of about 30°, corresponding to a decrease in 
the distance from the glenoid to the sulcus of about 
0.6 mm in the pigeon. We suggest that this lateral 
movement of the coracoid within the sternal sulcus 
(which we also observed as possible in albatrosses), 
may also reduce the distance from the pectoralis 
insertion to the origin. The movement of the coracoid 
caudolaterally would bring the glenoid (and there- 
fore the pectoralis insertion onto the humerus) to- 
wards the sternum, thus decreasing that distance. 
This reduced distance could make elevation of the 
humerus possible without stretching the tendinous 
shoulder lock. Whether any stretching or elasticity 
of this tendon is possible requires additional in vivo 
physiological measurements, and a further evalua- 
tion of the function of M. sternocoracoideus in alba- 
trosses. 

Other locking mechanisms have been described 
for bird and bat feet (Quinn and Baumel, 1990; 
Schutt, 1993) and horse knees (Sack, 1989) and 
shoulders (Hermanson and Hurley, 1990). The ar- 
rangement of an internal tendon within the deep 
pectoralis, extending from origin to insertion as the 
limiting strut, is analogous to that found in horse 
biceps brachii, and that muscle’s role in quiet stand- 
ing (Hermanson and Hurley, 1990). The slow, short- 
fibered fascicles within the lateral head of the horse 
biceps brachii were suggested to be important for 
small amounts of force production required over long 
periods of time. This provided sufficient tension on 
the internal tendon for maintaining posture through 
slight regional adjustments (Hermanson and Hur- 
ley, 1990). A similar situation exists in the horse 
knee joint, which requires a low level of vastus me- 
dialis activation to stabilize the knee despite the 
presence of a “locking mechanism” (Schuurman et 
al., 2003). The uniform, slow fiber composition of the 
deep pectoralis provides a postural, endurant stabil- 
ity to the lock that limits wing elevation to, or below, 
the horizontal during gliding and soaring. This ap- 
pears to provide fine control of wing elevation when 
confronted with the buffeting of rigid ocean winds 
during maneuvers such as banking and turning. 


Wing Elevation 


Wing elevation musculature is important to glid- 
ing flight as an antagonist to the pectoralis in main- 
taining static shoulder stabilization in the trans- 
verse plane (Meyers, 1993). Based on the uniform 
slow fiber composition of albatross deep pectoralis 
(DP), one might predict slow fiber populations for 
muscles of wing elevation as well. Indeed, all three 


heads of supracoracoideus (SC), as well as deltoid 
major (DM), all possessed about 13% slow fibers. 
The SC and DM have not been well studied with 
respect to their fiber types. Neither muscle had any 
slow fibers in kestrels (Meyers, 1992a), nor did the 
SC in skuas or Herring Gulls (Caldow and Furness, 
1993) or starlings (Goslow et al., 2000). Boesiger 
(1987, 1992) reported two fiber types in the SC of 
five passerine and three galliform species, but these 
were Classified as lipid-rich and lipid-poor. No slow 
fibers were mentioned. That these muscles have 
both fast and slow fibers in albatrosses seems to 
agree with their dual role in flight—as powerful 
wing elevators during take off and landing, as well 
as stabilizing muscles for prolonged soaring. 


Wing Protraction 


We found the M. coracobrachialis cranialis (CBC) 
to be exclusively comprised of slow fibers in alba- 
trosses. This suggests the importance of postural 
control in wing protraction for gliding flight. Fisher 
(1946) found larger CBC muscles in condors than 
vultures and suggested that it was an adaptation for 
improved soaring ability. Stegmann (1964) indi- 
cated that doves use this muscle for wing protrac- 
tion, since they lack a bowed furcula and could not 
use the cranial part of the pectoralis for this action. 
Meyers and Mathias (1997) found the CBC of the 
California Gull to have about 8% slow fibers, 
whereas Double-crested Cormorants contain 31% 
slow-twitch fibers, assumed to function in their 
wing-drying posture (Meyers, 1997). Meyers (1992a) 
surveyed CBC along with other kestrel shoulder and 
brachial muscles and found it to be uniformly fast- 
twitch. Simpson (1979) described various percent- 
ages of tonic fibers in several pigeon muscles, includ- 
ing CBC, but did not quantify fiber types. 

The cranial edge of the superficial pectoralis (PCr) 
has a large population of slow fibers. These slow 
fibers exist only within that portion of SP that is 
apparently best suited for wing protraction. Meyers 
(1993) found this region active during gliding flight 
in kestrels, yet it was exclusively fast in that species. 
A similar, although much smaller population of slow 
fibers (about 100 fibers in only one of three birds 
examined) was found in this region of the pectoralis 
of cormorants (Meyers, 1997). These too were as- 
sumed to function in the wing-drying posture. No 
slow fibers were found in this region of the Califor- 
nia Gull pectoralis (Meyers and Mathias, 1997). 


Wing Extension 


The histochemistry of the triceps muscle is per- 
haps one of the best studied of the non-pectoralis 
flight muscles in birds. Kovacs and Meyers (2000) 
found 1% of the puffin TS to be slow tonic (four out 
of 480 fibers). No slow fibers were found in the TS of 
mallards, coots, or Yellow-legged Gulls (Torrella et 


ALBATROSS SHOULDER LOCK 25 


al., 1996, 1998a,b, 1999), although Meyers and 
Mathias (1997) found fewer than 40 slow (twitch) 
fibers in two out of three California Gulls studied. 
This discrepancy may be due to the fact that Meyers 
and Mathias examined the entire muscle in cross- 
section, whereas Torrella and colleagues relied on 
three small sample areas of about 500 fibers each 
(Torrella et al., 1999). Hector’s (1894) suggestion 
that the sesamoids of the patagial fan (subsequently 
investigated by Mathews, 1936, and Brooks, 1937) 
act as an elbow lock in albatrosses has been rejected 
by Yudin (1957) and cannot be substantiated by our 
anatomical observations. The arrangement of the 
sesamoid is peculiar, and has been suggested to 
function as a strut in the long patagial tendons 
(Mathews, 1936; Brooks, 1937). In his analysis of 
Hector’s proposed mechanism, Yudin described an 
elbow “fixing” mechanism. When the wing is ex- 
tended, the proximal radius butts against the distal 
humerus, thus limiting the degree of wing extension 
(and therefore the wrist as well). Joudine [Yudin] 
(1955) suggested that albatrosses and other soaring 
species (e.g., Puffinus griseus, Fulmarus glacialis, 
Oceanodroma furcata) possessed this elbow “stop” 
but those without this specialized flight mode did 
not. 

Like the triceps, EMR histochemistry has been 
relatively well studied in birds. Cormorants have 29 
and 6% slow fibers in their EMRd and EMRyvy, re- 
spectively (Meyers, 1997); almost identical percent- 
ages were found in California Gulls (28 and 6%; 
Meyers and Mathias, 1997). Torrella et al. (1998a) 
reported 9.4 and 3.6% of SO fibers in their sampled 
regions of EMRd and EMRv of coots, and 6.1 and 8.1 
in Yellow-legged Gulls (Torrella et al., 1998b). Mal- 
lards (Torrella et al., 1998a) and House Finches 
(Meyers, unpubl. obs.) have uniformly fast EMR 
muscles. These numbers contrast sharply with the 
slow muscle content in the albatross EMR. 


Development and Evolution of the Deep 
Pectoralis 


Rosser et al. (1994) discussed the evolution of the 
deep pectoralis in Gruiformes and Ciconiiformes. 
They suggested that the deep layer developed from a 
separate neuromuscular compartment of the deep 
“red strip” of the undivided pectoralis. They also 
discussed the relationship of the mammalian soleus 
within the triceps surae as being analogous to the 
superficial/deep pectoralis in soaring birds. 

Peters (1989) indicated that during evolution, 
neuromuscular compartments can split off from 
their parent muscle and may allow special functions. 
Studies of primitive marsupials suggest that the 
soleus arose as a neuromuscular compartment from 
the lateral gastrocnemius (Lewis, 1962). In lizards, 
the homolog to the triceps surae is a single muscle, 
with slow fibers (a neuromuscular compartment) sit- 


uated in the same position and function as the mam- 
malian soleus (Putnam et al., 1980; Peters, 1989). 

The pectoralis of chickens possesses muscle fibers 
that react with antibodies to both fast and slow 
myosins (Bandman et al., 1982). In 11-day chick 
embryos, slow myosin light chains were detected 
throughout the entire pectoralis (Matsuda et al., 
1982). By day 16 the slow myosins begin to disap- 
pear, and become restricted to the most anterior 
region of the muscle by day 2 posthatch (Matsuda et 
al., 1983). Pigeons show some reactivity to the slow 
myosin antibody (NA8) throughout the pectoralis at 
13 days in ovo, but have a uniformly fast pectoralis 
as adults (Rosser et al., 1998). A similar ontogenetic 
pattern of widespread slow fibers that are replaced 
by fast fibers can be found in the pectoralis of the 
bat, Myotis lucifugus. Myotis has a deep area of the 
pectoralis with slow fibers in late fetal life through 5 
days postnatally, giving way to a muscle that is 
100% fast in adults (Schutt et al., 1994). These stud- 
ies may provide an explanation for the development 
of slow fiber regions within the pectoralis muscle of 
gliding and soaring species. Since slow myosin is 
typically expressed first, those species with a deep 
layer may retain this early developmental feature. 
This can also explain the existence of slow fibers in 
the cranial part of the pectoralis, where they are not 
part of a distinct, isolated muscle, but are presumed 
to be a functional neuromuscular compartment. On- 
togenetic studies may elucidate the development of 
these pectoralis fiber types. 

Meyers (1993) described the evolution of gliding 
morphology and identified birds with a “partly di- 
vided” pectoralis. These birds (including kestrels, 
ravens, dippers, Sandhill Cranes) have a region of 
muscle fascicles, largely arising from the furcula 
and adjacent connective tissues, which insert onto 
the biceps tendon overlying the bicipital crest of the 
humerus (Meyers, 1993) (Fig. 11). This morphology 
is similar to that of the deep pectoralis, and Meyers 
(1993) indicated that this could represent an inter- 
mediate step in the evolution of gliding and soaring 
birds. In kestrels, this region was active during glid- 
ing flight in a windtunnel, but had no slow muscle 
fibers (Meyers, 1993). The muscle fiber types within 
the region of other species with the “partly divided” 
morphology are presently unknown. 

Chick limb muscles typically cleave from adjacent 
muscles from origin to insertion (or vice versa) but 
not in the middle (Schroeter and Tosney, 1991; Kar- 
don, 1998). Schroeter and Tosney (1991) reported 
that most of the thigh muscles cleave from insertion 
to origin and stated that a similar pattern also exists 
within the forelimb (see also Sullivan, 1962). This 
cleavage pattern does not appear to be related to the 
pattern of blood vessels, activity, or innervation. 
Therefore, the “partly divided” morphology de- 
scribed by Meyers (1993) might represent a partial 
or incomplete cleavage beginning at the insertion 
and stopping before the muscle was cleaved in two. 
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Fig. 11. Diagram of the hypothetical evolution of the deep 
layer of pectoralis muscle, seen from medial view. The humerus 
and M. biceps brachii are shown in cross-sections. A: Typical 
undivided pectoralis inserting onto the deltopectoral crest. 
B: “Partly divided” pectoralis, with fascicles arising from the 
furcula and sterno-coraco-clavicular membrane and attaching 
onto the biceps tendon. C: Divided pectoralis of albatrosses and 
others, with a deep, slow layer inserting separately onto the 
biceps tendon. B, M. biceps brachii; h, humerus. 


In species with a completely separate, slow, deep 
layer, cleavage would complete and produce two 
muscles (Fig. 10), and the slow muscle fiber compo- 
nent of the deep layer would be retained. 

Sullivan (1962) reported that the separation of 
pectoralis and supracoracoideus is visible by stage 


25-26 (~4.5 days: Hamburger and Hamilton, 1951). 
Presumably, a deep/superficial pectoralis separation 
would be visible by this stage as well. Muscle devel- 
opment and embryo studies of a readily available 
species with a slow deep layer (e.g., pelicans) may 
reveal more on the development and evolution of 
this structure. 


Biomechanics of the Deltopectoral Crest 


In albatrosses, the bulk of the deltopectoral crest 
is devoid of muscle attachment; the large pectoralis 
muscle inserts at the base of the crest, and the 
deltoideus muscle does not reach the edge of the 
crest (Fig. 3). Thus, the crest protrudes farther than 
“necessary for muscle attachment. This is in con- 
trast to birds such as pigeons and starlings, where 
the entire surface area of pectoralis insertion is onto 
the crest (Dial and Biewener, 1993). It is well known 
that the pectoralis muscle produces pronation of the 
wing in addition to depression (Bannasch and Lun- 
dberg, 1984; Dial et al., 1988). An attachment closer 
to the edge of the crest would increase the muscle 
in-lever, and hence, pronation ability, since it would 
move the insertion further from the axis of rotation. 
Why then is the crest larger than needed? We sug- 
gest a few possibilities: 1) The patagialis muscles 
arise from the furcula and pass over the deltopec- 
toral crest. The more the crest protrudes, the greater 
the leverage of these muscles, although they do not 
attach to the crest. 2) The large, “unused” part of the 
crest may be the result of differential growth of the 
humerus and not be strictly “adaptive.” Further in- 
vestigations can evaluate these hypotheses and oth- 
ers. 


Use of Frozen-Refrozen Tissue 


Jouffroy et al. (1999) reported that they could get 
reliable immunohistochemical results with fixed tis- 
sue. While we were unable to duplicate their success 
with fixed albatross tissue, we were able to use tis- 
sue from previously frozen specimens. In some 
cases, birds had been stored in the freezer for over 
1.5 years. Our specimens were frozen on site in 
Hawaii, shipped to us, then thawed and refrozen in 
liquid nitrogen. Comparisons with control tissues 
showed no affect of refreezing, nor were there any 
serious freezer artifacts. Previous research of this 
type has typically made use of fresh, unfrozen spec- 
imens, requiring histochemical or immunohisto- 
chemical analysis to be completed soon after speci- 
men death (although Rosser et al., 1994, also used 
previously frozen tissue). We were impressed with 
the quality of the sections as well as antibody reac- 
tivity. This unconventional technique may facilitate 
future research in muscle histochemistry and immu- 
nohistochemistry, by allowing specimen freezing, 
thawing, reaction, and subsequent photography or 
analysis without qualitative muscle tissue damage. 
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CONCLUSIONS 


This study revealed immunohistochemical, ana- 
tomical, and biomechanical data heretofore un- 
known or poorly described in albatrosses. Immuno- 
histochemical reactions were used to _ identify 
muscles within the albatross gliding flight appara- 
tus that possessed slow (postural) fibers, including 
some that were uniformly slow (CBC, EMRd, and 
DP). This appears consistent with the extreme na- 
ture of gliding and soaring behavior in these birds 
and the low energetic cost of locomotion in these 
birds (Costa and Prince, 1986), and sharply con- 
trasted with findings of most other birds studied. 
This study also reinvestigated the anatomy and bi- 
omechanics of the shoulder lock discovered by Pen- 
nycuick (1982). The tendinous locking mechanism, 
which functions to limit wing elevation above the 
horizontal, was discovered to be associated with the 
deep layer of the pectoralis in this investigation, 
rather than the superficial layer of the pectoralis as 
described by Pennycuick (1982). Pennycuick’s place- 
ment of the shoulder lock, its ability to release, and 
the prediction of fast fiber types in DP are not con- 
sistent with the findings of the present study. 
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An inflatable wing using the principle of Tensairity 


Joep Breuer and Wubbo Ockels' 
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Rolf H. Luchsinger' 
Empa, Dtibendorf, Switzerland 


[Abstract] The paper describes the new concept Tensairity which can be used 
to significantly improve the load bearing capacity of inflatable wings. The basic principle of 
Tensairity is to use an inflatable structure to stabilize conventional compression and tension 
elements. So far, Tensairity has been mainly used in civil engineering application like roof 
structures and bridges. In this work, considerations to apply Tensairity to wing structures 
are given and the construction of two wing-like Tensairity kite prototypes is described. Test 
results on the Tensairity structure used in these kites are presented and compared to purely 
air inflated structures. Finally, the advantages of Tensairity wings are discussed and some 
application areas of these wings are suggested. 


Nomenclature 
E = Young’s Modulus [ N/m? ] 
h = height [m] 
I = moment of inertia [ m*] 
k = spring coefficient [ N/m’ ] 
L = length [ m ] 
M = bending moment [ Nm ] 
P = force [N | 
p = pressure [ N/m*] 
gq = distributed load [ N/m ] 
Ro = radius [ m ] 
T = force [N | 
y = slenderness [ - ] 
0) = angle [rad] 


I. Inflatable wings 


he idea of using an inflatable structure to build a load carrying wing has a long history. In a patent from 1933, 

inflatable spars were already suggested. During the 1950’s there were two interesting developments in manned 
flight using inflatable wings (the Inflatoplane from Goodyear and the British ML Aviation Mk1, which is a tailless 
design)'. In the more recent past, from 1990-2000, the Swiss company ‘prospective concepts’ developed a series of 
aircrafts with inflatable wings as technology demonstrators’. The Stingray, a tailless design with 13m span had webs 
in chord direction and operated with a very low internal pressure of 0.02-0.05-10° N/m° (Fig. 1). 


“ Graduate Student, Department of Aerospace Engineering, ASSET Chair, Klyuverweg 1, 2629 HS, Delft, The 
Netherlands. 
' Professor and Head of Chair, Department of Aerospace Engineering, ASSET Chair, Klyuverweg 1, 2629 HS, 
Delft, The Netherlands. 
* Head of Center, Empa, Center for Synergetic Structures, Ueberlandstrasse 129, CH-8600, Diibendorf, Switzerland. 
‘ www.prospective-concepts.ch 
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Figure 1. The manned inflated aircraft Stingray of ‘prospective concepts’. 


In the later prototypes Pneuwing and Pneumagic, slender wings were realized with webs in span direction and a 
higher internal pressure in the order of 0.7-10° N/m’ (Fig. 2). The spacing between the webs is very small (in the 
order of a few centimeters) to accurately define the airfoil shape. 





Figure 2. The manned inflated aircraft Pheuwing (left) and Pheumagic (right) of ‘prospective concepts’. 


Recently the interest in inflatable wings is shifted to un-manned aircraft with examples as the NASA Dryden 
12000 inflatable wing and the prototypes of the University of Kentucky in cooperation with ILC Dover’. 

Inflatable surf kites can also be seen as inflatable wings although there the definition of the airfoil 1s much 
cruder. The front of the airfoil is shaped by a single air tube (with an internal pressure of 0.5-10° N/m’). The rest of 
the airfoil 1s a thin membrane behind this tube, tensioned by air struts in chord wise direction and the wind. The 
development in these kites is advancing quickly as the sport is gaining popularity. Most of the development follows 
from experiments by surf kite designers, although currently the scientific interest in kites 1s increasing for their 
possible use in energy production and ship propulsion® ’. 

One advantage of inflatable wings is that their flexibility gives them the capability to recover from gusts and 
crashes. Also it gives an opportunity to actively morph the wing for control. However, this flexibility also has a 
disadvantage. To get enough stiffness for a certain load capability either high pressure, thick wings, low aspect ratio 
or a combination of these factors is necessary. A way to improve the load bearing capacity and stiffness of inflatable 
wings while keeping most of the advantages 1s to use the structural concept Tensairity. 
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Il. Tensairity® 


The basic idea of Tensairity is to combine an airbeam with conventional cables and struts to improve the load 
bearing capacity of inflatable structures*'°. The name Tensairity, a combination of tension, air and integrity, reflects 
the relationship with the structural concept Tensegrity. Developed by R. Buckminster Fuller, Tensegrity structures 
use a combination of cables and struts purely loaded in tension and compression, where two compression elements 
only interact with each other by means of the cables. In short, Tensairity 1s Tensegrity plus air. 

The basic Tensairity beam consists of a compression element, a low pressure airbeam which is tightly connected 
to the compression element and two tension elements which run from end to end of the compression element in a 
spiral way around the airbeam (Fig. 3). Following Ref. 8 and 9, the low pressure airbeam (typical pressures for 
Tensairity are in order of 0.1-10° N/m’) pretensions the cables and stabilizes the compression element against 
buckling. 


compression element 





airbeam . _ 


Figure 3. Basic Tensairity beam element. 


Some fundamental relations shall be briefly summarized. The slenderness of a girder is defined as: 


(1) 





Figure 4. Forces in a Tensairity beam element. 


If a distributed load q acts on the structure, the load is transferred by the airbeam to the tension elements 
(cables). The total force in the tension element is given by 


™ Tensairity® is a patented technology of the Swiss company Airlight Ltd developed in close collaboration with 
prospective concepts ag. The Tensairity research activities of ‘prospective concepts’ have been recently transferred 
to the Center for Synergetic Structures at Empa, the Swiss Federal Laboratories for Materials Testing and Research. 
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T= 7 Be oe Be ey Se i . (2) 


As the tension element is tightly connected at both ends with the compression element, the horizontal force flow 
is closed leading to a compressive force in the compression element. For a compression member not only the 
stresses but also buckling phenomena have to be taken into account. In case of a simply supported structure, the 
allowable compression load can by approximated by Euler buckling, where the buckling load depends as the inverse 
square of the length of the element. In Tensairity, the compression element is tightly connected to the airbeam which 
can be considered as a continuous elastic support. In this case, the allowable compression load 1s independent of the 
length and given by [ref. 8.]: 


P=2-Vk-E-I , (3) 


where k is the spring coefficient of the foundation. The spring coefficient of the airbeam in a Tensairity structure 
depends mainly on the internal overpressure and can be approximated by 


kK=p-7. (4) 


This means that the allowable buckling load for a compression element in Tensairity is approximated by 


P=2-/jp-a-H-Ig . (5) 


Depending on pressure, Young’s modulus and moment of inertia , this allowable buckling load is in general 
higher than the Euler buckling load and can be higher than the load based on the yield stress of the material. 

First realized applications of Tensairity are bridges and roof structures (Fig. 5). Light weight, fast and easy setup, 
new formal language and lighting options are properties of Tensairity which make this technology interesting for 
civil engineering applications. But Tensairity is a general concept for light weight structures and thus by no means 
restricted to civil engineering applications. It is the purpose of this work to investigate the potential of Tensairity for 
wing structures. 





Figure 5. First applications of Tensairity in civil engineering: test bridge with 8 m span (left) and roof over a 
parking area in Montreux, Switzerland with 28 m span. 
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II. Theoretical weight comparison 


As a first attempt to demonstrate the potential of Tensairity, the weight of different structures under the same 
load conditions and geometries are compared. A simple airbeam, a truss, a Tensairity spindle and a Tensairity web 
spindle are considered. Next to the cylindrical form (Fig. 3), Tensairity girders can also have a spindle shape as in 
the roof structure of Fig. 5. In this comparison only symmetrical spindles are considered. The spindle shape leads to 
stiffer structures by using less membrane material compared to cylindrical shaped Tensairity10. A further 
improvement of Tensairity is possible when a web 1s placed between the tension and compression element (Fig. 6). 


2°Ro 





Figure 6. Cross-section of a Tensairity spindle (left) and a Tensairity web spindle (right). The black dots 
indicate tension and compression element. 


This web will give an additional support for the compression element increasing the value of the spring 
coefficient in Eq. 4. This improves the stabilization of the compression element and can further reduce the necessary 
internal pressure. The main function of the air pressure for the Tensairity web structure is to pretension the web 
which then stabilizes the compression element against buckling. 

A span of 3m is assumed for all girders in this test and a distributed bending load of 250 N/m is applied on the 
simply supported beams, while different values for the slenderness are investigated. To have a fair comparison the 
truss structure has the same spindle shape. The airbeam, however, has a cylindrical shape, because a spindle shaped 
airbeam would buckle at the thin supported ends. The slenderness of a spindle is defined in the same way as in Eq. 
1, where Ro is the radius in the middle of the structure. In case of the web spindle, the slenderness is defined as the 
ratio of length over the height of the web in the middle of the structure. The four different structures used in this test 
are shown in Fig. 7. 





(Cc) (d) 


Figure 7. Four structures investigated in the theoretical weight study: a.) simple cylindrical airbeam, 
b.) truss, c.) Tensairity spindle, d.) Tensairity web spindle. 
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The following materials are used in the calculations if appropriate: high E-modulus carbon/epoxy for the 
compression elements (E = 142.5 GPa, Omax = 490 MPa), high tension carbon/epoxy for the tension elements (E = 
115 GPa, Omax = 890 MPa), and an Aramid/PU composite for the membrane (E = 35 GPa, Omax = 550 MPa). 

To find the minimum weight for each structure at each slenderness under the given load conditions different 
parameters were varied. For the Tensairity spindle the pressure was varied. The pressure determines the thickness of 
the membrane of the airbeam and has an influence on the size of the compression element (Eq. 5). In case of the 
web Tensairity spindle the angle @ between the top of the web and the center of the adjacent tube was varied (Fig. 
5). This angle determines the necessary internal pressure to pretension the web and the area of the membrane. 

In case of the truss the number of equally spaced vertical elements was varied. The number of vertical elements 
has an influence on the size of the vertical elements, the diagonal elements and the buckling of the compression 
element. For the airbeam, no parameter was varied. However, the pressure needed to sustain the moment in the 
middle of the beam was calculated® and from this pressure the membrane thickness determined. The safety factor for 
the membrane was set to 2. Local buckling of the thin walls of the square shaped compression elements as well as 
global buckling were taken into account. No minimum was put on the cross sectional area of the compression and 
tension elements and the thickness of the membrane. The weight of the connections between the elements both in 
the truss and in Tensairity were not taken into account and the out of plane stability was not considered. 


Theoretical weight comparison ( L = 3 mq = 250 N/m) 
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Figure 8. The minimal weight of various structures for given load condition and span as a function of the 
slenderness. 


The result of this theoretical weight study is shown in Fig. 8. For slenderness above 10, the airbeam is the 
heaviest structure. This is due to the fact that the air pressure of a simple airbeam increases with the square of the 
slenderness*. The high pressure leads to high fabric tensions and thus to a thick membrane. The Tensairity spindle is 
considerably lighter than the airbeam. The much smaller air pressure in Tensairity allows for a thinner membrane 
and thus a drastic weight reduction which is by no means compensated by the weight of the additional compression 
and tension element. The lightest structure is the web Tensairity spindle, where the pressure can again be reduced. 
For slenderness values above 20, the difference between truss and web Tensairity becomes negligible. The reason is 
that for high slenderness values the weight of both the truss and Tensairity is dominated by the weight of the 
compression and tension elements. The weight of the compression element is then determined by the material limit 
too and no more a function of the type of support (air, web or strut). This tendency can also be seen for the normal 
Tensairity spindle which approaches the weight of the truss and web Tensairity for high slenderness values. 

The major result of this theoretical study is that the weight of a Tensairity structure 1s comparable to the weight 
of an optimized truss. The web Tensairity structure is lighter than the normal Tensairity structure by a further 
reduction of the air pressure. The airbeam is from all structures the heaviest one for slenderness values above 10. 
Typical slenderness values for wing structures can be in the order of 50. Thus, much weight can be saved compared 
to the conventional inflatable wing design by using Tensairity. 
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IV. Experiments on a Tensairity web spindle 


To get a better insight into the loading capabilities of a slender Tensairity web spindle, a test model of a 
Tensairity spar with the two adjacent tubes was built and loaded in different configurations. The dimension of the 
girder is defined by the kite model with 2 m* area which will be discussed in the next section. The model was made 
using the same light weight materials as for the kite: Exel carbon tubes as compression elements, an Aramid line as 
tension element, aluminum connection elements and Icarex polyester rip-stop fabric with a PU coating as restraint. 
Compared to the kite, only the bladder was of a slightly heavier polyethylene foil, due to availability. The length of 
the test model was 3.1 m, the radius of the tubes about 45 mm, the maximum height of the web 58 mm and the 
angle @ varied from 50° in the center of the tube to 78° at the ends of the web,. These dimensions result in a 
slenderness of 53 (see Eq. 1). Both the tension and compression element have the shape describe a circular arc over 
the length of the girder and are inserted in pockets sewn at the top and bottom of the web in the restraint. The 
distributed load was approximated by using 9 load introduction points. The test model was supported at both ends. A 
drawing of the test set-up can be seen in Fig. 9. 


Figure 9. The experimental set-up for a Tensairity web spindle. The arched lines are the tension and 
compression element. The compression element is on the upper side under the given load conditions. The web 
is the area between tension and compression element. 


In the first experiment, the Tensairity web configuration was compared with a simple inflatable configuration 


(air tubes), that means the tension and compression element were removed from the structure in this case. In Fig. 10, 
the load deflection diagram is shown for a pressure of 0.075-10° N/m’. 
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Figure 10. Experimental load-deflection diagram for a Tensairity web spindle and the according air tubes 
(simple airbeam) for a pressure of 0.075-10° N/m”. 
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The stiffness and the maximum allowable load of the air tubes are as expected at the same pressure much smaller 
for the air tubes than for Tensairity. The deflection curve of Tensairity is very straight up to the point of buckling. A 
similar result is found for other investigated pressure values ranging from 0.025-10° N/m‘* to 0.1-10° N/m”. An effect 
which lowers the allowable buckling load in Tensairity 1s the moving of the compression element in the pocket 
under large deformations'’. This effect can be reduced by a careful design of the pocket increasing the friction or by 
connecting the membrane after assembling at both ends with the compression element. . 






















































































































































Tensairity 
25 mbar 
50 mbar 
FS 75 mbar 
-50 - <—— 100 mbar | 
~—+—~ analytical estimation 
© -100/ | 
E 
ce ™ 
6 ta oS 
\ \S 
206 L \ * | 
-250 ! ! ! ! : : 
0 5 10 15 20 25 30 35 


distributed load (N/m) 


Figure 11. Load-deflection diagram of the Tensairity web spindle for different pressure values. 


The load-deflection diagram for the Tensairity web spindle at different pressure values is shown in Fig. 11. With 
increasing pressure, obviously the buckling point is postponed and the stiffness increases. The positive influence of 
the pressure on the stiffness can be partly explained by the increased friction between the pocket and the 
compression element for higher pressure values. However, the major effect of the pressure on the stiffness is a more 
pretensioned membrane which leads to a more constant spacing between tension and compression element. 

An analytical estimation is also shown in Fig. 11, which was made using the modified Castigliano’s theorem 


dx , (6) 





_6 (Mids (= OM 
' OP? 2EI EI }\ éP 


where P is a dummy load. As a simplification, only the tension and compression element with a fixed distance 
(infinitely stiff web with infinite pressure) were considered in this model. The bending stiffness EI varies along the 
length x of the beam due to the geometry of the spindle and the bending moment M varies with x too. However, due 
to these simplifications, the deflection of this analytical model 1s independent of the air pressure. Considering Fig. 
10 it is a reasonable approximation to neglect the contribution of the air tubes to the stiffness of the Tensairity 
structure. As can be seen in Fig. 11, the stiffness of the Tensairity web spindle is quite well approximated by the 
simple analytical model especially for high pressure values. 
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The increase in weight by adding the compression and tension element to the air tubes in Tensairity is very small 
compared to the increase in stiffness and allowable load. The weights of the test girder model are given in table 1. 








Part Material Weight (gr) % of total weight 
Compression element Exel carbon tube 3.9x2.5 mm 34 10.8 
Tension element 2 mm Aramid line 5 1.6 
Connection elements Aluminum 12 3.8 

Total Tensairity 16 
Restraint Icarex kite fabric 31 gr/m* 86 27.3 
Bladder Polyethylene 70 gr/m* 178 56.5 

Total weight 315 100 





Table 1. Weights of the different components of the test web Tensairity girder. 


The tension and compression elements are only 16% of the total weight of the structure which is dominated by 
the weight of the restraint and especially the weight of the bladder. Using a thinner bladder would reduce the total 
weight of the girder. However, this would also slightly increase the portion of the tension and compression element 
to the total weight. 


V. Inflatable Tensairity Wings 


Several ways to insert Tensairity in a wing structure were investigated, but eventually a fully inflatable multi 
spar wing with two discrete Tensairity elements was chosen. The spars with Tensairity are placed at about 15% and 
70% of the wing chord. The positions of the tension and compression elements of the Tensairity spars are indicated 
with the black dots in Fig. 12. In this way the advantages of Tensairity and an inflatable wing can be fully used. As 
shown in Fig. 12 an outer hull was used to smooth the surface. The trailing edge was made of foam. 





Figure 12. The Tensairity spars in the 4 m* kite (MH 91 airfoil). 


To investigate the applicability of Tensairity in wing structures, two kites were constructed. These kites enabled 
construction of prototypes in a short time and allowed to test Tensairity wings in flight without the need of power 
and/or radio control. The first full-scale model was a 2 m” kite with a span of 3 m. This size generates enough power 
to do measurements and is big enough to ease production. For production reasons the taper ratio was chosen equal to 
1. A second prototype has a surface area of 4 m’ and a span of 4.5 m to generate more force and to make the period 
of the oscillations in the flight dynamics of the kite a bit larger (Fig. 13). 








Figure 13. The complete 4 m’ Tensairity kite after inflation. 
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Because of the low internal air pressure in the order of 5000 N/m’ in the Tensairity wing lightweight materials 
can be used. For the restraint and the outer skin, kite fabric (rip-stop polyester (Icarex, 31 gr/m’) or rip-stop nylon 
(Chikara, 42 gr/m’) with a thin PU coating) is used. The bladders are made of thermoplastic polyurethane (56 gr/m’) 
or Mylar (MC2, 21 gr/m’). To fix the shape to the trailing edge of the wing low density polypropylene foam 
(Eperan, 20 kg/m*) is used. The Tensairity compression elements are carbon fiber kite tubes (Exel) and the tension 
elements are made of Aramid (Edelrid)/Vectran (Marlow ropes) lines. These materials lead to a total weight of about 
600 gr/m’ for both kites. Depending on the size of the wing and the number of cells of the restraint, the tension and 
compression element make up only between 10% to 15 % of the total weight. The tension and compression elements 
of both kites are designed such that the wing can take up a maximum load of 250 N/m”. The use of kite fabric meant 
that the restraints could be made with a simple sewing machine resulting in fast and simple production. 

The kites where tested on different beaches at the coast of the Netherlands, dependent on the wind direction, 
because a sea-wind is preferred. The wind speeds ranged from 4 m/s to 12 m/s (3-6 Beaufort). The test flights had in 
general a short duration (up to 5 minutes) and after each flight the kite was adjusted, especially in the bridle, or the 
test was canceled. First tests were made with 2 lines but a 4 line bridle proved to give more control. In the final 
bridle the power lines were connected to the front spar at the compression element, while the steering lines where 
connected to the compression element of the rear spar or even at the tips of the kite. To reduce the deflection at high 
loads and to increase the maximum allowable load of the kites, an extra attachment point was added in the middle of 


the front spar (Fig. 14). 
‘i DOF 


Figure 14. The final bridle of the Tensairity kites (the arrows indicate the kite lines). 
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Figure 15. The 2 m? Tensairity kite with 4 control lines in full flight 





Due to the high L/D of the kites and the very flat shape, the kites were very responsive and fast. Only 
experienced kiters were able to fly with the kites. Because of the airfoil the small kite tended to be unstable at small 
angles of attack and the 4 m’ kite proved difficult to handle (on the ground) in high winds, because of its size and 
stiffness. During the test flights the tips of the bladders (especially the thin Mylar bladders) proved to be very 
vulnerable, 
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VI. Conclusions 


The new structural concept Tensairity does have interesting properties for wing structures. Theoretical 
investigations show, that the weight of a Tensairity structure is comparable to an optimized truss structure and much 
lighter than a simple inflatable structure for slenderness values typical in wings. Experimental tests with a Tensairity 
web girder confirm that the stiffness is much higher in Tensairity compared to a corresponding conventional 
inflatable structure with the same internal pressure. Therefore, compared to conventional inflatable wings, either a 
considerable amount of weight can be saved at a comparable stiffness or a considerable increase of stiffness at a 
comparable weight can be gained with Tensairity wings. The increased stiffness can be used to increase the span, for 
better performance or to increase the load bearing capacity of the wing. The lower pressure has consequences for the 
weight (use of lighter and cheaper materials) and the survivability. In case of a leakage it is much easier to 
compensate the air leakage with a small integrated pump, while still keeping structural rigidity. On the spot repairs 
of fissures are possible with simple methods at low pressure, too. Since the rigid elements are placed inside the 
wing and thus protected by an aircushion, the Tensairity wing has a good crash resistance. The ability to flex under 
gusts and overloads of a Tensairity wing is slightly less compared to a normal inflatable wing due to the presence of 
the rigid compression elements, but this is compensated by the higher allowable load. A disadvantage of the 
Tensairity wing presented in this work compared to conventional inflatable wings is that the Tensairity wing can not 
be deployed by simple inflation due to the rigid compression elements. The set up requires some small assembly by 
inserting the compression elements in their pockets, making the Tensairity wing not feasible for some special 
applications. 

Practical testing of Tensairity wings by using them as kites worked well, although they are difficult to fly. This 
however is not related to the Tensairity concept but to the critical stability behavior of a rectangular wing which was 
chosen to simplify the manufacturing of the Tensairity kite in this work. Other wing shapes e.g. delta wings can be 
realized with Tensairity without conceptual problems. But using a Tensairity wing as a kite was a fast, cheap and 
easy method to get a first impression of the flight behavior of a wing and to refine the constructional details 

The application area of a Tensairity inflatable wing is very wide. It can be used in Ultra Light aircrafts like the 
Pneuwing of ‘prospective concepts’, or hang-gliders like the Pneumagic. It would also be very interesting for small 
UAV’s like the ones developed by ILC Dover and the University of Kentucky, which have a comparable restraint 
shape. Currently the research with Tensairity wings is mainly focused on kites, because the L/D of a Tensairity kite 
is due to the good airfoil at least twice as high as in conventional kites. This makes the Tensairity kite interesting for 
use as a high altitude record kite (research on this subject was conducted at the faculty of aerospace engineering as 
part of an ESA project'" '*), as part of a wind power plant or for ship propulsion. 
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Birds and their Wing Shapes 


Activity Information Sheet 


With over 9,000 species of birds in the world and over 715 species in just the United States 
alone, you can find birds almost everywhere you go. Because birds are found in all types of 
environments, bird species come in all sizes and shapes with wings that are adapted for their 
survival needs. What, specifically might a bird need its wings for? Take some time to think 
and then list your ideas here: 


On these pages we are going to focus on flight. There are four general wing shapes that are 
common in birds: Passive soaring, active soaring, elliptical wings, and high-speed wings. 


Passive soaring wings have long primary 
feathers that spread out, creating “slots” that 
allow the bird to catch vertical columns of hot 
air called "thermals” and rise higher in the air. 
Examples of birds with this wing type include 
eagles, most hawks, and storks. 

The Bald Eagle is a classic example of a bird with 
passive soaring wings. 


Active soaring wings are long and narrow, 
allowing birds to soar, or fly without flapping — 
their wings, for a long time. However, these birds : 
are much more dependent on wind currents 
than passive soaring birds. Examples of birds 
with this wing type are albatrosses, gulls, and 
gannets. 

The Laysan Albatross has very long and narrow 
wings that are well-adapted for the strong, 
constant winds over parts of the ocean. Its up- 
and-down flight style allows it to use small 
differences in wind speed to go long distances 
without using a lot of energy. 


| Photo by J. Young 
L — 


| Photo by Yuji Nishimura 
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Elliptical wings are good for short bursts of high speed. They 
allow fast take offs and tight maneuvering. While they allow high 
speed, the speed cannot be maintained. Examples of birds that 
have this wing type are crows, ravens, blackbirds, sparrows, and 
thrushes (such as the American Robin). 

7 ae Common Ravens are acrobatic flyers. It’s not uncommon to see 
none : ravens doing rolls and somersaults with their elliptical wings. 


High-speed wings are long and thin, but not nearly as long as birds 
with active soaring wings. As the name suggests, birds with this wing 
type are incredibly fast, but unlike those with elliptical wings, these 
birds can maintain their speed for a while. Examples of birds that have 
this wing type are swifts, ducks, falcons, terns, and sandpipers. Notice 
the slender wing form of the Forster's Tern. 


And a final type of wing.... 


Hovering wings are small and quick. For hovering wings, in addition 
to the wing shape, the bird’s nerves and muscles are specially adapted 
for incredibly fast movement. 

The Ruby-throated Hummingbird might appear to float in 
space while sipping nectar and hovering from flower to flower 
thanks to its quick and mighty wings. 





What paper airplane models do you think reflect the wing 
types we ve featured here? 





Wings also might be adapted for any of the following: migration, attacking prey, attracting a 
mate, camouflaging into the surroundings, swimming in a lake or the ocean, or simply for 
fluttering around for food. How might you adjust your paper airplane to fit any of the 
functions above? (Be creative!) 


This information presented here is provided by the Cornell Lab of Ornithology and the activity 
is by the BirdSleuth K-12 Education Programs of the Cornell Lab of Ornithology. To find more 
information, activities, or science content on birds, please visit www.BirdSleuth.org 





Resources for constructing different types of paper airplanes 
The links below can be stretched from 4th grade up. 
1. Fun Paper Airplanes- a great site with a variety of levels of ability 
www.funpaperairplanes.com/ 
2. Alex’s paper airplanes- many different types for more experienced students 
www.paperairplanes.co.uk/planes.php / 
3. The World Record Paper Airplane Book by Ken Blackburn and Jeff Lammers 
4. Your imagination! Some kids have designed their own beautiful airplanes that were 
inspired by birds. 
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Optimization of a Boomerang shape using modeKFRONTIER 


Rosario Russo* and Alberto Claricht 
Esteco Spa, Padriciano 99,Trieste, Italy, 34149 


Enrico Nobilet and Carlo Poloni§ 
Universita di Trieste, Via Valerio 10, Trieste, Italy, 34127 


The paper describes the optimization of a boomerang, simulating its trajectory by a dynamic 
model coupled to CFD analysis to compute aerodynamic coefficients. The optimization process 
flow and formulation is built within the commercial process integration and optimization 
software modeFRONTIER. The design variables involved are primarily the geometric 
parameters to change the shape of the boomerang. To steer the complete process of 
optimization, from variables variation to performance evaluation, modeFRONTIER is coupled 
to Catia v5 software for geometry modification and mass properties evaluation, to MATLAB 
for dynamic simulation, and to the commercial Computational Fluid Dynamics (CFD) tool Star- 
CCM+ for aerodynamic analysis. In addition, dedicated RSM (Response Surfaces Methods), 
available in modeFRONTIER, are used to extrapolate the aerodynamic coefficients as a 
function of the angle of incidence and velocity, as required by the dynamic model, through a 
reduced number of CFD analyses (database) for each geometric configuration. Different design 
simulations are therefore executed automatically by modeFRONTIER following a dedicated 
optimization strategy, until the optimal configuration of the boomerang is found, accordingly to 
the specified requirements, such as minimum energy required for the launch, maximum 
accuracy in returning, and a guaranteed minimum range. The physical complexity of this, 
apparently simple, problem, and the not standard application case, has been selected as an 
interesting benchmark that can be disclosed in full to test the multi-objective and multi- 
disciplinary capabilities of the optimization environment modeFRONTIER. 


Nomenclature 
0, = first boomerang precession angular velocity 
OQ, second boomerang precession angular velocity 
V = boomerang center of mass velocity magnitude 
XYZ = absolute fixed reference system 
XYZ = partially fixed on the boomerang reference system 
123 = fixed on the boomerang reference system 
X,Y,Z = boomerang center of mass absolute position 
0,,Q,,0, = angular velocity of the reference system xyz 
0,9,W = Euler angles of the reference system xyz with respect to the reference system XYZ 
p angle between y axis of the xyz reference system and 2 axis of the 123 reference system 
Wy Wy Wz = boomerang angular velocity components 
W = boomerang angular velocity magnitude 
i oe Gere = aerodynamic plus gravitational force components acting on the boomerang 
F, = aerodynamic force acting on the boomerang 
I ycl yin op ip = aerodynamic torque components and aerodynamic torque acting on the boomerang 
K, = boomerang gravity force 
F (WY, U) = non dimensional aerodynamic force acting on the boomerang 
T(Y, U) = non dimensional aerodynamic torque acting on the boomerang 
Y angle between boomerang center of mass velocity and boomerang plane of rotation (incidence angle) 
a = boomerang characteristic dimension (radius) 
I, 15,1 = minimum, medium and maximum boomerang principal moments of inertia 
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= non dimensional boomerang velocity 

= air density 

boomerang mass 

= gravity acceleration 

Reynolds number 

angular velocity of a generic moving reference system 

= origin velocity of a generic moving reference system 

= position of a point that belongs to a generic moving reference system 
velocity of a point that belongs to a generic moving reference system 
= origin velocity of a generic moving reference system 

= fluid velocity with respect to a generic moving reference system 


.s 
Aa ZvS 
Hou 


st El 
] 


Si =] Ut 
= 
I 


o 


generic external force vector 
= generic fluid pressure 
= viscosity stress tensor 
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I. Introduction 


OOMERANGS were born in Australia, they were created by Aborigines as tools for hunting and fighting purposes, 

but in a different variety also as toys. The word boomerang, “bumarangs” originally, comes from an Aborigines 

sub-group, the Turuwal, that lived close to Sidney, and means “the fighting weapon’”® . The real “bumarangs” were 
in fact effective weapons having a range of about 130-140m and having been known to pass completely through an 
adversary or prey when the body was struck directly by the edge of the weapon. 

Nowadays boomerangs in the world are used as outdoor play objects, and one requisite they must have is that once 
thrown they follow a curved path and return back to the thrower. The objects called “bumarangs” by Aborigines were 
not returning, but they also used a returning similar object called with a different name: “bargan’’, that means “bent like 
a sickle” or “crescent moon”, because of its shape®. This object was used as a toy to be thrown for amusement, since it 
was not suited for hunting or fighting: unlike the “bumarangs” in fact the “bargan” was very hard to aim, and of course 
when hitting a target it stopped from returning to the thrower, pretty much defeating the purpose of the design. However 
the name “bargan” was lost and only the word “boomerang” passed into the English language and has become known 
all over the world. So when we say boomerang we are referring to a weapon only from the etymological point of view. 

In any case a boomerang is a sort of (in most of the cases wooden) object which can be thrown in such a way that 
it rotates rapidly, a condition that produces an interaction with the air that makes it follow a path very different from 
that of an ordinary thrown stick: in the not returning variety this interaction makes the object assume a pretty steady 
straight trajectory and allows it to reach a good distance at a very high speed, while in the returning variety the 
interaction allows the object to draw a curved circular trajectory coming back to the thrower. 

In this article we will focus only on the latter; the main purpose will be designing an easy to be launched boomerang 
able to draw a nice returning trajectory, satisfying at the same time a constraint on its range (i.e. the maximum distance 
reached from the thrower). Since each different boomerang needs to be launched in a slightly, or completely, different 
way also the lunch parameters will be optimized for each new candidate design. 


II. Boomerang flight physics 


Typical boomerang are about 30-50 cm across, but even very small or very large varieties exist, moreover many 
different kind of shapes are possible: from the traditional “v” shape or “moon” shape with two arms to the most 
complicated ones with weird shapes and three or four arms. 


Shapes, dimension and weights are very different (Fig.1*) es, lg 
according to the main task the launcher wishes to achieve el Pi ti: 
which can be more focused on obtaining a long distance path or oe — " Cc om ~ iret 
very short or conversely very long time flight. a P ~~ ; 
— Of NY " x. wn) 
Ye ~~ ie ff — ~ * 7 a - 
However in any boomerang it is possible to identify on each = aS a sor 
arm some blades having a typical aerodynamic shape: F me fy SS or" 
Jf NW Y S 
e radial blades (airfoils) on the edge of each arm; val = eee ir~» 
e arc blades (stabilizers) on the tip of each arm. fo % la \ 


In Fig.2 a boomerang with two arms is sketched (the angle 
between them is 180°), and the position of the blades is shown’. 





Figure 1:Different boomerangs shapes 
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Figure 2: Airfoil (or radial blades) and stabilizers (arc blades) in a simplified 180° boomerang. 


The radial blades work during the entire rotation of the boomerang, but mainly when their velocity is opposed to the 
boomerang center of mass velocity, while the arc blades work only when the corresponding arm is aligned enough with 
the translational velocity, as depicted in Fig.3°. 
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Figure 3: Radial blades and arc blades position at their maximum efficiency. 


A boomerang must be thrown with its plane of rotation almost vertical in an almost horizontal direction and with a 
high spin. The boomerang at the beginning seems to fly away but after a while it swerves to the left (for a right handed 
boomerang) and also upwards, draws a wide loop approaching the thrower and, in some cases, draws close to the 
thrower another loop. The path can then assume a sort of “o” trajectory or an “8” trajectory. Some typical boomerang 
trajectories are illustrated in Fig.20 and Fig.22. The boomerang plane of rotation, vertical at the beginning, is closer to 
the horizontal at the end. 

This peculiar flight is due to the boomerang interaction with the air by its blades. The key point is the high spin: the 
interaction with the aerodynamic forces exerted by the blades causes two kind of precessions, the first (angular velocity 
(),) responsible for the boomerang return and the second (angular velocity (11,) responsible for the boomerang plane of 
rotation mutation (vertical at the beginning and almost horizontal at the flight end) that in turn allows the boomerang to 
stay in air for more time (the lift component that contrast the weight increases as the boomerang rotation plane becomes 
more and more horizontal). The ratio between first and second precession angular velocity affects the trajectory: 


QO ; , 
e = = 4 result in an almost “o” trajectory (a complete first precession loop corresponds to a 
2 
quarter of the second precession) 
fo) ; ; , 
e = < 3 the boomerang does not come back and flies forward with a corkscrew trajectory 
2 
0} 
e > 4 the boomerang remains oriented vertically too long (lift force component does not 
2 


contrast gravity) and falls down with a classic ballistic trajectory. 


The first precession is caused by the radial blades. Let us consider an xyz reference system having origin always 
placed in the boomerang center of mass, with the x axis oriented as the translational velocity and the z axis oriented as 
the boomerang rotational axis (Fig.4’). The combination of spin and translational boomerang velocity makes always one 
radial blade have a higher relative air velocity than the other blades. The higher the relative velocity is and the higher 
the lift force (orthogonal to the boomerang plane of rotation) is exerted by the corresponding boomerang blades. The 
unbalance between the lift forces on the different blades causes a torque having an x momentum component that in turn 
causes a precession around the y axis. Moreover the lift forces on the arms make the center of mass of the boomerang 
follow a curved trajectory. 





Figure 4: Boomerang main precession responsible for the boomerang return. 


The second kind of precession is caused instead by the arc blades. Each arc blade indeed becomes active when the 
boomerang gets a position in which radial blades are more or less parallel to the flight direction as mentioned before 
and the arc blades are built, by a proper dihedral angle and angle of attack (see Fig.12), in such a way that the air 
impacting will exert on the arm a lift force having a y momentum component that causes a precession around the x axis 
(Fig.5°). This precession is responsible for the mutation of the rotation plane of the boomerang that tends to lay down 
and hence helps the radial blade lift forces to contrast the gravity. 

Another additional phenomenon (mainly for two arms boomerang), responsible for the boomerang plane to lay 
down during the flight, is given by the position of the arms (the angle for two arms boomerangs is less than 180°) that 
are often designed in order to provide a different lift. The arm exerting the higher lift force is the one that at its 
maximum lift position is placed forward with respect to the center of mass (see Fig.6 left), so that it will give a torque 
with a y momentum component that causes a precession around the x axis, the other arm instead is placed behind the 
center of mass at its maximum lift (see Fig.6 right) and in a similar way it will contrast the previous precession, but this 
arm effect is lower than the previous, since its lift force is lower. So the final effect will be the precession of the 
boomerang around the x axis. 

For two arms boomerang the arm that is forward with respect to the center of mass at its maximum lift is called 
leading am, the other one is called trailing arm. From now on we refer to this two names when talking about the 
boomerang arms (see also Fig.11). 





Figure 6: Boomerang positions when each arm exerts the maximum lift force. The center of mass position with 
respect to each arm is different. 


III. Equations of the boomerang motion 


In order to write the equation of motion three different reference systems are considered, as depicted in Fig.7': 


Reference system XYZ: it is an external inertial reference system, in which we wish to find the 
path of the boomerang center of mass. 

Reference system /23: it is fixed to the boomerang. Its origin is placed in the boomerang center of 
mass, the axes /23 are the principal axes of inertia of the boomerang with I, < I, < I; 

Reference system xyz: it is a system partially fixed on the boomerang: its origin is always in the 
boomerang center of mass and its z-axis coincides with the 3-axis of the previous reference 
system. The projection of the boomerang center of mass velocity on the xy plane is always 
directed as —x. (Oy, O, 0) is the angular velocity of this system. 





Figure 7: Reference systems considered for the boomerang motion equations 


The position of the reference system xyz with respect to XYZ can be identified by the classical Euler angles (J, ~, W), 
as shown in Fig.8’. 





Line of nodes 


Figure 8: xyz reference system position with respect to the external XYZ and its Euler angles. 


To write the equation of motion it is possible to proceed in this way: 

1) the motion of the center of mass is written with respect to the xyz reference system; 

11) the motion of the boomerang (basically the position of its principal axes) is written with respect to the 
center of mass; 

111) the previous equations are smoothed during a complete rotation of the boomerang i.e. the boomerang 
angular velocity (Wy, Wy W, ) is averaged over a single rotation period; in this way any term 
multiplied by sin(2@)or cos(2q@) can be discarded (where @ is the angle shown in Fig.7); this 
simplification is justified, since we are not interested in the fast fluctuations of the boomerang motion, 
that have a characteristic time of the order of variation of one rotation period or less; 

iv) since the boomerang spins rapidly w,, Wy,Q, K Wz; Wy, K WyWz; Wy K WW, so the corresponding 
terms are discarded; 

v) the previous equations are written with respect to the XYZ reference system, using the Euler angles. 

By these steps it is possible to achieve the following equations’: 
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Where V is the velocity magnitude of the boomerang center of mass; m the boomerang mass; ¥ is the angle 
between the boomerang velocity and the negative x axis (namely the angle of incidence of the boomerang); V, @, w are 
the Euler angles of the xyz reference system; 7), T,,, T;, Fy, Fy, F, are torque and force components in the xyz reference 
system, basically due to the interaction between the boomerang and the air, and to the gravity force. The gravity force 
(that has the —Z direction of the external reference system) can be expressed in the xyz reference system as K, = 
—mg(sinv¥ sin wy, sin 9 cos w, cos). 

The absolute position of the boomerang center of mass can be found as function of the previous parameters by: 


X =V(—cos W(cos cos gy — siny sing cos 9) — sinW sing sin 9) 
Y =V(—cos V(cosw sing +siny cos g cos 9) + sin cos @ sin¥) (3) 


Z =V(—cos¥ siny sind — sin ¥ cos 9) 


The equations (1) (2) and (3) can be integrated numerically once the initial condition are provided and the forces and 
torques are available at any time. 

It is worth to note that being these equations smoothed over a complete rotation of the boomerang around its axis, 
any high frequency fluctuation is not modeled and then discarded, so it is not possible to assess by these equations the 
stability of a candidate boomerang. According to this model indeed any candidate boomerang is considered stable a- 
priori, and thus this means that it cannot start “wobbling” and eventually losing its spin and falling down as some badly 
designed boomerangs do. The candidate boomerang stability will then be evaluated by some considerations involving 
its moments of inertia relative values. 

It has been observed in fact that the instability is likely to occur when the ratio between the maximum principal 


moment of inertia and the medium one is not high enough, therefore a fairly safe condition is the following”: 


I, — Ip 





s= > 0.1 (4) 
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IV. Boomerang Parameterization 
The boomerang geometry chosen for optimization will be the classical two arms “V” and “Q” shape type. Different 
parameters can be optimized once the maximum dimensions of the boomerang are fixed. 
The most important parameters that affect the boomerang behavior are linked to the blades profile, to the angle 
between the two arms and to the dihedral of the arms. 


A. Blade profiles 

Some typical radial blades profile are shown in Fig.9”. 

Changing the profile by playing with the angle of attack and cut on the top of the leading and trailing edge can 
change a lot the lift provided by the arm. 

The lift as described before affects the turn capability of the boomerang (increasing in general 01, ). 

The arc blades are in general designed with a positive angle of attack; these blades, as described above, help the 
boomerang plane to lay down (affecting 0.,) and help the boomerang to float in air. 

For the parametric boomerang geometry a flat bottom airfoil has been chosen. The blades profile are built by a 
Bezier!” parametric curve, with 4 control points. 


Figure 9: Possible profiles for radial blades. 


The profile shape is modified by two parameters: 
1. the vertical position of one of the Bezier control point 


2. the horizontal position of the same control point 
In this way it is possible to change the angle of attach and the thickness of the blades (see Fig.10). 





Figure 10: Effect on blade profile of horizontal (left) and vertical (right) position of second Bezier control point 


In order to reduce the number of parameters, the profiles of the leading and of the trailing arm are controlled by the 
same parameters. In particular the vertical position of the trailing am is set as a fraction of the vertical position of the 


leading arm (the trailing arm lift effect is preferred to be lower). 
By the way it is interesting to note, as shown in Fig.11, the position of the leading and trailing edge of the two arms. 


B. Dihedral angle 
Boomerangs arms usually have a positive dihedral angle that is about 10°-15°; the dihedral affects both the lift and 


the lay down velocity of the rotation plane, keeping practically unchanged the mass of the boomerang. 
Trailing edge 


TRAILING ARM 45 





Leading edge 


LEADING ARM 


Figure 11:Leading and trailing edges position in a two arms, right-handed boomerang. 


The boomerang parametric model is provided with the two parameters q@ and d that allow to change the dihedral by 


removing a small amount of material at the boomerang arms tips (Fig.12). The @ parameter is basically the stabilizers 
angle of attack. 
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Figure 12: AA sections showing the effect of the a@ and d parameters on the boomerang arm dihedral. From the 
top it is visible first the effect of the d modification and then the effect of the a. Note that when d is at its maximum 
value the maximum possible amount of material is removed from the tip, at a given a (like that shown in the sketch on 
top). 


C. Angle between arms 
This angle usually ranges between 70° and 140°. In fact this parameter has a great effect on the boomerang stability 
and the considered interval corresponds to the aforementioned criterion equation (4), (even if a good thrower might be 


72 ~ 0.015). 
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able to launch a boomerang with angle between arms close to 180° corresponding to 


D. Other parameters 
The other parameters that affect the boomerang behavior are some geometrical dimensions, such as the arms width 
and the arc blades width and length, together with some angular parameters controlling the boomerang base profile 
shape (to be more similar to a “V” shape or to a “Q” shape). The length of the arms is fixed to have a constant 
overall size of the boomerang. At the end for this first optimization activity 9 input (design) parameters have been kept. 
Some examples of the different boomerangs obtained by the parametric model are shown in Fig.13. 





Figure 13: Boomerang geometries obtained by the parametric model 


V. Boomerang trajectory computation and optimization 


In order to solve the ODEs (Ordinary Differential Equations) (1) (2) and (3), a proper numerical integration method, 
like a high order Runge-Kutta method, could be used. However, in order to proceed, it is necessary to know the 
aerodynamic forces and torques values with respect to the xyz reference system at any given time (the gravity force can 
be easily expressed in this reference system as seen before). 

The general aerodynamic forces depend on: 

e air properties such as density, viscosity, turbulence; 

e the shape of the boomerang; 

e the boomerang state of motion, namely depend on the parameter W, w, V, d; 

e on the previous history i.e. on the conditions occurring in the boomerang flight before the 
considered time. 

By a dimensional analysis it is pretty straightforward finding that the aerodynamic forces and torques can be 
expressed as: 


r - Vi Oy Wy 
F, = pa‘V*f(¥, 6, —,—,— Re, History, Shape) 
W,a Wz Wz 
7 (5) 
—, se, Re, History, Shape) 
Z Wz Wz 

Where a is a characteristic boomerang dimension (for example its maximum radius), p the air density and Re the 
Reynolds number. Since the fluid is considered as incompressible the Mach number does not appear in the previous 
expression. 

From now on the History is discarded by assuming a quasi-steady approach, namely the flow field is considered 
constant at any time before the current one. This is partially justified since we are interested in averaged (smoothed) 
forces Over an entire rotation period and not at the actual current time, so the forces at a given time step can be 
considered averaged in a period, and it is assumed that they are not significantly affected by the previous history details. 
ny 
Oz 
can be discarded (they can always be assumed equal to 0). At the end once the boomerang shape is fixed we obtain for 
aerodynamic forces and torques: 


T, = paeV7F(Y, dp, 





Expression (5) can be dramatically simplified also considering that the boomerang spins fast, so the terms @, er 


F, = pa*w2F(¥, U) (6) 
T, = pa®°w2T(¥,U) (7) 
with U = — (8) 
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With respect to (5) the factor V has been substituted with w,a that, from a non-dimensional point view, is 
equivalent, and moreover the Re number dependency has been discarded. Althoug this is not completely rigorous, the 
experimental evidence suggests that, within typical Re numbers range for boomerangs, the dependency on Re is quite 
negligible with respect to other parameters’. 

A candidate boomerang trajectory is finally obtained through the flowing steps: 


1) foracertain number of Y and U couples the corresponding values for F, and T q are computed by CFD 
simulations; 


ii) response surfaces for F(W, U)and T(Y, U)are built; 
111) equations(1) (2) and (3) are integrated starting from given initial conditions and using the response surfaces 
computed previously to express forces and torques at any position and time. 


The trajectory of the boomerang is affected of course by the initial conditions, namely by the way the boomerang 1s 
launched: in fact different boomerang types are to be thrown in a very different way to obtain a returning trajectory. 
So, in order to compare different boomerangs, for each candidate geometry design it is necessary to perform an internal 
optimization to find the proper launching parameters. Four launching parameters are considered: 
V: initial boomerang translational velocity; 
Spin: initial boomerang spin; 
Aim: angle between the initial boomerang translational velocity and the horizontal; 
Tilt: angle between the initial boomerang rotational axes and the vertical (0° tilt corresponds to a vertical 
boomerang plane of rotation). 

The purpose of this internal optimization is to find the 4 parameters set that allow a returning trajectory within one 
meter from the launcher position. 


The reader may ask why a “direct approach’, 1.e. a CFD-only strategy, was not considered to find the boomerang trajectory. In fact 
modern CFD numerical codes nowadays would potentially allow to simulate completely a boomerang flight taking into account 
directly its interaction with the air. This could be done solving numerically the classical Navier Stokes equations coupled with a 
general six degree of freedom rigid body solver that computes the position of the rigid body at the following time step once forces 
and torques exerted by the fluid at the previous time step are given. This approach would overcome the limitations previously seen, 
since the smoothed approach is avoided and the history is automatically taken into account in the aerodynamic forces and torques 
computation. However this cannot be easily done for many reasons. First the interaction of the boomerang motion with the relative 
turbulent air motion is not easy to be modeled and a not smoothed approach would be not trivial, since probably the standard 
Reynolds Averaged Navier Stokes (RANS) approach is not ideally suited for this task, and a much more expensive Large Eddy 
Simulation (LES) approach would be a better choice (it has to be noted that the time interval to be simulated should be equal to the 
boomerang flight time that is 10 seconds, or even much more, as an order of magnitude, so the time taken by each single simulation 
can become unaffordable in terms of computational effort). It is also to be considered that the boomerang should be actually moved 
into the computational domain to find its trajectory, and although this can be avoided using proper moving reference systems (with 
motion varying at each time-step), this is not straightforward, and moreover the mesh may need to be dynamically adjusted to allow 
to catch properly the flow characteristic. Last but not least the trajectory computation would be not decoupled by the CFD simulation 
so it would not be feasible computing quickly different trajectories varying the initial conditions to optimize the launch parameters. 
For these reasons the “direct approach” was considered not feasible and discarded. The previously described smoothed approach 
allows instead to use classical CFD RANS solvers without excessive difficulties as described in the next paragraph. 


VI. Aerodynamic forces computation details by CFD 


First of all let us report the classical Navier Stokes equations (considering constant fluid density) written with 
respect to a moving reference system: 


V-v, =0 (9) 
Ov > 2 are > > 1 = rR 
a ee Na ea ee 


Where @ is the angular velocity of the moving reference system; V; is the velocity of the moving reference system 
origin; 7 is the position vector with respect to the moving reference system; U, = DV, + @ X Fr is the velocity of a generic 
point that belongs to the moving reference system; v, = V — U, the fluid velocity in the reference system. 

The CFD software employed is Star-CCM+ from CD-Adapco. 

Two different approaches are possible. 

The first consists in using a static grid, taking a reference system fixed with respect to the boomerang. This 
approach would have the advantage that a steady state analysis can be run, since the velocity field with respect to this 
reference system is almost steady and so a time accurate solution is not requested. The drawback of this approach is that 
the position of the wake is “spread” in the domain and so it is not possible to refine the grid just in the wake region. 

The second approach consists in using two reference system one external and inertial, the other fixed with respect to 
the boomerang and having its origin placed in the boomerang center of mass. Also two domains and two grids are used: 
one spherical having its origin placed in the boomerang center of mass and associated to the boomerang reference 
system, and the other around that domain with an external parallelepiped shape associated to the external reference 
system. 

Since we are interested in evaluating aerodynamic forces and torques at different U (.e. at different translational 
boomerang velocity and rotational speed, see (8)) and at different Y (namely at different angles of incidence of the 
boomerang), the translational boomerang velocity is always set to zero while the external air inlet velocity is set equal 
to the boomerang translational velocity (fixed for a given U and for an assigned boomerang rotation velocity) with the 
correspondent angle of incidence. The internal spherical domain is provided with a rotation velocity around an axis 
normal to the boomerang plane and passing through the boomerang center of mass. The information between the two 
domains is exchanged by an interface boundary that allows to interpolate the field values. 

This second approach, unlike the previous one, has the advantage that the wake is almost always placed in the same 
location of the computational domains, but the drawback is that the flow is unsteady so to obtain forces and torques it is 
necessary to simulate a proper interval of time and time-average the results. Another additional advantage is that by 
choosing the described external reference system it is immediate to compute forces and torques with respect to the xyz 
reference, as requested. This second approach was then chosen. 

The mesh is built in Star-CCM+, using a polyhedral mesh within the sphere around the boomerang, with prisms 
layers at the boomerang walls, and an hexahedral mesh in the rest of the domain (see Fig.14). The two-equations RANS 
SST (Shear Stress Transport) turbulent model, with wall functions, is chosen and a segregated solver with constant 
density is employed. 

As preliminary study different simulations have been carried out at different U and ¥ to analyze forces and torques 
on a first baseline boomerang geometry, and to assess the minimum resolution of the computational grids. To change U 
a constant value for the boomerang spin equal to 6.67 Hz has been selected (a pretty fair mean value for the spin); the U 
value has been varied between O and 2, while Y between -5° and 45°, these ranges contain the typical values assumed 
during a correct boomerang flight'”: values outside are in fact either not realistic or may cause the boomerang to stall, 
wobbling and fall down. From this preliminary study it has been observed that at high U values forces and torques 
acting on the boomerang assume a good periodic behavior within 2 complete boomerang rotations while at low U 
values about 5-6 rotations are necessary. The simulated time period will be then chosen according to the U value 
considered. 

A mesh size of about 2.5 millions of cells, depicted in Fig.14, has been employed being a good tradeoff between 
accuracy and computational efforts. 

Fig.16 shows the pressure on a boomerang surface during a rotation. It is possible to see how the pressure force on 
each arm changes a lot during the rotation according to the relative position of the blades with respect to the 
translational velocity. 
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Figure 14: particulars of the mesh section at the boomerang mean plane. 
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Figure 15: lift force on the boomerang vs time (the periodic behavior is achieved in few rotations). 
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Figure 16: pressure field on the boomerang and on a plane at the boomerang mean section, during a rotation. 


At the end of the numeric simulation (involving a Y, U couple) the averaged forces and torques acting during the 
last simulated rotation are computed and then the corresponding F, and Tare available. 


VII. Complete process flow automation in modeFRONTIER 


The whole process needed to compute and optimize the boomerang, by changing the CAD geometrical parameters 
until the needed objectives are satisfied, has been completely automatized through the software modeFRONTIER”' 
widely used in industrial engineering design. 

Traditional design approach (‘trial and error’) usually requires many attempts of the designers, which every time 
need to modify their numerical models by hand and run several solvers, especially when it is difficult to know a priori 
in which direction of the multi-dimensional variables space to move in order to find the best solutions. Conversely, the 
multi-objective design environment modeFRONTIER allows to integrate different computational software (any 
commercial or in-house code) into a common design environment, thus allowing the automatic run of a series of 
computations proposed by a selected optimization algorithm, until the specified objectives are satisfied. 

In this modular environment, each component of the optimization process, including input variables, input files, 
scripts or direct interfaces to run any software, output files, output variables and objectives, is defined as a node to be 
connected to the other components. 

In this way, the complete logic flow, from parameterization to performance evaluation, is defined by the user, that 
can select among several available optimization algorithms, accordingly to the objectives defined, including Genetic 
Algorithms, Evolutionary Algorithms, Game Strategies*, Gradient-based Methodologies, Response Surfaces” (to speed 
up the convergence of optimization, approximating the response of the system by the use of the available Meta- 
Models), and Robust Design Optimization (optimization under uncertainties on input parameters). Statistical and 
Multi-Variate Analysis tools can be applied to find important information about the influence of the parameters in the 
system, in order to face up problems characterized by a large number of parameters and large computational efforts with 
the highest efficiency. 

modeFRONTIER is able to make automatic the boomerang trajectory computation through the following steps: 


1) modify the boomerang Catia model parameters, for each configuration design proposed by a selected 
optimization algorithm available in modeFRONTIER; 

11) obtain the updated geometry (stl file) from Catia for each proposed configuration and pass it to Star- 
CCM+ execution module; from Catia also boomerang mass, center of mass position, and moments of 
inertia are obtained; 

11) incase condition (4) is not fulfilled discard the design without performing any CFD simulation; 

iv) launch Star-CCM+ to build the computational mesh around the boomerang; 

v) launch different Star-CCM+ simulations using the same mesh prepared as above varying U and Y 
parameters for an appropriate number of samples (¥ is changed by rotating the spherical mesh around 
the boomerang center of mass accordingly); for each U and VY couple the corresponding forces and 


torques are obtained and can be averaged over a proper time interval to obtain finally Fand Ts 
vi) use the set of simulations computed in v) as training set to build response surfaces to obtain F(¥, U)and 


T(Y, U) over the whole range of variation of the two parameters ,U ; dedicated response surfaces 
algorithms —metamodels - are available in modeFRONTIER ’; 

vii) pass the response surfaces and the boomerang data (mass and moments of inertia) to a MATLAB script 
able to compute the trajectory by integrating (1) (2) and (3) using a 4th order Runge-Kutta method (see 
Fig.20 for some complete trajectories examples); 

Vili) run an internal optimization for the given configuration considering as input variables the previously 
described 4 launching parameters and as objective the arrival distance (to be less than | m); 

ix) the main multi-objective algorithm assesses how good the trajectory is with respect to one (or more) 
specified objectives (for this paper we just considered as objective the total energy needed for the 
launch to be minimized, while the global range is considered as a constraint, in order to avoid 
boomerangs giving a too small range) and propose a new boomerang configuration by changing 
accordingly the Catia model parameters; 

X) the steps i)-viil) are repeated automatically by the algorithm until one or more optimal configurations 
are obtained. 


The modeFRONTIER workflow is shown in Fig.17. Each module of the workflow is linked to the others in order to 
realize the automatic computational chain described above. 

In particular, on the top we find the nodes (green subsystem) that define the range of variations of all the parameters, 
then the process flow (black line) starts with the interfaces to select the optimization algorithms and set their 
parameters, to continue with CATIA direct interface that allows to automatically update the geometric model at the 
variation of the parameters, obtaining as results all the mass properties (cyan nodes, including inertia momentum to 
compute the instability limit of eq.4), and transferring the updated (Stl model to the following script node which is used 
to run Star-CCM+ to create the mesh for each proposed geometry. The mesh (.sim file) is then transferred to the 
following application node, which basically launches in batch mode another modeFRONTIER project file, which runs a 
set of CFD computations through Star-CCM+ on the same mesh varying U and W parameters, as described at point v) 
above. 
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Figure 17: modeFRONTIER main workflow 


The output from the internal modeFRONTIER project is a Response Surface (RSM) or Meta-model, based on the 


available training set, which is able to extrapolate and obtain F (PY, U)and T(Y, U) over the whole range of variation of 
the two parameters. The algorithm used for the automatic RSM training is Kriging’, and the model is automatically 
exported as a C script, which can be transformed by the following application of the process flow into a .mex file which 
is the format for Response Surfaces that can be read automatically by MATLAB. 





Figure 18: Response surfaces of boomerang aerodynamic coefficients (x and z direction forces vs angle and U) 


The last application node in the process flow of Fig.17 is therefore another modeFRONTIER Batch node, called 
“launch parameters tuning” in the picture. This node actually runs another modeFRONTIER project (see fig.19) in 
batch mode, for which the input variables are the 4 launch parameters, the boomerang shape is fixed and the objective is 
defined by the minimization of the distance from the arrival position and the launching position. For this purpose, a 
simple mono-objective algorithm is used (SIMPLEX), and the project just runs for each set of launching parameters a 
MATLAB script through the MATLAB direct interface, that basically drives a .m script which implements the Runge- 
Kutta integration for the boomerang trajectory computations (Fig.20), obtaining the F(Y, U) and T(Y, U) values 
needed to solve the equations directly from the Response surface (.mex file) available for each boomerang 
configuration. 
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Figure 19: modeFRONTIER internal optimizer (launch_parameters_tuning workflow) 


The final outcome of the modeFRONTIER Batch node in the main process flow of Fig.17 are therefore the nodes 
corresponding to the responses of each boomerang design, i.e. the trajectory performances to be optimized in the 
external loop. In this case, we extract the following responses: 


e Range: this is the maximum distance reached by the boomerang during its flight; it 
has just been considered as a constraint in the optimization, in order to avoid boomerang 
configurations having a too small range; it can be additionally considered is a criteria to 
select the final configurations among the candidate solutions (for instance, it might be 
favorable to choose an higher range configuration); 

e Accuracy: this is the difference between the position from which the boomerang 
is launched and the position where the boomerang returns (this quantity is optimized by 
the internal loop for each boomerang candidate solution) 

e Energy: this is the energy (translational plus rotational) necessary to launch the 
boomerang, that is a quantity to be minimized (to avoid excessive efforts for the launcher). 
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Figure 20: Some boomerang trajectories found by the smoothed motion equations integration. 


VIII. Optimization Strategy (Hierarchical Games) and Optimization Results 


Several tests were performed to find the proper number of simulations requested to create accurate enough response 
surfaces. It has been found that a matrix of 12 points guarantees an error of approximation less of 5%, and this was the 
size of the training set finally selected. 

This means that each boomerang trajectory computation needs 12 CFD simulations. For this reason a classical 
multi-objective optimization algorithm that may require hundreds of designs evaluations is not feasible, therefore a 
different strategy, based on Game Theory’, has been selected. 

As indicated in the previous chapter, two different objectives (returning accuracy and launch global energy) are to 
be considered, but actually any candidate solution is first optimized by the internal workflow (Fig.19) in order to tune 
the launching parameters, (this optimization is pretty fast because only a MATLAB script that takes in total few seconds 
is run), then the found optimal solution is evaluated by the external optimization workflow (Fig.17) that handles the 
energy objective minimization by changing in the proper way the geometrical parameters to propose a new candidate 
solution in the next iteration step; note that for both the internal and external optimizer the same modeFRONTIER 
algorithm, Simplex, has been used due to its efficiency to solve single-objective problems. 

This type of optimization strategy is also called Hierarchical Games or Stackelberg Games"’, and is actually based 
on the definition of one player as the /eader (in our case the external optimization workflow) that would optimize his 
part of parameters (here geometrical parameters) accordingly to his objective (here energy minimization), and of 
another player as follower (in this case the internal optimization workflow) that would optimize his objective (returning 
accuracy) playing his parameters (here launching parameters) for any candidate solution proposed by the leader. The 
convergence of the Game is a compromise equilibrium between the two objectives, indeed called Stackelberg 
Equilibrium. 

This strategy is particular efficient in this case because it allows to apply a fast and efficient single-objective 
optimization algorithm in the main workflow, for which the simulation time are very high since many CFD evaluation 
are needed for each candidate solution, whereas the second objective is optimized for each solution by the internal 
workflow in a very short time, since the script to tune the launching parameters is very fast to execute, definitively 
avoiding to apply a slow-converging multi-objective algorithm in the main and CPU-expensive optimization workflow. 


In addition, to further reduce the overall computational time and better exploit the available computational 
resources, the Grid functionality'” of modeFRONTIER is applied: the latter allows to distribute concurrent executions 
of different candidate designs on different machines available in the local network, reducing dramatically the global 
time to complete the optimization. In this way, if appropriate licenses of the compute applications are also available, 
the optimization becomes a feasible task that can be completed within few days of computations. 


Once the optimization strategy has been set up in modeFRONTIER, the automatic optimization process has been 
carried on. Figure 21 reports the global results of the optimization process, in the space of the objectives and constrains 
considered. In particular, for each design, the abscissa reports the launch energy, the ordinate indicates the range, and 
the color of the bubbles reports the returning accuracy for each design. 
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Figure 21: Optimization results. Launching energy is expressed in Joule, range and returning accuracy is expressed 
in meters. 


At the end of the process, one of the optimal boomerang configuration has been selected, and its geometry and 
performances are reproduced in Fig.22. The energy required to launch the boomerang is 3.5 J, the percentage of 
rotational energy is only 7% that corresponds to an initial spin of about 4 Hz and an initial translational velocity equal 
to 15 m/s; the tilt angle is 0°, while the aim is about 20°. This set should make the boomerang launch pretty easy. The 
range is 14.5 m. 

Note that the selection of optimal design has been made considering the criterion based on obtaining an easy lunch, 
but the same methodology described in this paper may be adapted for any other requirement or objective. 
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Figure 22: Optimal boomerang configuration and corresponding drawn trajectory 


IX. Conclusions 


This paper has described an automatic and efficient methodology for the multi-objective optimization of a 
boomerang shape, as an interesting benchmark and proof of concept to test the multi-objective and multi-disciplinary 
capabilities of the optimization environment modeFRONTIER. 

The process flow can be easily automatized in modeFRONTIER, integrating the different parts in a single 
workflow: the CAD parameterization of the boomerang design, the CFD simulations though Star-CCM+ to create a 
training database, the creation of Response Surfaces to extrapolate the aerodynamic response in function of any 
combination of flight parameters, and the MATLAB script to evaluate the boomerang trajectory on the basis of 
aerodynamic and mass properties. 

A fast and efficient strategy based on Hierarchical Games in modeFRONTIER has been applied, so that it has been 
possible to find the optimal boomerang configuration (in terms of minimum launching energy and accuracy of the 
return) by a reduced overall number of different configurations. 
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lien bo roactom apeed can pe neeomp ished in abot § tc cr lee. However, 
if the wlimece speed. 6 not obtained for ane Teaser we accelerulh..u Decimals 
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Abstract 


We describe a new approach for the purely Eulerian simulation of 
incompressible fluids. In it, the fluid state is represented by a C?- 
valued wave function evolving under the Schrodinger equation 
subject to incompressibility constraints. The underlying dynami- 
cal system is Hamiltonian and governed by the kinetic energy of 
the fluid together with an energy of Landau-Lifshitz type. The 
latter ensures that dynamics due to thin vortical structures, all 
important for visual simulation, are faithfully reproduced. This 
enables robust simulation of intricate phenomena such as vor- 
tical wakes and interacting vortex filaments, even on modestly 
sized grids. Our implementation uses a simple splitting method 
for time integration, employing the FFT for Schrodinger evolu- 
tion as well as constraint projection. Using a standard penalty 
method we also allow arbitrary obstacles. The resulting algorithm 
is simple, unconditionally stable, and efficient. In particular it 
does not require any Lagrangian techniques for advection or to 
counteract the loss of vorticity. We demonstrate its use in a va- 
riety of scenarios, compare it with experiments, and evaluate it 
against benchmark tests. A full implementation is included in the 
ancillary materials. 


Keywords: _ discrete differential geometry, fluid simulation, 
Schrodinger operator 


Concepts: *Mathematics of computing — Partial differential 
equations; *Computing methodologies — Physical simula- 
tion; -Applied computing — Physics; 


1 Introduction 


We introduce incompressible Schrodinger flow (ISF), a new method 
to simulate incompressible fluids (Fig. 1, middle). Instead of 
describing the fluid evolution in terms of the velocity or vor- 
ticity field, ISF evolves a two-component wave function zw = 
(w1,W.)™: M — C?, which encodes the fluid state on a 3D domain 
M. The classical fluid density p and fluid velocity v = (1, v2, v3)™ 
are extracted from y) as 


OV, a 5 ie) A=1,2;3 
OX y 


p=|lyP=(~y)x and 


where ($,%)z = Re((¢,H)c) = Re($i1 + G22). The time 


evolution of these wave functions is governed by the Schrodinger 
equation 
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Figure 1: Comparing experiment (dry ice vapor, top) with ISF simu- 
lation (middle), followed by a visualization of the underlying wave 
function y. Vorticity is concentrated within the green region. 


subject to the constraints 
(Ay,ip),=0 and = |p| =1, (2) 


which correspond to div(v) = 0 and p = 1 in the classical vari- 
ables (Sec. 4.1). The scalar potential p: M — R in Eq. (1) is the 
Lagrange multiplier for the divergence constraint (App. A), and 
we will refer to it as pressure in analogy to the Euler equation. The 
reduced Planck constant f of quantum Physics becomes the only 
parameter for our fluid and controls the quantization of vorticity. 
For a large range of initial conditions ISF tends to concentrate 
vorticity in filaments of strength 277f (Fig. 1, bottom). 


We call Eqs. (1) and (2) the incompressible Schrodinger equations 
and the corresponding flow the incompressible Schrodinger flow. 
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ISF is computationally very attractive. Using a splitting method 
for time integration, each step requires only linear Schrodinger 
evolution, followed by normalization and pressure projection, 
i.e., solution of a Poisson problem (Alg. 1). All advection hap- 
pens within the Schrodinger evolution, no separate advection 
step is needed. The resulting algorithm is unconditionally stable, 
straightforward to implement, and efficient. Obstacles are also 
easily incorporated (Alg. 4). 


ISF is a Hamiltonian flow with respect to the energy 


Hise(w) = *|ldp |? = 4 lvl? + SHO) (3) 


(Thm. 2, Sec. 4.4) with ||.|| denoting the L? norm. The kinetic 
energy alone (first summand) would give the Euler equations 
for ideal, i.e., inviscid and incompressible, fluids. In ISF these 
equations are modified by the presence of the term H, (Secs. 4.2- 
4.5). For flows dominated by vortex filaments it can be interpreted 
geometrically as approximating the 47 multiple of the length of 
all vortex filaments in the fluid (Sec. 4.5). 


This modification has significant consequences for incompress- 
ible fluid simulation on grids. Standard methods using a direct 
velocity or vorticity representation typically struggle with main- 
taining coherent vortical structures and their dynamics over time 
(Sec. 1.2). These structures are usually concentrated along curves 
and often persist over significant periods of time, while their dy- 
namics are critical for the visual appearance of flows (Fig. 1, cmp. 
top and middle). ISF captures their energy in H,, preserving vor- 
tical structures and their dynamics over a wide range of scales on 
modest resolution grids. 


Note ‘To simplify the derivations we will use Exterior Calcu- 
lus [Abraham et al. 2001, Ch. 6] in the remainder of the paper, 
working with the velocity 1-form 7 = v’ = hd, iy)» instead of 
v and correspondingly «d*7 = 0 instead of div(v) = 0. 


Notation Meaning 

p= (W1,Y2)": M > C? wave function 

y= a time derivative 

dy differential of x 
(b,p\c= 114+ 625 Hermitian form on C? 
(,W)z = Re((¢,W)c) Euclidean product on C? 
Wl? = (We W)e pointwise squared norm 
(u, v) Euclidean product on R® 
27h vortex filament strength 
n=v =AhAldwyp, iw), velocity 1-form 

w=dyn vorticity 2-form 

§ = «dx divergence 
Lww=(v-V)w Lie derivative of w 


Ln = ((v -V)v+ aly)’ Lie derivative of 7 


Table 1: Notations used. 


1.1. Physics Foundations 


In the context of fluids the Schrodinger equation typically only 
appears in the study of fluids at very low temperatures and very 
small scales, so called superfluids. Despite this, superfluid dy- 
namics are of great interest for computer graphics applications 
because of the remarkable similarities between vortex dynamics 
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in ordinary and superfluids, both at the experimental and theo- 
retical levels [Schwarz 1985] (see also the more recent [Stagg 
et al. 2014] and references therein). 


In most situations of practical import the dynamics of ordinary 
fluids are dominated by thin vortex filaments [Saffman 1992]. 
Because of their thinness these are inherently difficult to resolve 
at feasible grid resolutions with traditional representations (ibid, 
p. 201). In superfluids, using wave functions to describe the state 
of the system, such filaments are a topological feature and thus far 
more persistent and computationally resolvable even at relatively 
modest resolutions. 


Early on in the history of the study of superfluids it was recognized 
that they carry quantized vorticity in atomic scale filaments [On- 
sager 1949; Feynman 1955]. These were later experimentally 
verified [Hall and Vinen 1956] and even photographed [Packard 
and Sanders 1969]. In our approach the strength of the filaments 
is 277A, which we can take as a quantization parameter of our 
simulations, setting the strength of vortex filaments and with it 
the level of detail present in the flow. 


A mathematical model for the observed physics of superfluids 
was developed by Gross [1961] and Pitaevskii [1961] (and ear- 
lier Ginzburg and Pitaevskii [1958]). This model is now known 
as the Gross-Pitaevskii (GP) equation, or simply the non-linear 
Schrodinger (NLS) equation 


ing =—E Ay +4 (lyP-1)y, 


for ¢: M — C and a parameter a > 0 which for us corresponds 
to the core radius of vortex filaments. 


The non-linear (cubic) term acts as a potential opposing the de- 
viation of the density p = |y|? from 1. Indeed, in simulations 
using the GP equation the density is near 1 in most of the domain, 
save for the zero set of y in whose vicinity the density smoothly 
decreases to zero [Stagg et al. 2014]. Taking the limit a — 0 one 
expects the non-linear term to converge to the incompressible 
limit and hence the GP equation to recover the Euler equations 
for 7. In 2D this has been rigorously established [Lin and Xin 
1999; Jerrard and Spirn 2015]. 


As a practical matter, choosing a small a in the cubic term leads 
to very stiff numerical problems. Since we are only interested in 
the incompressible setting, we can replace the cubic non-linearity 
with the incompressibility constraints. For single component wave 
functions the uniform density constraint |p|? = 1 yields singular 
y and allows for irrotational velocity fields only. Using instead 
a two-component wave function y: M — C? leaves y smooth 
under the constraint |x|? = 1. Additionally it allows for smoothly 
varying vorticity [Schonberg 1954; Sorokin 2001 ]. 


The interpretation of the Schrodinger equation in terms of flu- 
ids was first pursued by Madelung [1926; 1927] in an effort to 
elucidate the then new quantum mechanics. He showed that 
the Schrodinger equation for single component wave functions is 
equivalent to the quantum Euler equations. What became known 
as the Madelung transform was later applied to C"-valued wave 
functions by Schoenberg [1954]. In particular he introduced 
the form of 7 we use. Sorokin [2001] gave the C? version of 
the Madelung transform including explicit expressions for the 
non-linear potentials which distinguish it from the classical Euler 
fluid. 


Two threads from Physics have influenced our work. On one hand 
the hydrodynamical interpretation of Quantum Physics and on 
the other the GP equation for the modeling of superfluids. Since 
we are interested in incompressibility we replace the numerically 


stiff cubic term in the GP equation with pressure projection. At the 
same time our fluid inherits the robust representation of vortex 
filaments from superfluids. 


1.2 Vorticity in Computer Graphics 


The importance of vorticity for visual simulation has long been 
recognized in computer graphics, as has the difficulty to capture 
its dynamics correctly with numerical methods. 


The success of Jos Stam’s “Stable Fluids” [1999] method, using 
semi-Lagrangian advection and pressure projection on a regular 
grid, quickly led to work addressing its excessive numerical diffu- 
sion [Fedkiw et al. 2001] using vorticity confinement [ Steinhoff 
and Underhill 1994]. Lost detail has also been compensated by 
wavelet turbulence [Kim et al. 2008] or curl-noise [Bridson et al. 
2007]. Unfortunately, these techniques are difficult to control 
and easily lead to objectionable visual artifacts. 


Alternatively one can represent vorticity through Lagrangian vor- 
tex particles, a technique from the CFD community [Rosenhead 
1931; Leonard 1980; Cottet and Koumoutsakos 2000] to avoid 
many of these issues [Park and Kim 2005]. Since vorticity arises 
at boundary layers as sheets and then quickly rolls up into fila- 
ments, purely Lagrangian methods based on filaments [Angelidis 
and Neyret 2005; Weifsmann and Pinkall 2010] and sheets [ Stock 
et al. 2008; Brochu et al. 2012] have also been developed. 


Using vorticity as a primary variable also improves mesh based 
Eulerian simulations [Elcott et al. 2007], motivating | Zhang et al. 
2015] to modify existing velocity based Eulerian solvers to per- 
form as if using vorticity as their primary variable. 


Many recent approaches are of a hybrid nature, integrating La- 
grangian elements into grid based approaches [Selle et al. 2005; 
Kim et al. 2009; Pfaff et al. 2012] or enhancing purely Lagrangian 
methods with grids [Koumoutsakos et al. 2008]. 


Overall we see that proper resolution of vorticity is essential to 
the visual appearance of flows and simultaneously difficult to 
achieve. Grid based approaches battle loss of vorticity with vari- 
ous devices, while purely Lagrangian approaches have their own 
host of problems ranging from inadequate control of sample den- 
sity (particles), to complex reconnection handling (filaments), 
and the need for sophisticated multipole solvers, etc.. 


Our new method is grid based and purely Eulerian. Still we are 
able to simulate vortex driven dynamics, with grids of modest 
sizes, at a quality level comparable to purely Lagrangian methods. 


2 The Algorithm 


All our simulations are performed on a 3D lattice with vertex 
set V = {0,...,NV, — 1} @{0,.. SRV, — 1} x {0,...,N,—1}. For 
a periodic domain, indices are taken modulo their respective 
dimension. Vertices, v € V, need to store samples of the wave 
function y, € C*, the real-valued pressure q, € R, and the real- 
valued divergence ¢, € R. The discrete velocity 1-form is defined 
on directed edges vw € € 


Nw = Rarg(y,, Wwe (4) 


with 7, =—Nyw (App. D) and stored in staggered grid fashion at 
the vertices (Fig. 2). 


The discrete divergence, € = xd x1, is the usual signed sum over 
incident edges, weighted by the quotient of dual facet area A,,, to 
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Figure 2: In all, 9 reals are stored at vertices: y,, €,, q,, and the 
three circulations (1,%2,13)' associated with the edges emanat- 
ing in the positive coordinate directions. The divergence €,, is the 


F A . 
normalized sum of the face fluxes Tew O71 the enclosing cube. 


edge length l,,, and normalized by dual cell volume V, (Fig. 2) 


E=(«den=— > Iw (5) 


VWEE 


following standard Discrete Exterior Calculus (DEC) conven- 
tions [Desbrun et al. 2008; Crane et al. 2013 ]. 


Overall time integration uses operator splitting, performing in- 
tegration of the Laplace term, normalization, and pressure pro- 
jection in order. Later we include obstacles and buoyancy resp. 
gravity forces. 


Time integration requires an initial x), a time step size dt > 0, 
and a quantization strength A > 0. Appropriate values will be 
illustrated with the aid of example simulations (Sec. 3 and Tbl. 2). 


Algorithm 1 Basic ISF 


Input: wy, dt, hi 
1: for j — 0,1,2,...do 


> Initial state and parameters 


2 p™P <— SCHRODINGER(W”), dt, Hi) 

3. yim <— yimP /|apt™P| > Normalization 
4: pot) — PRESSUREPROJECT(w™? ) 

5: end for 


Schrédinger integration diagonalizes in the Fourier domain, 
leading us to use the FFT (on periodic domains). For walls 
OM # @ Neumann boundary conditions are achieved with the 
discrete cosine transform (DCT). Here A, are the eigenvalues of 
the continuous 3D Laplace operator (Eq. (18), App. E). 


Algorithm 2 Time integration of Schrodinger equation 


1: function SCHRODINGER(\, dt, fA) 
2: w <— FFT3D(w) 
z eee ee 
3: yp es eirdt 5 yp 
4: return INVFFT3D(1) 
5: end function 


Pressure projection Eq. (10) uses the scaled (hk) discrete di- 
vergence as the right hand side of a Poisson problem. We use 
an FFT (or DCT) to invert the Poisson problem (Sec. 4.1) using 
eigenvalues A, of the discrete Laplacian for discretely divergence 
free velocity fields (Eq. (17), App. E). 
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Algorithm 3 Divergence free constraint 


1: function PRESSUREPROJECT(w) 


2 for each vw € € do > Scaled velocity 1-form at edges 
3: Aww = are(wy,Wwic > A ' multiple of Eq. (4) 
A: end for 
5: for each v € Y do > Scaled divergence at vertices 
6: Ev = Dames 1 thw > Eq. (5) 
7: end for 
g:  & —FFT3D(é) 
» pla AZO 
9: ee] ee 
0 
10:  q < INVFFT3D(é) 
TA; return e ‘44 


12: end function 


This completes the description of the basic algorithm. While we 
make extensive use of the FFT (or DCT) our method is not tied 
to the Fourier domain. For example, the Schrodinger integra- 
tion could use [Al-Mohy and Higham 2011] while the pressure 
projection step might employ [McAdams et al. 2010]. 


Fig. resolution size[m?] dt[s] fi[m*s '] Video 
1 128° 53 1/4g 0.03 04:48 
4 128x64x64 10x5x5 ‘3/4 O.1 00:05 
5 64° 2° og 0.01 01:17 
6 128° 4° 1/43 0.02 01:42 
7 128x64x64 4x2x2 ‘1/48 0.02-0.04 02:18 
8 192x64x64 6x2x2 ‘1/48 0.015 03:41 
9 192x64x64 6x2x2 1/48 0.015 03:56 

10 96x192x96 3x6x3 1/48 0.02 04:09 
13 128° 4° I/4g 0.02 02:00 
14 64° resp. 128° 53 1/24 0.05 00:39 
15 512x12x208 20x1/2x8 1/48 0.03 02:48 


Table 2: Parameters for simulations. 


3 Using the Algorithm 


With the basic algorithm in place, we now discuss its use, be- 
ginning with simple benchmark simulations and building up a 
set of straightforward tools capable of describing a large set of 
interesting simulation scenarios. All examples used our native 
implementation in Houdini 15 (Fig. 3). Complete source code is 
included. Comparisons with other methods used Houdini as well. 
Performance is controlled by the cost of the FFT. A single step of 
Alg. 1 takes less than 1s at 128° resolution and less than 9s at 
256° on a 3.5GHz i7 iMac. 


Note on initializations ISF is invariant under unitary transfor- 
mations of C*. We exploit this by initializing y), with the desired 
initial state, using yz, = € (we use € = 0.01) merely to guard 
against zeros in w, during normalization. After normalization 
simple quantitative statements such as “the norm of wy, indicates 
the presence of a vortex core” make sense. 


3.1 Vortex Filaments 


In ISF vorticity has the tendency to concentrate in one-dimen- 
sional filaments, as it does in actual fluid dynamics. The ability to 
create and handle filaments gracefully, inherited from its origins 
in the theory of superfluids, is a principal strength of ISE To 
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Figure 3: All algorithm components were implemented entirely 
within Houdini 15. 


elucidate this, we look at the way individual filaments evolve 
and interact in our algorithm. To do so we need a wy which 
represents one or more vortex filaments. Since w for multiple 
filaments are just the componentwise product of single filament 
w functions, we begin by describing a simple method for the 
construction of 7 for a single filament curve y. 


Suppose y is the boundary of 
an embedded topological disk, ae a 

yY = OX. We first construct a ia Ms, 
complex function y whichhasy _ fa a \ 
as its zero set. Consider the vol- *. | 
ume created by a positive and op } 
negative offset of the oriented ; 3h 
surface & along its normal di- 


rection for distance r > 0 (see inset). In this “slab” of thickness 
2r with % as its middle surface, set 





9=n(1+$) 


for d the signed distance function of &. Letting 9 = O outside 
the slab, we set y = e'’. Pointwise normalizing and pressure 
projecting (y,¢)" then gives us the desired y), encoding the 
Biot-Savart velocity field of the curve y. 


A classic example of interesting filament dynamics are the 
leapfrogging vortex rings [Lim 1997, Video]. Two closely 
spaced circular vortex filaments will alternately leapfrog one 
another. This phenomenon is typically very hard to repro- 
duce in standard fluid solvers but runs without difficulty in our 
method. Fig. 4 shows a comparison between a state of the 
art 5 order HJWENO [Osher and Fedkiw 2003, Ch. 3.4] ve- 
locity advection method with 2™ order MacCormack time step- 
ping [Selle et al. 2008], as implemented in Houdini, and our 
ISE HJWENO/MacCormack is never able to complete even the 
first leapfrog cycle, quickly yielding only a merged, single vor- 
tex ring while our method goes through the correct cycle and 
is still proceeding without any “damage” after 2000 time steps 
(approx. four cycles). See also Sec. 4.5 for a discussion of the 
energy behavior during this simulation. 


This method also works for far more 
complex filaments since there always 
exists a Seifert surface, i.e., an em- 
bedded, oriented surface %, which is 
bounded by the closed curve y [Seifert 
1935]. Software to construct it is read- 
ily available (SeifertView). The inset 
shows the Seifert surface for the trefoil knot. Producing wp as 
above with this % we can simulate the evolution of the trefoil knot 








Figure 4: Leapfrogging vortex rings using HJWENO /MacCormack 
(top) and ISF (bottom). Left to right: iteration 45, 360, 2000 (cmp. 
to [Lim 1997, Video ]). 


(Fig. 5) and correctly replicate the reconnection event which oc- 
curs when the initial filament crosses itself. This produces two 
separate filaments with the smaller one moving off and matches 
experiments [Kleckner and Irvine 2013, Video]. 





Cx/ 


Figure 5: Evolution of the trefoil knot with ISF showing frames 50, 
100, 210 (cmp. to [Kleckner and Irvine 2013, Video ]). 


Taken to an extreme we can produce initial configurations of vor- 
tex filaments which optimally approximate arbitrary initial veloc- 
ity fields using the method of Weifsmann and co-workers [2014]. 
Given a target velocity 1-form 7) a single component wave func- 
tion y is found as the ground state of the magnetic Schrodinger 
operator 
yp =argmin ||dp —ij¢ll’, 
I|pl|?=1 

which amounts to finding the principal eigenvector of a SPD ma- 
trix. Pointwise normalizing, (y/|y]|,€)", yields the desired yb 
after pressure projection. 


3.2 Velocity Constraints 


Being able to prescribe a constant velocity in a particular region 
is a basic tool for the construction of initial conditions as well as 
while a simulation is running. In the context of our wave function 
w this amounts to enforcing a plane wave in a particular region. 
Given a wave vector k € R°, a plane wave is given by the function 
pktn = ellk.x—itkt/2) | 

yp = (yk+*, 0)T is then a solution of ISF corresponding to the 
constant velocity field v = fk. 


77:5 © Schrddinger's Smoke 


Consider now the scenario of setting up an initial, divergence free, 
velocity field with two regions, say the Bunny and the Teapot, each 
having some constant but different velocity, e.g., pointing at one 
another. Using the corresponding y as initial condition, we 
can simulate the consequent inertial motion dynamics (Fig. 6). 


Formally, we seek a divergence free w, constrained to have 7, in 
some region 2. Cc M 


Nlo=Ng and x«xd*xn=0. 


In our example 2 would have two connected components, Bunny 
resp. Teapot, and 7, correspondingly specify a velocity for each 
component. We construct such an 7 through constraint projection, 
which enforces the velocity constraint and subsequently ensures 
vanishing divergence through pressure projection. 


Algorithm 4 Velocity constraint projection 


1: function CONSTRAINTPROJECTION(), Q, k, A, t) 


2: ypemp —— 
32 lo — POP (lapal, Ial)™ 
A: return PRESSUREPROJECT(w'™?) 


5: end function 


Starting with an initial guess, e.g., yy = (1,¢)' in M \Q and 
(y*26" ©)T in Q and normalizing it, iterating Alg. 4 is guaranteed 
to converge [Cheney and Goldstein 1959]. We find that 5 — 10 
iterations are sufficient in practice. 


Suppose now that we want to simulate a jet. This is an example 
of enforcing a fixed velocity in some region 2, the jet nozzle, 
throughout the simulation (Fig. 7). To accomplish this we use the 
volume penalization method [Arquis and Caltagirone 1984]. This 
method was devised for standard fluid simulation methods and 
includes the constraint via a parameter a in the Navier-Stokes 
equation 


A+Lyn = vAn—dp—+xo(n—Ng) and *xd*n=0. 





Figure 6: Two regions of red and blue ink are initialized to constant 
velocities pointing towards each other. Running ISF with this initial 
condition produces the corresponding inertial motion dynamics (left 
to right, top to bottom) at frames 1, 25, 85, and 270. The Bunny 
and Teapot centers were 2m apart, each moving towards the other 
at lms. 
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Figure 7: Jet of lms ' velocity and a nozzle opening radius of 0.3m. 
Left to right the vorticity quantization parameter fi = 0.04, 0.03, 
0.02m?s"', illustrating the finer detail with decreasing h due to 
more plentiful and narrowly spaced vortex filaments. The bottom 
row visualizes filaments as level set surfaces |y,|? — |y.|? = 0. 


Here vis the kinematic viscosity and y, the characteristic function 
of 2. For a — 0 the solution converges to a solution of the Navier- 
Stokes equation which respects the constraints [ Angot et al. 1999; 
Carbou and Fabrie 2003]. 


To integrate the constraint we use the implicit Euler scheme 
of [Jause-Labert et al. 2012] as it applies to our =. The method 
has no time step restriction and we can take the limit of a — 0 
directly. It amounts to a single constraint projection with param- 
eters at the end of the time step, ie., pYTY, QUtYD, KUtD, ¢GFD, 
The corresponding call to Alg. 4 follows the pressure projection 
in Alg. 1. 


Fig. 7 shows the resulting simulation also illustrating the effect 
of different values for h. With decreasing fi the strength of vortex 
filaments is lowered and correspondingly their number increased, 
resulting in finer details in the flow. This can also be seen in the 
direct visualization of the filaments as level set surfaces. 





Figure 8: Frame 600 of a spherical obstacle of radius 0.4m in a 
1ms“ flow. 


Maintaining a velocity constraint for some region can also be used 
to incorporate obstacles into our simulation. In that case ny = 0 
while Q may or may not be a function of time. Fig. 8 shows an 
example of a stationary obstacle in a background flow while Fig. 9 
shows a moving obstacle. 


3.3 Gravity and Buoyancy 


Both gravity and buoyancy are important forces in simula- 
tions. A simple model for this is the heavy/buoyant vortex fila- 
ment [ Saffman 1992, Sec. 5.8]. Given the way we have initialized 
©, filaments are zeros of y), while y), is indicative of the fila- 
ment core. Hence buoyancy and gravity enter at the level of the 
Schrodinger equation (1) as linearly varying potentials applied 
to the yy. component 


imp =—“ Aw + pp +(0,(g,x) yo)", 
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Figure 9: Example of a moving obstacle at frames 1, 100, and 240. 


with the vector g € R® controling magnitude and direction while 
x € R® is the spatial coordinate. 


Since the potential does not depend on time, integration is 
straightforward and amounts to multiplying y, with the plane 
wave ys. after normalization but before pressure projection 
in Alg. 1. 





Figure 10: A jet subject to a buoyancy force which “bends” it up- 
wards from its tilted initial trajectory at frame 150, 350, and 500. 


Fig. 10 shows an example of a jet with buoyancy causing a gentle 
upward bend. 


4 Mathematical Foundations of ISF 


To understand ISF we first show that it gives rise to a Clebsch 
variable and then derive the underlying dynamics in terms of this 
variable. Throughout this section we will identify C* with the 
quaternions H, which greatly simplifies the derivations. Before 
diving in, we briefly recall some relevant facts regarding Clebsch 
variables and quaternions. To keep the presentation simple we 


assume henceforth that M is a compact contractible domain in 
R°? with smooth boundary. 


Clebsch variables represent incompressible flows through a 
function rather than directly as a velocity or vorticity vector 
field [Clebsch 1859] (see also [Deng et al. 2005] and references 
therein for a contemporary exposition). For a flow described by a 
vorticity 2-form w, a function c: M — &, for some 2-dimensional 
manifold © equipped with an area form dA,, is called a Clebsch 
variable if 
w=c"dAy 


where c*dA, denotes the pull back of the area form. In other 
words, for each oriented surface 2 Cc M with its corresponding 
image c(Q.) C & we have 


{ wO= dA; = Area;.(c(Q)). 
Q c(Q) 


The preimage of a region in © is a vor- 
tex tube in M with a vorticity flux cross 
section constant along the tube, and 
c serving as a local parameterization 
of the vortex lines within. In a classi- 
cal Euler fluid, the dynamics are then 
characterized by c being advected by 
the velocity field 


é+L,c=0. (6) 





This is typically done for maps to & = R?, but works equally well 
for maps to the sphere & = S* [Kuznetsov and Mikhailov 1980]. 


Quaternions Throughout this section y € C* will be treated 
as a quaternion via the map Ww ~— wW,+ yj, where j is one 


of the quaternionic imaginary units, the others being i and k. 


At times we also exploit that the imaginary quaternions ImH 
are in a natural one-to-one correspondence with 3-vectors in R®, 
(x, y,2) <> xi+ yj+zk. A quaternion can be thought of as the 
sum of a scalar and a vector a =a, +a, = Re(a)+Im(a), in terms 
of which multiplication becomes 


ab = (a, b, _ (dys b,)) as (a,b, . b.a, + a, x b,). 


Lastly for |a|? = 1 the product ab,a represents a general 3D 
rotation of a 3-vector b, € ImH [Cayley 1845]. For a more 
detailed introduction to quaternions see [Hanson 2005]. 


To simplify the derivations we use the shorthands 
= n/h=Re(—pidyp), s:= pir, (7) 
and 
w= —pidep = 5 (dpip —pidy) —3 (dpi + pidy) 
ON eFrT O” 
=Re(dpiw)=(dy, i) p =d(piw)=ds 
=i — 3ds, (8) 


in terms of which we may write 


dy =ip(j—ds) and ds=2(pidyp +7). (9) 


4.1. Incompressible y) 


Let y: M — C” be a wave function and p = |w|? its density. For 
incompressibility we require constant density op = 1 and vanishing 
divergence, «d*7n = 0. 
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The normalization constraint is easily enforced initially for n > 2 
since a C?-valued function generically never vanishes on a region 
in R°. Thereafter the constraint «d * 7 = 0 maintains the point- 
wise normalization (App. A). It is equivalent to (Ay, iw), = 
for A = «dxd since (dy), idy))» = 0. Therefore the dynamics of 
ISF correspond to trajectories yw“ on the manifold 


M:= {pec?(M,C’)| |p| =1, (Ay), ip)y =0, dyp(N) =o}. 


Here N is the unit normal vector field to the boundary of M and 
dy(N ) = 0 is equivalent to the velocity being tangent to 0M. 


To project an arbitrary y € C°(M,C7) with |a)|* = 1 to M 
observe that for a smooth function q: M > R, e “4~w has velocity 
1-form 7 —fdq. Therefore the velocity field will be divergence 
free if q satisfies the Poisson problem 


Ag = «dx7. (10) 


Because constant functions form the kernel of A, e~‘44) is unique 
up to a global phase, which leaves 7 unchanged. In the time 
dependent setting the divergence free constraint is enforced by 
the potential p in Eq. (1) (App. A). 


4.2 Coordinates on Phase Space 


For a proper phase space we need to eliminate the global phase 
degree of freedom left in the definition of M (Sec. 4.1). This is 
accomplished by using s € C®(M, S7) as given in Eq. (7). 


To see that s uniquely describes the state of the system, suppose 
w,o € M with wiw =s = di¢ then there is a function g: M > 
R such that ¢ = e4y). Since wy, € M, q is harmonic with 
zero Neumann boundary conditions, so in particular q is constant 
and yw and @ are the same element of M (up to a global phase). 
The map a+ dia from S? to S? is known as the Hopf map [Hopf 
1931; Lyons 2003 ]. 


We have shown that every w € M is uniquely determined up 
to a constant phase by the map s = iw. Conversely, using the 
contractibility of M, it can be shown that for every s € C“°(M,S7) 
there is a yw € M such that s = wiw. 


Thus s alone represents the state of the system and to study the dy- 
namics of y it suffices to look at the dynamics of s € C™(M,S7). 


4.3. The Clebsch Variable s 


Since s completely describes yy € M we can study the dynamics 
of wy © M under ISF by looking at the time evolution of s € 
C™°(M,S?*). _ 
Theorem 1. The function s = wWiw is a Clebsch variable of ISF 
with 

w= Es*dAg (11) 
where dAg2 is the standard area form on the unit sphere. 


Proof. To see that the vorticity 2-form of ISE w = dyn = Adu, can 
be expressed as the area form on S$? first observe that 


sdu=—pipdpridyp =  widy A pidp =uApy= tds Ads, 
=—dipyp 


where the last equality follows from distributing the wedge prod- 
uct over U = 7} — sds (8). Because s takes on values in S? we 
have ds A ds = 2s(s*dAg2). Explicitly, for X,Y € TM, 


s*dAg2(X, Y) = dAg2(ds(X), ds(Y)) = +(s, ds x ds)(X,Y). 


where the cross product of two R°-valued 1-forms a and f is given 
by a x B (X,Y) = a(X) x B(Y)—a(Y) x B(X). This completes 
our claim. Oo 
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4.4 Dynamics of the Clebsch Variable 


To derive the dynamics of ISF in terms of s we show that the 
symplectic gradient flow of the Hamiltonian 


Arge(p) = Flap? = $lnl? + SlldsiP? = A.p) + SHC) 


(cf. Eq. (3)) coincides with the time evolution of s when derived 
directly from the ISE (Note that the decomposition into kinetic 
and scaled Dirichlet energies follows directly from Eq. (9).) 


First we define a suitable symplectic form on the space of vari- 
ations of s € C™(M,S7). Let s,§ € T,C™(M,S7) be two such 
variations then 


o(s,s)= | (s,s x §) 


is our symplectic form. Next we compute the symplectic gradients 
of the two terms in the Hamiltonian beginning with the kinetic 
energy. 


Let d® = «7 with ®|,,, = 0, the existence of which is assured by 
the Helmholtz-Hodge decomposition for manifolds with bound- 
ary [Schwarz 1995], and consider a variation § of s € C°(M,S7) 


an@)= | prsn= | pnde= | —d(nA®)+dnA® 
M M M 


= | ( (8, ds x ds) +2(s, ds x ds))A® 
M if 


=0 


= | d((s,§ x ds) A&) — (ds, x ds) A® + (s,8 x ds) Ax 
M _— 


=0 


= | (s x §,ds Ax) =O(S,L,s). 
M 


Here we used Stokes’ theorem, ®|;,, = 0, cyclic permutation in 
the determinant, ds,5(p) € TS’, s 1 TS’, and x(ds A*n) = Lys. 


Recalling that the symplectic gradient sgradH, is defined by 
o(s,sgradH,) = dH,(s), we have shown that the Hamiltonian 
flow of H, is 

s =—sgradH, =—L,s 


which establishes Eq. (6) for the Clebsch variable s and by implica- 
tion that 7 behaves as an ideal fluid modified by the term arising 
from the second part of the Hamiltonian, the scaled Dirichlet 
energy H,. 


Once again let § be a variation of s € C°(M, $7) 


nn ° n? 
7 dH, (5) aa e | 


M 


(di Axds) = © | d(8, xds) — (8, dx ds) 
M 
2 


—_ sal (s x s,s x *As) = —fo(§,s x As), 
M 


where we used that the restriction of «ds to the tangent bundle 
on the boundary is ds(N)dA,;,, and ds(N) = 0 since dy)(N) = 0. 


Consequently the Hamiltonian flow due to HH, is 
S=5 (SxAs), 


the isotropic Landau-Lifshitz equation (LLE) [Landau and Lifshits 
1935] which describes the evolution of magnetization in ferro- 
magnets with s giving the orientation (“spin”) of the magnetic 
field at each point in the domain. Interestingly the LLE by itself ad- 
mits (magnetic) vortex rings | Cooper 1999; Sutcliffe 2007 ] which 
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travel through the material and exhibit such complex dynamics 
as leapfrogging [Niemi and Sutcliffe 2014]. 


Theorem 2. JSF is the Hamiltonian flow of Hicgg = H, + © H. That 
is, the evolution of s under ISF is 


$+L,s = 3(s x As). (12) 


Proof. We compute s directly in terms of y using only zw € M 
and that it is a solution of the Schrodinger equation (1) 


$= pi + Diy) =2Im(Piw) 
= 21m (qi (—L,p — tip (s x As) — pip) 
= —(Lypip + pil) + 2 (sx As) 
=—Lys+ H (s x As) 


where we used Eq. (14) (App. B). O 


As a corollary we find the evolution equation for the velocity 
1-form as 


H+L,n =dpt © (As, ds), *d*xn =0 


for a suitable pressure p (App. C). In particular we see that the 
velocity 1-form 7 is advected by the velocity field subject to a 
tension force. 


4.5 Landau-Lifshitz Modified Fluid 


In Sec. 4.4 we showed that the dynamics of ISF arise from the 
standard kinetic energy of 7, which would correspond to an ideal 
fluid, with an ~-multiple of the Dirichlet energy of s added in. 
The latter by itself would have resulted in Landau-Lifshitz dynam- 
ics. What is the impact of this modification from the point of view 
of incompressible fluid simulation? We begin with some basic 
observations. 


For initial data as we set up, almost all the mass of w is contained 
in y, over most of the domain. Only near the zero set of y,, 
which generically consists of closed curves or curves beginning 
and ending on the boundary, does w, take up significant mass due 
to the normalization constraint |y|* = 1. Since s,. = |y,|* —|y.|? 
for s = wiw, s is mostly near (1,0,0) and moves towards the 
antipode (—1,0,0) only near zeros of w,, inbetween “sweeping” 
over the entire sphere S? (Fig. 11). Eq. (11) then implies that the 
vorticity integral over a small surface transversal to a zero set of 
w, is 27h. In particular, filaments in ISF carry vorticity quantized 
to 27h. 


This characterization of s continues to hold as the simulation 
progresses since the mean of s is an invariant of the flow. This 
follows from ||w,||? and ||7,||* being invariants since w=, and y, 
by themselves are solutions to the Schrodinger equation, whose 
time evolution is unitary. Hence the mean of s,. is invariant. Due 
to the symmetry of S* so must be the mean of s. 


But what does this imply for the Landau-Lifshitz energy? Since 
the Dirichlet integrand is bounded below by the absolute value 
of the area density, Eq. (11) implies 


gids’ = Flo, 


with equality achieved for a conformal map s. Here |w| is the 
norm of the corresponding vorticity vector field. In practice we 
observe that the Landau-Lifshitz energy is near this minimum and 
hence the Dirichlet integrand supported mostly in the vicinity of 





Figure 11: A slice of a 3D domain with a vortex filament moving to 
the left. The color on the plane visualizes s according to the color 
map on the sphere. Large regions of the plane are nearly the same 
color, i.e., covering a small area on the sphere = little vorticity. As 
the filament translates the gridded region maps to an ever larger 
area on the sphere. The preimage of the equator (the level set 
lab, |? — |ab.|* = 0) visualizes the vortex tube (see also Fig. 7). 


the filaments, with an integral over a surface transversal to the 
filament yielding approximately 27tf independent of the thickness 
of the filament. Consequently, the integral of the Dirichlet energy 
for a tubular neighborhood of a filament y yields ¥ 471L(y). 


—— H,(p) = 4 Ill? 
--- Bop) = %llds|? 


g 
ae Hisp(w) = *{I\d-p|l? 
— HJWENO/MacCormack 





0 20 40 time (s) 

Figure 12: Energy plot for the leapfrogging vortex filaments (Fig. 4). 
The kinetic energy tracks closely between HJWENO /MacCormack 
and ISE The additional Landau-Lifshitz component in the latter 
stays nearly constant, keeping the vortex filaments “alive” in ISE 


What then are the dynamics consequences of the Landau-Lifshitz 
energy term? In examples of ISF we observe phenomena which 
are difficult to reproduce using grid simulations without exces- 
sively high resolution (Figs. 4, 13, 14). Since such phenomena 
arise from vortex filaments whose thickness is comparable to the 
grid resolution, a velocity or vorticity representation of fluids 
tends to lose the energy contained in the vortex cores, directly 
impacting the dynamics of the coherent vortical structures. For 
ISF the Landau-Lifshitz energy, proportional to the total length 
of the filaments independent of their thickness, maintains this 
otherwise lost energy. Fig. 12 demonstrates this quantitatively 
for the simple example of the leapfrogging vortex filaments of 
Fig. 4. 
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Additionally, the length L(y) of filaments is also the Hamiltonian 
of the local induction approximation (LIA) for the motion of thin 
vortex filaments [Rios 1906; Hasimoto 1972] (see [Saffman 1992, 
Ch. 11] for a modern exposition). This hints at a deeper relation 
between the LIA and the Landau-Lifshitz term in ISE Fig. 13 
shows the simulation of the Teapot/Bunny collision using Eule- 
rian HJWENO/MacCormack (top left) and a Lagrangian filament 
method [WeifSmann and Pinkall 2010] (top center), comparing 
it to our ISF (top right; see also Fig. 6 bottom right). Remark- 
ably, the Lagrangian filament simulation which uses sub-grid scale 
vortex thickness and explicitly includes the LIA forces, yields qual- 
itatively the same results as our Eulerian ISF method. 










i a i: ae 5 
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Figure 13: Frame 270 from colliding Teapot/Bunny simulation 
using Eulerian HJWENO /MacCormack (top left), Lagrangian vor- 
tex filaments [Weifsmann and Pinkall 2010] (top center), and our 
Eulerian ISF (top right). The center bottom shows the filaments of 
Weifsmann and Pinkall which, at thickness 0.017m, are below the 
grid size of the ISF simulation. ISF vortex tubes (|\y,|? — |y.|? = 0) 
and their cores (~, = 0) are shown on the bottom right. 


A further example of vortex filament dynamics, which are chal- 
lenging to simulate, are the obliquely colliding vortex rings [Lim 
1989, Videos front & top]. Fig. 14 compares our method with sta- 
ble fluids and HJWENO/MacCormack at two resolutions. Stable 
fluids, due to its excessive numerical diffusion cannot reproduce 
this experiment at all. HJWENO/MacCormack does somewhat 
better and successfully reproduces the reconnection event only 
at 256° resolution (not shown). ISF on the other hand produces 
the correct dynamics already at 64°. 


5 Additional Results 


An important characterization of classical fluids is given by the 
Reynolds number Re = |v|D/u where D is a characteristic size, 
e.g., the diameter of an obstacle in a flow, |v| a characteristic 
speed, e.g., speed of the background flow, and wu the kinematic 
viscosity. For superfluids as well as our setup there is no kinematic 
viscosity, but one can define an equivalent superfluid Reynolds 
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Figure 14: Comparison of methods for oblique smoke ring collision 
at resolution 64° (top) and 128° (bottom) showing from left to 
right the initial configuration, stable fluids with RK4 back trace, 
HJWENO/MacCormack, and ISE each at frame 600. The vortex 
filaments have strength 27h for i = 0.05m7s~", they are of radius 
0.6m making an angle of £45° to the domain with their centers 
separated by 2m (cmp. to [Lim 1989, Videos front & top J). 


number Re, = |v|D/(27h) [Volovik 2003], where |v|/(27th) is the 
number of filaments per unit of distance. Note that the typical 
range of Re, is quite different from the classical Reynolds number. 
In our experiments 1 < Re, < 10. 


The superfluid Reynolds number is helpful when scaling simula- 
tions. For example, scaling f and the velocity by the same factor 
leaves Re, invariant. 





Figure 15: Von Kdrmdn vortex street forming behind a cylindrical 
obstacle with Re, © 3.18 and St ® 0.14 at frame 1050. The cylinder 


measured 0.3m radius in a flow of Ims™!. 


A phenomenon that is characterized by the Reynolds number is 
the shedding of vortices with a particular frequency f from an 
obstacle. This can be captured by the Strouhal number [Strouhal 
1878], St = f D/|v|. For a large range of Reynolds numbers (Re € 
[800, 200000 ]) one finds St ¥ 0.2. Such measurements, though 
far fewer in number, have also been performed for superfluids 
(simulation and experiment) where the corresponding Strouhal 
number falls into the range of 0.12 — 0.18 [Reeves et al. 2015]. 
In our own experiments we have observed 0.14 < St < 0.18 for 
2 < Re, < 8 (see Fig. 15 for a typical example). This agrees well 
with the numbers reported for superfluids and is close to, though 
smaller than, the universal 0.2 for classical fluids. 


6 Discussion 


ISF is a close relative of the GP equation, replacing the cubic non- 
linear term with the limiting case of pressure projection to enforce 
a divergence free velocity field. The corresponding dynamics are 
related to superfluids but can be used for the visual simulation 
of classical fluids as we have demonstrated. Because ISF adds a 
Landau-Lifshitz energy term to what would otherwise have been 
an Euler fluid, we find that coherent vortical structures and their 
dynamics are captured, even at modest resolutions, with fidelity 
rivaling purely Lagrangian methods. 
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Shortcomings There are a number of practical issues and open 
theoretical questions at this time. 


The simple splitting method we employ for time integration ex- 
hibits loss of kinetic energy. Even though its impact is not as 
problematic due to the Landau-Lifshitz term, design of an in- 
tegrator more tailored to the particulars of ISF would be very 
interesting. 


Design of flows is not yet fully general. For example, we do not 
know how to express arbitrary forces at the level of a, limiting 
the use of standard special effects design tools. 


With discrete circulation given by the difference of phases along 
an edge (4), there is an upper bound on the velocity that can 
be represented without aliasing. Ideas from [Knoppel et al. 
2015] may help here. A related issue arises at the global level, 
where the periodicity of the domain leads to quantization of grav- 
ity /buoyancy. 


Our method has only one parameter, f, which controls the quanti- 
zation of vorticity. Notably there is no parameter to control viscos- 
ity, though there are diffusion | Frisch et al. 1992] and drag [ Sasaki 
et al. 2010] phenomena. 


The implicit integrator for constraints is of low order, occasionally 
leading to leakage of fluid into the interior of obstacles. A higher 
order integrator for the constraints would therefore be desirable. 


Open Questions and Opportunities In this paper we have only 
scratched the surface of the theory underlying ISE 


An interesting direction to pursue is the simulation of multi- 
phase fluids, e.g., air and water. When applying gravity/buoyancy 
forces we have already treated the two components of y differ- 
ently. How far can this be taken? How do interface forces enter? 
Are there interesting physics to be modeled with wave functions 
w:M —>C" forn> 2? 


At the theory level we would like to understand the limit of A — 0. 
Does it yield Euler fluids? How can we characterize the velocity 
fields which can be represented by y € M resp. s € C®(M,S7)? 


Clearly we are only at the beginning of exploring this new ap- 
proach to fluid simulation and are hopeful that many new math- 
ematical, physical, and simulation tools will spring from it. 
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A Pressure Potential 


For incompressibility we require constant density p = |w|? = 1 
and div(v) = 0. When p = 1 the constraint div(v) = 0 is equiv- 
alent to (Aw,iw). = 0. This condition maintains the pointwise 
normalization under Eq. (1) since 


sD? = 2), bp =—H(Awp, ie — Zp lip, pp = 0. 


To ensure (Aw, iw), = 0, observe that for an arbitrary y € 
C°°(M, C7) with |y|? = 1 there exists a smooth phase q: M > R 
so that e“4y) is divergence free (Sec. 4.1). Therefore the term 
pw in Eq. (1), which generates general phase shifts, suffices to 
keep w divergence free. 


Explicitly, the scalar potential p: M — R in Eq. (1) ensures a 
divergence free velocity field when it solves 


Ap =" ((A*p,)2—(Ap,Ay)g) with | p=0. (13) 


To see this, note that (Aw, iy). must have vanishing time deriva- 
tive, yielding 


O=AZ (Ay, ip)e = (Ay, itp), — (Aity, y)p 
= (Ay, —S Ap t pb)a + (EA2p— Ay), P)e 
= (Ap, Pp) — (Ap, Av)2) —Ap—2(dW(Vp), Px, 
which proves Eq. (13) since dy 1 y due to ||? = 1. 


This p serves as the Lagrange multiplier for the divergence con- 
straint in the continuous time setting. For discrete time, i.e., at 
each time step, this role is played by q of Eq. (10), which pressure 
projects a given y to the constraint space (Sec. 4.1). 


B Evolution of ~ 


Here we rewrite the Schrodinger equation to expose the advection 
going on “under the hood” and we will find that w is advected by 
the velocity field subject to a modified pressure potential and a 
Landau-Lifshitz term 


pt Lp =—iy (4(s x As) +5) (14) 


where p = 2 — 2\dy|? — 2(s, As). 


Eq. (14) follows from replacing Aw in Eq. (1) with 


Aw = xd (itp (* — § «ds)) 
= i+ (dy A*f)— 4 «(dy Axds)— SpAs 
= iL + sy * (4 — ;ds) A «ds) — spss 
= tLiy +w (x (HA*(H+ widw)) + ; * (ds A *ds) — ssAs) 
=i2Lp+y (17? + z\ds|? — ssAs) 
=i2L,ptw (IA + Zldsl? + 5(s,As)—$(sx As)) (15) 
which used Eq. (9), *d*7 = 0, the Hermitian product of two 
quaternionic k-forms, (a, 8) = *(a@ A *f), the Lie derivative of a 


function, £,% = dy(v) = «(dy Ax*n) = *(n A xdw) for v = 7}, 
and sAs = —(s, As) +s x As. 


C Evolution of 7 


To derive the evolution equation for 7 we differentiate Eq. (14), 
multiply the result on the left with —i, and finally take its real 
part. 


We begin with the lhs of Eq. (14) 


—Re(pi(dy + L,dyp)) 
=Re(—piipy + u—pil, (ipu)) 
=Re(ypu) + +Re(plLput ppl) 
= Re (wp) (—i} W(s x As) + pip) uw) + 44+ LF 
H+ Lyh— Z(As, ds), 


since daQpy) = 0 = dyw + wdw implies wdyw € ImH and 
hence pwiwu = —pwiwwidy = pywdw € ImH. Also, since 
s x As € ImH and orthogonal to s, s(s x As) € ImH and hence 
Re(s(s x As)) = —i(s x (s x As), ds) = + (As, ds). 


Differentiating the rhs of Eq. (14), multiplying on the left with 
—wi, and taking the real part leaves only 


—1 Re (ypdyp (s x As)) + dp =—4 Re(s (7 — $ds) (s x As)) + dp 
— 2 (s x ds,s x As) + dp. 


It follows that the velocity 1-form 7 evolves as 


H+L,NH= hdp+ > “(As, ds). 


D_ Discrete Circulation 
Here we prove that the discrete velocity 1-form is given by 


Nw = A arg (yy, Wwe 


77:13 ¢ Schrddinger's Smoke 


and arises, as is standard, from computing the circulation of the 
smooth velocity 1-form along the straight edge vw 


Nw = nf (dy, i)p- 


This formula presupposes that we have chosen along vw a curve 
y: [0,1] — C?, |y(t)|? = 1 that interpolates between w, and 
w,. We will assume (yw, W,,)c # 0, so there is a unique short- 
est geodesic path c: [0,1] — S* with c(0) = iw, and c(1) = 
Wylw,,. We will construct y in such a way that yiy =c. 


We distinguish two cases: (1) yw, and w,, are linearly dependent 
or (2) independent, and construct y explicitly in each case. 


Suppose they are dependent, i.e., py is a complex scalar multiple 
of y,, then in fact py = ewy,. Letting a: [0,1] > R with 
a(0) =O and a(1)= ,,, y(t) := e'* w, will serve our purposes 
and 


1 1 
n| (dy, iY) = n| da = vw = Rarg(y,, Wwe: (16) 
0 0 


since (dy, iy)p = (iday,iy)z = da. 


Suppose now that y, and w,, are independent, Le., (W,, Py)c = 
cos(f )e"™, with 7,,, € (—7, 7) and B € (0,7/2). "Deane od i= 
esc()(e"™ y,,—cos(B )ap,). One easily checks that (y,, @)- = 0, 
|p|? =1 and 


Py = e'™ (cos(B)yp, + sin(B)p) 


Let now y(t) := e'*(cos(Bt)w, + sin(Bt)d) and substitute in 
Eq. (16) to find the desired result. 


E Laplacian Eigenvalues 


To use Fourier methods for Schrodinger integration as well as 
the Poisson solve we need the eigenvalues of the 3D Laplacian. 
Letting the index of a vertex be v = x,y,z, the spatial resolution 
becomes a triple of edge a l= (l,, l,,1,), and the overall 
cube measures L = (1,.N,., LN, 1,N;). 


The eigenvalues for the ae (A) resp. continuous (A) Lapla- 
cian on a periodic domain are 


A, =— sin’ ()— g sin’ (Ht) — Z sin” (=) (17) 
2, =n (3+ +2 z). (18) 


The corresponding eigenvalues for the 3D Laplacian with Neu- 
mann boundary conditions, i.e., flow tangential to the wall, arise 
from Eqs. (17) and (18) by replacing L < 2L. In that case the 
DCT replaces the FFT in Algs. 2 and 3. 


In the continuous case Eq. (18) can be checked by direct differen- 
tiation of the Fourier basis, while Eq. (17) uses a centered second 
difference and some straightforward trigonometric identities. 
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Clebsch map. This makes Clebsch maps useful for visualization and fluid 
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1 INTRODUCTION 


In his study of the equations of hydrodynamics, Alfred Clebsch 
in 1859 proposed what is now known as the Clebsch representa- 
tion [Clebsch 1859]. It encodes the fluid velocity field u: M > R?° 
(M C R?®) with the aid of a function y := (A, , ¢): M > R’, writing 


u = A grad p — grad @¢. (1) 


The vorticity vector field w, in turn, is represented by the function 
si=(A,p): M> R2 


w = curlu = grad J xX grad (2) 
due to curl(grad) = 0. 
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Fig. 1. Visualization of the vorticity field produced by the flapping wings 
of a Hummingbird. Flow data and photogrammetrically acquired bird 


geometry courtesy Haibo Dong, Flow Simulation Research Group, University 
of Virginia [Ren et al. 2016]. See the video at 02:00. 


Such “encodings” of velocity resp. vorticity fields are useful for flow 
visualization [Kotiuga 1991], analysis [Jeong and Hussain 1995], 
simulation [Brandenburg 2010; Cartes et al. 2007; He and Yang 2016; 
Yang and Pullin 2010], and enhancement among others. For example, 
Eq. (2) implies that pre-images of points, s-!({p}), correspond to 
closed vortex lines (integral curves of the vorticity field), and pre- 
images of regions, s~!(Q), to vortex tubes (Fig. 1). Similarly, given s on 
some computational grid one can add properly aligned turbulence at 
the subgrid level through simple manipulations involving s (Fig. 11). 


So what’s not to like? There are a number of problems with Clebsch’ 
original proposal. For example, it can only represent fields with zero 
helicity (Thm. 4.1). Moreover, near points of vanishing vorticity, 
smooth functions y may not exist, even locally [Graham and Henyey 
2000]. These problems can be addressed with generalized Clebsch 
maps [Cartes et al. 2007; Graham and Henyey 2000; Zakharov and 
Kuznetsov 1997]. Unfortunately, most of these no longer yield a 
level set representation for vortex lines. To maintain the latter we 
need to replace the range space of s with a surface more general 
than R? (and similarly for /). Indeed, using the 2-sphere S? for s and 
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the 3-sphere S° for w yields spherical Clebsch maps. These are the 
only known example capable of representing fields with non-zero 
helicity and giving a level set representation for vortex lines [Chern 
et al. 2016; Kotiuga 1991; Kuznetsov and Mikhailov 1980]. 


What is missing, however, is an algorithm to find a Clebsch map for 
a numerically given vector field. In this paper we give an algorithm 
to produce spherical Clebsch maps i approximating a vector field 
u on some domain M. (While such vector fields often arise from 
incompressible fluid simulations, there is no requirement that the 
field be divergence free.) The desired i is found through minimiz- 
ing a non-linear Dirichlet energy, which encapsulates fidelity and 
smoothness terms with a variable trade-off. 


Overview. Before we delve into the details of our algorithm, we set 
the stage with a more abstract definition of Clebsch maps (Sec. 2). 
To make this manageable in the face of manifold-valued functions 
we employ the formalism of exterior calculus. (Readers wishing 
for accessible explanations of differential geometric concepts are 
directed to [Frankel 2012], while [Abraham et al. 2001, Ch. 7] provides 
a comprehensive review of exterior calculus.) With that machinery 
available we are equipped to describe spherical Clebsch maps (Sec. 3) 
and formulate a strategy for finding such maps for a given velocity 
field (Sec. 3.1). The remainder of the paper is devoted to validation 
(Sec. 4) of the basic method and suggestions for applications (Sec. 5). 
Proofs and implementation details are delegated to Apps. A to E. 


Table 1. Notation. 


Notation Meaning 

r a point on M 

u velocity vector field on M 
n=u-dr velocity 1-form 

w = curlu vorticity vector field 

@ = dn=w- (dr x dr) vorticity 2-form 

H =R* quaternions 

(+, +) scalar product on R# 
Q*k(M:V) V valued k-forms on M 
yf=fow pullback of a function f 


y*a(-) = a(dy(-)) pullback of a 1-form @ 
y*o(-,:) = o(dy(-), dW(-)) pullback of a 2-form o 


2 GENERAL CLEBSCH MAPS 


Instead of working with the vector field u we now work with the 
corresponding 1-form (co-vector field) 7. 


Let 7 := u- dr bea velocity 1-form on M, then Eq. (1) reads 
n = Adu — d¢. 


This can be expressed succinctly as the pullback of a fixed 1-form 
a = xdy — dz by a function yp = (A, p, d) 


n=Woa. 
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The formalism of pullback encapsulates the underlying change of 
variables. Since exterior derivative and pullback commute Eq. (2) 
becomes 


a = dy = d(y"a) = "da = "(dx A dy), 


where we used dod = 0. Since the wedge product dx A dy is the area 
2-form dAp2 on R’, we can also write w = s*dApe for s = (A, 11), 
giving us 


[io [ saan: = Area(s(Q)). (3) 


This equation tells us that the vorticity flux over some surface Q C M 
can be found by computing the (signed) area (with multiplicity) of 
the image of Q under s. 


Importantly, having rewritten Clebsch’ original proposal using 
pullbacks we now see the underlying geometry. A Clebsch map 
describes a variable 1-form through a fixed 1-form and a varying 
function. This leads to 


Definition 2.1. Clebsch Representation Let C and » be manifolds 
with a € Q'(C:R) a fixed 1-form and o € Q7(=;R) a fixed 2-form, 
and 2: C — » such that 2*o = da. We call a map 


e wy: M > C aClebsch map for n € Q'(M;R) if y*a = 7. 
e s: M > YaClebsch map for w € 27(M;R) if s*o = w. 


A Clebsch map y: M — C for n automatically implies that s := 2 oy 
is a Clebsch map for dy = w. Conversely, in many situations it is true 
that given a Clebsch map s: M — * for w = dn there will always 
exist a Clebsch map : M — C for n such that s = 2 o w (Thm. 3.1). 


Def. 2.1 tells us that Clebsch maps are distinguished by their choice 
of manifolds (C, =), 1- resp. 2-forms (@, 0), and the functions (1, s) 
connecting them: 


e Clebsch’ original proposal [1859] uses 7: M > R°, y = 
(A, u, p) with a = xdy — dz to get a Clebsch map for n, and 
mw: R? — R? with o = dAg: for w. Such maps can only 
represent fields with vanishing helicity (Thm. 4.1). 

e A natural generalization of Clebsch’ original proposal uses 
n-dimensional A and p [Cartes et al. 2007; Graham and 
Henyey 2000; Zakharov and Kuznetsov 1997]. Now yy = 
(A, pd): M > R” XR” XR for n = 2,3 anda = x-dy-—dz 
using the coordinate functions (x, y,z) on R” x R” XR, 
yields a Clebsch map for y. In turn w has a Clebsch map 
s:M —> R” XR"”,s = (A,p) with o = dx A dy. Since 
dim() = 2n > 2 there is no accessible representation of 
vortex lines. 

e Chern et al. [2016] gave a Clebsch map w: M — S? for 7. 
Using the embedding q of S° into the space of quaternions 
H, we can write a = hi(dq, iq) where h > 0 is a dimensional 
constant. A Clebsch map for w with o = hd Ag is obtained 
with the Hopf map zr: S° — S*, 2 = gig [Hopf 1931; Lyons 
2003]. Their Clebsch map s for w alone was also considered 
earlier [Kuznetsov and Mikhailov 1980] though only Chern 
et al. used it in a computational context. 


The particular form of Clebsch’ original proposal was motivated 
by the fact that the expression must be non-integrable to be able 


to describe flows with non-zero vorticity. In fact, @ is a contact 
form whose kernel is nowhere Frobenius integrable [Geiges 2006] 
(see [Delphenich 2017] for its relation to Clebsch’ proposal). 


For completeness we note that Clebsch maps using three-component 
(A, 4) can recover the Cauchy-Weber transform [Cauchy 1815; Weber 
1868] of the Euler equations for incompressible as well as barotropic, 
ideal (inviscid) fluids [Euler 1757]. These representations have been 
extended in the Eulerian-Lagrangian approach [Constantin 2001a,b] 
to include incompressible viscid (Navier-Stokes) fluids. For an his- 
torical account tracing these developments back to Cauchy [1815] 
see [Frisch and Villone 2014]. 


In Computer Graphics the classic Clebsch maps (A, 1) have made 
an appearance as representations of divergence free vector fields 
for purposes of geometric modeling [Angelidis and Singh 2007; von 
Funck et al. 2006]. In this setting (A, 1) are control variables for the 
deformation field and no inverse problem, finding the variables if 
the field is given, needs solving. 


Summary. The only known Clebsch maps which maintain the com- 
pact representation of Clebsch’ original proposal, can recover vortex 
lines as pre-images of points, and approximate velocity fields of 
arbitrary helicity, are those of Chern and co-workers [2016] and 
Kuznetsov and Mikhailov [1980]. Hence we will focus on spherical, 
i.e, S°- resp. S*-valued, velocity and vorticity Clebsch maps in the 
remainder of this paper. 


3. SPHERICAL CLEBSCH MAPS 


Before diving into the algorithms we need to clarify the relationship 
between velocity and vorticity Clebsch maps. In the classical setting 
this was quite straightforward. In the spherical case it is very similar 
but it needs a little more work to see this. 


In this paper we follow [Chern et al. 2016] and work with C = S°, 
a = h(dq, iq) (for the choice of h > 0 see Sec. 5), & = S*, o = Ad Ag 
and a the Hopf map, z = gig. The fact that z“o = da follows from 
[Chern et al. 2016, Thm. 1, Sec. 4.3] applied to the special case yy = q. 


Conversely, if we have a vorticity Clebsch map s: M —> S* then 
there exists an almost unique Clebsch map for 7: 


THEOREM 3.1. Existence of Lifts Suppose we have a Clebsch map 
s: M — S* for an exact 2-form w. Then there is a Clebsch map 
w: M > S? for some n with w = dn and oy =s (see App. A for a 
simplified proof). 


Note that 7 is not required to be divergence free, though dn is of 
course always divergence free. 


CoROLLARY 3.2. Non-Uniqueness If is a Clebsch map for a velocity 
1-form n, it cannot be unique. If f : S? — S* is any area-preserving 
map, thens := f omow isalsoa Clebsch map for w = dy. By Thm. 3.1 
there is another Clebsch map W for n with 1 0 " =. 
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3.1. Finding a Spherical Clebsch Map 


For a given no an approximating velocity Clebsch map w can be 
found as a minimizer of 


EW) =f in-m? (4) 
M 
for 7 := W*a, yielding the vorticity Clebsch map s = z 0 w as well. 


Suppose we have found an optimal approximation y for the given no. 
By Cor. 3.2 such a y gives rise to infinitely many more minimizers "i 
through area preserving diffeomorphisms of S*. To fix this degree of 
freedom we add the Dirichlet energy of s as a regularizer to Eq. (4), 
seeking now a minimizer as € — 0 for 


argmin E*() 


EW) where EW) = | §lds + dln nol. 6) 
y: MS 


This leads to our main algorithm: 


Algorithm 1 Optimization scheme for Eq. (5) 


Input: 79 € Q'(M;R) andh > 0. 
Randomize yO: M => 8S’; 
e) = 1; 
for k = 0,1,2,...do 
yPorD) = argminy pe (/) with initial guess py"), 
elk+) < €8)/19 
end for 


3.2. The Energy E* 


In this section we show that the energy of Eq. (5) can be rewritten 
as the Dirichlet energy of y under a suitably chosen connection and 
metric. To facilitate this derivation we represent elements y of S° as 
unit quaternions. This allows us to use straightforward quaternion 
algebra to arrive at the final expression (Eq. (7)) of our energy. 


Given no, define the connection (covariant derivative) V7 for S>- 
valued functions y 


Voy := dp -iPy. (6) 


For a givenr € M, (V”),; is a linear map from T;M to TyxyS° C H. 
Choose {y, i, jw, kw}; as an orthonormal basis for H, where {i, j, k} 
denote the quaternionic imaginary units, and note that Ty(xyS° = 
Span{iy, ji, ky}. Now split H into the orthogonal sum of the two 
subspaces 


Cy, := Span{y,iv}; and Cj, := Spantiy, kw};, 
and decompose (V°y), accordingly 


(VP): = Poy, (VP) + Pejy (Vy), 


where Poy, and Pcjy, denote the corresponding projection operators. 
It is not difficult to work out (App. B) that 


Poy (Vy) = <n — noi and Poy (Vp) = —Zipds. 
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Thus we can recover the integrand of Eq. (5) with the aid of an 
anisotropic metric on S°. At each point w(r) € S° define a norm on 
tangent vectors X € TyxyS° C Has 


IXIE := |Pey (XI? + €lPejy (XI? 


Such a metric on S°, with an anisotropy along the iy direction, is 
known as a Berger metric and S* with such a metric as the Berger 
sphere. Using this metric, Eq. (5) is seen to be the connection Dirichlet 


energy of y 
ES() = /, VOY, (7) 


Summary. We cast the problem of finding spherical Clebsch maps, 
which approximate user supplied velocity fields no, as a minimiza- 
tion problem. The objective is the Dirichlet energy of using the 
connection induced by no employing a Berger metric. The met- 
ric anisotropy parameter € controls how much of the regularizer 
||ds||* is added to the fidelity term ||7 — 70||*. For each fixed e, ina 
decreasing sequence, the energy is then minimized. 


3.3. Discussion 


We have reduced the problem of finding an approximate Clebsch map 
y for a given no to finding a minimizer of the connection Dirichlet 
energy of i (Eq. (7)). While we can hope for good approximations 
there are theoretical obstacles to finding exact solutions: (1) Isolated 
zeros in the vorticity field cannot be recovered by a spherical Clebsch 
map (Sec. 4.2); (2) vortex lines for S* valued Clebsch maps are always 
closed (or begin and end on the domain boundary) while generically 
they are open (Sec. 4.3); (3) spherical Clebsch maps have quantized 
helicity, while general velocity fields can have arbitrary helicity 
(Sec. 4.4); and (4) owing to the non-linearity of the energy it is 
possible to get stuck in local minima. 


In Sec. 4 we demonstrate that these theoretical limitations do not 
stand in the way of the practical utility of the approximate maps we 
compute. 


There are also practical issues which are independent of the contin- 
uous setup: (1) finite resolution of the discretized system impacts 
the topology of the field; (2) the connection Laplacian can be very 
ill-conditioned; and (3) boundary conditions need custom tailoring 
when M “cuts” the field. 


In our proof of concept implementation we minimized Eq. (7) through 
its L? gradient flow, w = — grad E€(). This is a rather simple first 
order method whose main virtue is its straightforward implemen- 
tation. To discretize this PDE in time we used the backward Euler 
method to allow for large, stable time steps. This requires the solu- 
tion of C-valued Poisson problems for which we used a standard 
conjugate gradient solver. For production use the latter would need 
an effective preconditioning strategy to address the ever increasing 
ill-conditioning under refinement, common to discretizations of 
second order elliptic operators. Implementation details can be found 
in App. C. Our implementation in Houdini 15 is included with the 
ancillary materials in the ACM Digital Library. 
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Table 2. Performance statistics. Time is given in units of minutes on a 
MacBook Pro, while “steps” gives the number of time discrete steps taken 
by the minimizer. 


Fig. Resolution #(steps) time ea ah 
1 64° 120 9:55 1.8x107! 1.05 
2 128° 100 1:08 1.8x107* 0.0012 
4 50° 130 6:10 1.9x 107 0.23 
5 649 150 20:00 5.4x 107 0.15 
7 64% x 48 30 2:12 1.6x107! 0.3 
8 81x41? 100 6:04 3.0x10~7 0.86 
10 64° 100 8:00 8.4x 107% 121 
12 64x32? 120 1:25 1.3x 107% 0.047 


4 VALIDATION 


Before discussing benchmark problems we briefly describe our 
approach to visualization. 


To visualize Clebsch maps we use pre-images of sets on S* under s. 
For individual points p € S*, s-'({p}) C M gives vortex lines, while 
pre-images of regions 2 c S* yield vortex tubes. 


To construct s~!({p}) we represent s as a complex function ¢: M > 
C through stereographic projection from the antipode —p. The set 
{r € M|s(r) = p} is then given by the zeros of ¢ and extracted as 
in [Weif{mann et al. 2014, Sec. 3]. 


To visualize a vortex tube s-!(Q) C M we take some level set 
function on S*, y: S* > R with y = 0 on OQ. Extracting the zero 
iso-contour of y os then yields the vortex tube surface in M. For the 
images of vortex tubes shown in this paper we used s = (51, s2, 53)? 
directly, drawing sj = 0.8 in gold and s; = —0.8 in blue. The opacity 
of the vortex tube was set proportional to vorticity magnitude, 
making it inversely proportional to the cross-sectional area of the 
vortex tube. Treating p = (+1,0,0)™ as the north resp. south pole 
of S*, the longitude angle arg() — arg(/2) for W = (4, f2)T was 
used as a texture coordinate on the vortex tube surfaces for a brush 
texture, making the brush “lines” tangent to vortex lines. 


4.1 2D Example 


Fig. 2 shows a velocity field and its S*-valued vorticity Clebsch map. 
The disk, rotating rigidly, has constant, non-zero vorticity. This forces 
the Clebsch map to wrap the central disk multiply around a small 
(4 = 0.15) 2-sphere in an area-preserving manner. Simultaneously, 
the map is as conformal as possible, since the Dirichlet energy 
of s is the same as the conformal energy, up to a constant area 
term [Hutchinson 1991]. Outside the central disk, due to vanishing 
vorticity, the map is (nearly) constant, i.e., it “covers” no area. This 
shows that our minimizers are not necessarily smooth. However, 
they appear (empirically) to be Lipschitz and smooth away from 
finitely many points. Fig. 3 shows the convergence plot for Fig. 2 
which is typical of all the examples shown in this paper. 





Fig. 2. A velocity field on a square (left) and its S*-valued vorticity Clebsch 
map (right; visualized via an earth texture). The velocity field has non-zero, 
constant vorticity in the gray disk and zero vorticity outside. 


— |? error 
— |” error 
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Fig. 3. Convergence of the Clebsch map in Fig. 2 as a function of gradient 
descent iterations with At = 1 and e = 1 (0-20), e) = 0.1 (20-40), 
e() = 0.01 (40-60), e%) = 0.001 (60-80), and e) = 0 (>80). 


4.2 Presence of Isolated Zeros 


Graham and Henyey [2000] proved that a vorticity field w with an 
isolated zero at some r € M does not admit an R* valued Clebsch 
representation in the vicinity of r. Their argument also applies to S? 
valued Clebsch maps. In our case the presence of the regularizer 
ensures that an approximate solution is found nevertheless. Fig. 4 
shows a Clebsch map for a linear vorticity field on R°?, and we see 
that our algorithm deals gracefully with the isolated zero at the 
origin. 


Even though no exact Clebsch map (at least not a smooth one) can 
exist for this field at the origin, we still get meaningful approximations 
of the field, as visualized with the vorticity integral curves. 


4.3. Non-Closed Vortex Lines 


In a generic flow most vortex lines are not closed. But Clebsch maps 
with target space R? or S* possess only closed (or beginning and 
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Fig. 4. On the left vortex lines traced in the vorticty vector field w = 
(x, y, -2z)™ ona cube with side length three. On the right the pre-image of 
ten points on S? for our Clebsch map approximating w (fi = 0.05). 


ending on the boundary) vortex lines. Hence flows with vortex lines 
that neither close nor hit the boundary do not possess such a Clebsch 
map [Hadamard 1903, §68]. 


By the Poincaré recurrence theorem [1890], almost every such 
vortex line will return arbitrarily closely to its initial point, making it 
“almost closed.” Our Clebsch maps approximate these almost closed 
vortex lines by closed ones. Note that this “closure” depends on the 
resolution of the data. Fig. 5 gives an example of such a situation 
where most vortex lines of the original field do not close up, while 
the vortex lines of the field corresponding to the Clebsch maps are 
all closed. 


Even though no exact Clebsch map exists for such fields, we still get 
a meaningful approximation with our method. 





Fig. 5. Clebsch map for a vorticity field w = 1.5w, + We, confined to a 
ball, with wy = (xz, yz, 1 — 2(x* + y*) — z”)T and we = (-y, x, 0)7. Both 
W1, W2 have closed vortex lines, but the ratio of their respective periods is 
irrational, ensuring that most vortex lines of w are non-closed. The field 
generated by the Clebsch map, visualized as pre-images of areas on S? (left, 
blue tubes), approximates the field w (left, green trails) in a way that each 
vortex line is closed (right, two such vortex lines, i.e., pre-images of two 
points on S*). 


4.4 Helicity (Linked Vortex Lines) 


Consider a vorticity vector field w on R°, vanishing outside a closed 
ball M. Then there is a (non-unique) vector field u on M such that 
curlu = w. The quantity 


How) =f uw= fare (8) 
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is independent of u (resp. 4) and called the helicity of w (resp. 
w) [Arnold and Khesin 1998, Ch. 3]. 





Fig. 6. Example of two linked curves (linking number —4). 


To understand the geometric meaning of helicity consider two closed 
vortex lines, one through r € M and the other through r € M. Let 
Ik(r, r) denote their linking number [Gauf} 1833, p. 605], an integer 
indicating how often one vortex line winds around the other (Fig. 6). 
For arbitrary r, r € M, whether the vortex lines through r and r close 
up or not, one can still define lk(r, r) even though it is no longer an 
integer [Arnold and Khesin 1998, Chapter III, §1]. As before, lk(r, r) 
measures the amount by which the two curves “spiral around each 
other.” 
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Fig. 7. Clebsch maps for vorticity fields, supported in the interior of a torus, 
with increasing helicity and a fixed # = 0.2. The graph shows the helicity 
recovered by the Clebsch map and the L? error in velocity. Each cross section 
of the torus is passed by 5 vortex tubes for each of the two colors. In this case 
the linking number between any two tubes must be a multiple of 5 [Bush 
et al. 2017], giving rise to the gaps 5(27f)* ~ 7.9 in each jump discontinuity 
in the plot of helicity. Note that vortex lines connecting to the boundary 
allow continuous helicity; such events occur at large given helicity. See the 
video at 00:36. 
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The relevant fact for us is that helicity can be expressed in terms of 
these (generalized) linking numbers 


H(w) = J J Ik(r, r) 


So we see that helicity measures the average linking of vortex lines 
in a field. This reveals the main reason that classical Clebsch maps 
are of limited use: 


THEOREM 4.1. Vanishing Helicity If w can be represented by an 
R*-valued Clebsch map then H(w) = 0 (see App. D for a proof). 


On the other hand, for S*-valued Clebsch maps helicity can be 
non-zero, but is quantized: 


THEOREM 4.2. S*-valued Clebsch Maps If w can be represented by 
a Clebsch map that takes values in S* and w = A s*dAcpo, then there is 
an integer n such that H(w) = n-h* for h := 2h (see App. D for a 


proof). 


Even though for S*-valued Clebsch maps the helicity is quantized to 
integer multiples of a (usually small) number, it does not present 
a practical obstacle to the approximation of fields with arbitrary 
helicity (Fig. 7). 


Summary. In this section we demonstrated that spherical Clebsch 
maps perform well in practice even in the presence of otherwise 
challenging input fields. Specifically, they yield good approximations 
in the presence of isolated zeros (Sec. 4.2), gracefully approximate 
non-closed vortex lines (Sec. 4.3), and work well with fields carrying 
non-zero helicity (Sec. 4.4). 


5 APPLICATIONS 


After considering specific benchmark configurations we now turn 
to brief application vignettes to suggest possible uses for Clebsch 
maps. 


5.1 Vorticity Visualization 


Vorticity fields are of great interest in the study of low Mach number 
flows because the evolution of vorticity reveals much about the 
underlying flow dynamics [Saffman 1992]. One way to visualize 
vorticity is through vortex lines. A challenge in using vortex lines is 
the difficulty of selecting seed points so that the vortex lines have a 
spatial density proportional to vorticity magnitude. For S*-valued 
vorticity Clebsch maps this is achieved by picking points on S? 
equidistributed with respect to area. 


More specialized methods try to identify regions associated with 
vortices (for a comprehensive review see [Jiang et al. 2005]). One 
such method visualizes level sets of the vorticity norm |w| to capture 
regions of high vorticity (Fig. 8, upper left). Since directional infor- 
mation is ignored the resulting surfaces are misaligned in general, 
no longer being tangent to the vorticity field. A more sophisticated 
approach, the so-called A2-method [Jeong and Hussain 1995], anal- 
yses the velocity gradient tensor, but can still yield results which 
do not correctly represent the underlying vortex lines (Fig. 8; cf: 


Fig. 9). In contrast, with a vorticity Clebsch map one can draw the 
pre-image of one or several closed curves on S*, yielding proper 
vortex tubes (Figs. 8 and 1). 








Fig. 8. Vorticity visualized through a |w| iso-surface (top-left), the A2- 
method (top-right), and a Clebsch map (bottom) for the Delta Wing data 
set [Ekaterinaris and Schiff 1990] (40° angle of attack, Mach 0.3, Reynolds 
number 10°). Gray lines in the background are integral curves of the velocity 
field to give an overall sense of the flow. See the video at 01:39. 





Fig. 9. Comparison of vortex tubes from the vorticity Clebsch map with 
traced integral curves of the original vorticity field (seeded at the wing 
edges), verifying the Clebsch map result (cf. Fig. 8; bottom). 


The Parameter h. Since the parameter f shows up as the factor in 
= A s*d Aso, the vortex tubes drawn by the pre-image s~!(Q) 
of a region Q c S* have strength 2 Area(Q). For a given flow 
and a region ©, the parameter fi controls the spatial frequency of 
visualized vortex tubes, increasing them as hi is lowered. To avoid 
aliasing artifacts h must be chosen relative to the sampling rate 
Ar (inverse grid resolution) and velocity magnitude |u|, to satisfy 
th > 2Ar|uloo (fh has physical dimension m? s~‘). 


Time Coherence. We visualize pre-images of fixed sets in S*. However, 
the vorticity Clebsch map s arising from the minimizers of Eq. (7) 
are only unique up to a global rotation of S*. Hence, the locations 
of the selected vortex geometries depend on the initial guess yO 
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in Alg. 1. For a discrete time sequence of flow data sets one can 
ensure time coherence by initializing Alg. 1 with the time advected 
minimizer from the previous time step. For an example see the video 
of the Hummingbird flapping. 


5.2 Initial Data for Incompressible Schrédinger Flow 


With the fluid state represented in a Clebsch map one might 
ask what time evolution should obey for fluid simulation? A 
straightforward approach is to have passively advected fy + 
di/(u) = 0 by its associated pressure projected velocity u. This yields 
dynamics in which all vortex lines are advected by u and therefore 
u satisfies the Euler equation. Unfortunately this type of Eulerian- 
Lagrangian approach leads to chaotic mixing with initially small 
||dy|| and ||ds|| amplified unboundedly. The necessary resampling 
of involves averaging based on the grid resolution and hence 
erases velocity information encoded in the derivative of yy (an even 
more severe version of the numerical diffusion seen in backward 
advection schemes [Fedkiw et al. 2001; Stam 1999]). 


Surprisingly there is a variant for the evolution of ¥ called incom- 
pressible Schrédinger flow (ISF) [Chern et al. 2016] which avoids 
Lagrangian chaos by including a small amount of the Dirichlet en- 
ergy of the Clebsch map s in the energy. The resulting time evolution 
is Hamiltonian, implying that ||dy|| and ||ds|| are both uniformly 
bounded in time under ISF. 


One drawback of the approach in [Chern et al. 2016] was that the 
initial wo data had to be carefully handcrafted for every simula- 
tion. While the authors mention the possibility of using filament 
sets [Weif§mann et al. 2014] the approximation quality of such an 
approach is unclear. Our method directly searches for an optimally 





Fig. 10. Fluid simulation by ISF with initial Yo derived from a general uo. 
Here ug is the divergence-free projection of a rigid-body rotation interior 
to the bunny (and zero velocity else). The corresponding initial vorticity is 
uniform in the bunny driving a rigid rotation, and has a concentrated vortex 
sheet on the surface giving rise to a Kelvin-Helmholtz instability. See the 
video at 01:16. 
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close representation of an arbitrary input field which can then serve 
as initial condition for ISF (Fig. 10). 


5.3 Flow Processing 


A vorticity Clebsch map encodes vortex lines as level sets of s. This 
makes it easy to “rearrange” the vortex lines through manipulat- 
ing (and thus s), enabling a whole new class of flow processing 
approaches. 


An example of this is post-composition  := = o y with a map 
=: S° — S?. Here we only consider maps = with a corresponding 
E: S* — S? satisfying 2 o & = & oa. Under this assumption the 
modification of y corresponds to a modification s ‘= € os of s. 


An interesting example of &: S? — S* “wraps” S? multiple times 
over itself (App. E gives an explicit example of a multiple, branched 
covering). The number of times £ wraps around S? is called the 





Fig. 11. Top: Jet simulation using RK4 backward advection and MacCormack 
time marching. Middle: the same scheme with an additional flow processing 
step (Sec. 5.3) at an earlier time to represent finer vortical structures aligned 
with coarse ones. Bottom: the result of adding isotropic turbulence at an 
earlier step. All simulations on a 256 X 128 x 128 grid while the Clebsch map 
finder used a resolution of 64 x 32 x 32 (due to performance reasons) which 
was interpolated to the high resolution grid. See the video at 00:54. 
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mapping degree 

_ Area(é(S7)) 

~~ Area(S?) 
where Area(.) computes the signed area with multiplicity. The 
mapping degree mz shows up as a renormalization when reading 
off the processed velocity 7 := mig (MW iy’), ensuring the overall 
vorticity flux remains unchanged. 


Algorithm 2 Flow Processing 


[oe 


Input: 70,4, 5 
yw < Find Clebsch map (79, h) 
ycsoy 
i — ane (Ay, ip) 


return PressureProject (7) 


In Fig. 11 we modify a velocity field at a single frame of a jet 
simulation by Alg. 2 with a degree four map &€ = &nobius ° tetra 
(App. E). This particular modification cascades each vortex into four 
finer ones, and then concentrates the vorticity in the fine scale as 
shown in Fig. 12. Effectively this adds small eddies aligned with 
the large scale vortex direction in a vorticity conserving fashion 
(Fig. 11, middle) as predicted by the Kelvin-Helmholtz instability that 
is otherwise not captured at the current resolution of the simulation 
(top) or by adding isotropic turbulence [Bridson et al. 2007] (bottom). 





Fig. 12. A Clebsch map s (left) obtained from the velocity data of a jet 
simulation is modified (right) by § = € o s with a sphere map é: S* > S? 
that effectively concentrates vorticity. It is used for Fig. 11 middle. 


6 OUTLOOK 


The appearance of spherical Clebsch maps in the recently introduced 
method for incompressible fluid simulation using wave functions 
y [Chern et al. 2016] motivated us to study Clebsch maps more 
closely and seek for a method to find such maps for a given flow 


field. 


While there are theoretical limitations to such spherical Clebsch 
maps these do not stand in the way of producing useful approxima- 
tions (Sec. 4). Because these Clebsch maps encode the velocity resp. 
vorticity field they contain all information about the field in a novel 
form. This can be useful for visualization and flow processing, not 
just simulation (Sec. 5). 


There are a number of theoretical questions which deserve further 
study. Can we develop methods which are guaranteed to find a 
global minimum of Eq. (7)? While we can’t expect to find exact 
Clebsch maps for generic flow fields (Sec. 4.3) it would be highly 


desirable to derive approximation bounds. How well can we do? The 
role of fi also deserves further scrutiny. How does the approximation 
quality depend on this parameter? For applications which require 
only a vorticity Clebsch map, is there a simpler algorithm? 


At present a signficant obstacle to practical deployment is the com- 
putational cost of minimizing Eq. (7). So far we have used only a 
semi-implicit gradient descent approach. The biggest challenge is 
the efficient solution of (screened) Poisson problems involving the 
connection Laplacian. In our experience these systems can be very 
ill conditioned (and more so as the grid is refined). An effective 
preconditioning strategy, which we wish to develop in future work, 
appears essential to make the method practical. 


We are most excited about developing the idea of flow processing 
further. Manipulating the flow field through manipulation of its 
Clebsch representation as we suggested in Alg. 2 and App. E, ensures 
important properties such as conservation of the amount of vorticity 
in a given simulation cell. It also ensures alignment of the finer level 
structures with the coarser vorticity [Pfaff et al. 2010], something 
often lost in purely local manipulations. 
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A VELOCITY FROM VORTICITY CLEBSCH MAPS 


Here we prove a simplified version of Thm. 3.1, i.e, we show that a 
spherical Clebsch map s for w can be lifted to a Clebsch map y for a 
velocity field 7 with dy = w, under the additional assumption that 
M is star-shaped with respect to the origin. Note that the theorem 
remains valid for multiply connected domains, but the proof becomes 
more elaborate [Chern 2017]. 


Choose any yo € S? with Woivo = s(0). Define s,: [0,1] > S* by 
s-(t) = s(tr) for r € M. Similarly, for r € M define y;: [0,1] > S? 
as the solution of the initial value problem 


(0) = Yo. 


oi = -ty, Ss, and 
Then 
£(Writy) = Gs, and — (rife) (0) = 5; (0) 


and therefore Writ = sr. Now we can define ¢: M — S? by g(r) := 
(1). We have already achieved $id = s. Defining 4 := h(dd, if) we 
get dy = dy. Since M is simply connected we therefore can find a 
function 0: M — R (unique up to an additive constant) such that 
dO = n —f. Then w := e!"¢ is the Clebsch map for n that we were 
looking for. 


B ORTHOGONAL SPLITTING OF V7°y~ 


Since S° can be represented through unit quaternions we will avail 
ourselves of quaternionic algebra in this section to derive the or- 
thogonal splitting of V7 y. 


Using the definition V7” = d —i% ;~ and Eq. (8) of [Chern et al. 2016] 
we see that 

—iV y = £( — No) — 5s 
and thus 

Vy = Ly no)iy — Sipids, (9) 
where we used Wy = |p|? = 1. Since the first summand is a real 
multiple of iy it takes values in Cy = Span{y, iv}. That the second 
summand takes values in (Cy/)+ = Cj follows from 


(, ids) = Re(Wipds) = Re(sds) = —5d|s|* = 
(iw, ids) = Re(ds) = 0. 
Here we used (a, b) = Re(ab). 


To summarize, Eq. (9) is the orthogonal splitting of V7’ onto the 
subspaces Cy and Cj. 


C IMPLEMENTATION DETAILS FOR E* 


In this section we give all the necessary details for the numerical 
implementation of Alg. 1. Above we used unit quaternions to 
realize S°. For the derivations in this section it turns out to be 
convenient to realize quaternions y € H as elements (1, W2)™ € C* 
through the relation = yy + Yj. For numerical libraries which 
support a complex data type the final expressions translate directly. 
If only real data types are supported, pairs of complex numbers 
become elements of R* and a complex entry x + iy = z € C in the 
matrix is realized as a real 2 x 2 matrix @ Bs ). 
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Discretization. For the energy, Eq. (7), we use a standard discrete 
exterior calculus (DEC) [Crane et al. 2013; Desbrun et al. 2008] 
discretization. It requires a graph G = (V,&) where ‘V is the set of 
vertices and & the set of directed edges. All examples in this paper 
take G as the 1-skeleton of a 2D or 3D lattice. Each vertex i € V 
has a vertex weight w; > 0 and each edge ij € & an edge weight 
wij > 0. Functions (0-forms) are discretized as values per vertex, 
e.g., Wj = W(r;) while 1-forms & are discretized as values per directed 


edge gj = ij E. 


The vertex weights and edge weights are designed to give the 
approximation 


[uh = > wl, jut Swill? 
ieV 


ied 
On a regular 3D lattice wj = V; and wij = a where V; is the volume 


of the dual cell surrounding rj, Ajj the area of the face dual to i j, and 
fj the length of the edge rj — rj. 


Covariant Derivative. Given a C*-valued function y = (1, 2)T on 
vertices and a R-valued 1-form no on edges, the C2-valued 1-form 
V7 is given [WeiSmann et al. 2014, Eq. 10] by the difference 


(no)ij , oi 
(V%y)j =e 7m Y—e Mm yy. 


Projection Operators. Using (¥1, W2)™ as our representation the pro- 
jectors Pey resp. Pcjy_ can be expressed in terms of Pauli matrices 
oj; € C2x2 


_ {0 1 _ {0 -i _ {1 0 
ae lap 0}? a aaa 0 |? a3 = Ve 4" 


Even though the projectors appear to be functions of y, they are in 
fact only functions of s = Wi = syi + soj + s3k where 


= Wil’ —|W2l’, so = 2Re(i~iy2), 53 = 2Re(Wiya). 
The coefficients are given by the Pauli spin vector z = (21, Z2, z3)™ € 
R°? with 


as 
S1 = 23, S2=—£2, $3 = 41. 


To find the C*** representation of Poy, we note that the underlying 
subspace is spanned by complex multiples of (¥, w2)™ and hence 


= — A p 
Poy, = ae LW ye ~ ey ee 


5 (I + Z101 + Z202 + 2303), 


where I is the C?** identity matrix. Due to their orthogonality, Pe ith 
is just the residual 


Poy, =[- Poy, = 5 (I = £10] — 2907 = Z303). 
In the implementation it is convenient to combine both projectors 
P* := Poy + VePojy 
and rewrite the integrand of E* as 


IV vle = |PE(VPy)I’. 


Inside Fluids: Clebsch Maps for Visualization and Processing « 


Since Vy is a 1-form on edges, we need Pr for sij = fate] the 
edge midpoint value of s. 


The Discrete Energy. With these definitions we arrive at the discrete 
energy in terms of the vector w = (V)icy 


ES(p) = >) wi [PS ((v™w);:)| = pT Ly. 
Se ij 
ijeé 
Where the quadratic form L is a |V| x |V| sparse matrix of 2 x 2 
complex blocks. Each edge ij € & gives rises to four blocks which 
are accumulated into the global L, summing over all edges 


_,4 ih 
Zz 
ii) 





Lij = = wi(P5)° Lij = Wife 


poy 





Lii = —wije iy Lij = 





wii(Pa)”. 


Minimization. To minimize the discrete energy for a given €, we 
use a semi-implicit gradient descent step followed by pointwise 
normalization of w. Each gradient step is the solution yk +l) to 


(May + AtL™) pt) = Myy™), 


with Mv the vertex mass matrix and step size At > 0. We typically 
take 20 steps with 0.1 < At < 1 before decreasing e€ (Alg. 1). 


D HELICITY 


The helicity of a divergence-free vector field was first introduced 
in [Woltjer 1958] and received its name in [Moffatt 1969]. The result 
that helicity vanishes for fields coming from a classical Clebsch 
map is due to [Bretherton 1970]. Here is a proof in the language of 
differential forms. 


As in section Sec. 4.4 we consider an exact 2-form w = dy on R° 
which vanishes outside of a large closed ball M. In terms of 7 and w 
the helicity is given by (cf. Eq. (8)) 


Hlo)= [aro 


Suppose w comes from a classical Clebsch map, ie., 7 = Adu — dé, 
w@ = ad A du then 


Hw) = | ap A di.n du = f -a(p dh.» dy) = f —pw = 0, 
M M OM 
proving Thm. 4.1. 


The result that helicity is quantized for fields coming from S*-valued 
Clebsch maps is due to [Kuznetsov and Mikhailov 1980]. They 
assume fields that tend to zero at infinity, instead we want a version 
that also works in a bounded region. Additionally they assume that 
the Clebsch map itself has a limit at infinity. We now turn to the 
proof of Thm. 4.2 which does not need such assumptions. 


Since the pullback w of the area form of S* vanishes on 0M, the 
derivative of the map 


$= slay: OM S? 
does not have full rank anywhere. Sard’s Lemma [Milnor 1965, §2] 


then asserts that there is a point p € S* that is not in the image 
of §. Therefore we can smoothly homotope § to a constant map 


142:11 


from OM to S*. Thus s: M — S? can be extended to a continuous 
map §: R® U {co} > S? and the rest of the argument is the same as 
in [Kuznetsov and Mikhailov 1980]. 


E RATIONAL MAPS FOR FLOW PROCESSING 


Any map &: S* — S* can be written as a function on the extended 
complex plane by acta as S* = CU {co} through stereographic 
projections rR ¢ = —2% + i and its inverse s = (1+ Io vee 


| i= =< 
Ic, 2ReC, ~2Re(iz)). Similarly, each map =: S* — S? is a map 


B:C* > C’, BL, 02) = (Q1%1, 62), Qa(e1, f2)) where 1,02 € C. 
Thus designing a sphere map for Sec. 5.3 amounts to finding functions 
& on the extended complex plane and E on C’ so that 70 E = fog. 








For our purpose good candidates for & are the rational functions 
E(C) = Py(©)(P2(f))! where P1, Pz are polynomials without com- 
mon divisor. Rational functions give all the conformal maps S* —> S?. 
The mapping degree mz is given by max{deg(P}), deg(P2)}. For each 
rational function é a function = = (Q1, Q2) with 7o& = oz is easily 
found with both Q;, Q2: C* — C homogeneous polynomials: &(¢) = 
Ox(Z, 1)(Qo(Z, 1))-1. Simple examples are scaling, Emopius(2) = a, 
corresponding to Emobius(21,02) = (ali, 2), 
and squaring ésq(¢) = ¢* with Bsq(Q1, 62) = 
(Cf . C. pe Belyi gave a gallery of functions that 
have particular discrete symmetries. For in- 
stance, 





Ftetra(f1, 62) = ee * be] 
etra > 2V2e4 — 80,03 
gives tetra that maps the four vertices of a tetrahedron on S* to the 


same point [Magot and Zvonkin 2000]. The inset figure shows the 
result of applying étetrg to a world map on Sf 


To randomize the choice of the scrambling map with rotational 
symmetry, given a desired mapping degree m, one can choose 


= = (Q1, Q2) with 


Qj(%1; ¢2) = 3 an ae mk 


where Zj, € C are ie ade normally distributed random 
numbers for j = 1,2 andk = 0...m [Woit 2017, Sec. 8.2]. 
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ABSTRACT 


The main objective of this research was to investigate the lift and drag characteristics of a 
stepped airfoil with backward facing steps; apply active flow control technique to enhance the 
aerodynamic performance of stepped airfoils and examine the possibility of using such airfoils on 
Unmanned Aerial Vehicles (UAV’s). A step was introduced at mid-chord, with a depth of 50% of 
the airfoil thickness at mid-chord position extending till the trailing edge of a NACA 4415 airfoil. 
Computational studies were conducted with the use of passive flow control constituting the 
activation of step and active flow control with the use of air injecting jets placed in the step cavity 
of the NACA 4415 airfoil with a goal of enhancing the aerodynamic performance. The jet angle 
and jet momentum coefficient were varied independently to identify the best setting for 
optimizing the aerodynamic performance of the stepped airfoil. Experimental studies of a scaled 
wing model with the same airfoil were conducted in a wind tunnel for a range of Reynolds 
numbers to validate some of the numerical results obtained for the cases of base and stepped 
airfoils. The results produced show that as much as 37% increase in C; and as much as 12 % 
increase in L/D ratios over conventional airfoil values could be obtained using stepped airfoils 
and further enhancement could be made with the employment of jets placed in the step cavities. 

The case study conducted as a part of this research focuses on the UAV RQ-2 Pioneer 
employing a stepped airfoil configuration by comparing its aerodynamic characteristics with the 
conventional NACA 4415 airfoil originally used on this aircraft. The primary objective of the 
case study was to identify and outline a step schedule for the flight envelope of the UAV Pioneer 
using a stepped airfoil configuration while applying active flow control to obtain enhanced 
aerodynamic performance over conventional NACA 4415 airfoil originally used and hence 


improve the flight performance characteristics like Range and Endurance of the aircraft. 
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distance from airfoil LE to lower surface step LE 
dimensionless upper surface step location, x,/c 
dimensionless lower surface step location, xj/c 
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Cs 
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1. INTRODUCTION 


Since the advent of successful aviation in commercial, defense and experimental research 
sectors, people have been constantly researching on improving the quality of flow over the 
aircraft so as to enhance the overall performance in terms of lift, drag and stability characteristics. 
Flow control has been and still is one of the most promising areas in aerodynamic research and 
will continue to be a prospective research topic as there will always be an upper limit to 
efficiency. Flow over submerged bodies such as an aircraft*” and or a submarine can be worked 
upon to delay boundary layer transition, postpone separation, increase lift, reduce skin-friction 
and pressure drag, augment turbulence, enhance heat transfer, or suppress noise. Preventing 
separation is highly critical as it is desirable for lift enhancement, stall delay, improved pressure 
recovery, and reduction in the form drag. Future applications in the field of aeronautics include 
providing structurally efficient alternatives to flaps or slats; cruise application on conventional 
takeoff and landing aircraft including boundary layer control on thick span-loader wings; as well 
as increased leading edge thrust, and enhanced fuselage and upper surface lift with most of the 
new developments to be made pertain to the employment of various flow control techniques. 
Thus far, extensive research was done in passive flow control that involves control of flow sans 
use of external energy. Active flow which involves energy expenditure has been a hot topic of 
research in the recent decades and promises to be good for the future developments to come in the 
area of flow control. This research focuses on using both these techniques to enhance the 
aerodynamic performance of airfoils. 

A promising passive flow control technique which is primarily dealt with in this study 
might be to modify the geometry of a conventional airfoil by introducing backward facing steps, 
which forms a relatively new family of airfoil designs popularly known as “KF (Kline-Fogleman) 


airfoils”. Introduction of a step could be greatly effective in changing the overall flow quality in 


the step cavity, which may not be quite obvious to intuition. The flow transformation could be 
understood as due to rotation of the flow promoted by the introduction of the step along a length 
of an airfoil edge which otherwise had been continuous. Stepped airfoils use the concept of 
trapped vortex cavities and consequently there is the trapped vortex flow control which deals with 
the various flow alteration techniques to enhance the flow field characteristics. The flow which 
otherwise had been separated is forced to reattach by an intense system of vortices trapped in the 
cavity. Figure 1-1 compares the flow over a conventional with that over a modified airfoil with a 
step. Flow rotation induced near the step face generates a primary vortex as shown in Figure 1-2. 
A secondary vortex forms in the opposite direction after the flow reattaches itself to the boundary. 
Thus an airfoil with a step traps vortices in the cavity which primarily are the noticeable as well 
as distinguishing flow features as compared with the flow over conventional airfoils. The 
formation of these vortices alter the flow field characteristics thereby altering the lift, drag or 
pitching moment characteristics depending on whether the step is introduced on the upper or 
lower edge of the airfoil. The advantage of using the passive flow control technique of trapped 
vortex cavities is that the existence of the cavity on an airfoil / wing surface naturally aids the 
formation of vortices inducing reattachment of flow thus requiring no additional energy 
expenditure. An intelligently engineered design of the cavity could very well help get the best out 
of the flow field thereby eliminating the need for use of energy to achieve the desired flow 
quality. 

In addition to passive flow control techniques, active flow control could possibly be used 
for obtaining positive results. Active flow control involves usage of energy to alter the flow field 
characteristics, meaning the use of either blowing or suction to control the flow near the boundary 
as to prevent flow separation. Flow control through blowing could be accomplished using air 
injecting jets or plasma actuators mainly to generate a directed stream of fluid to accelerate the 


slow moving air near the solid boundary thus delaying separation. Also, suction created using a 





Figure 1-1: Comparison of flow over conventional airfoil and stepped airfoil. 
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Figure 1-2: Typical flow field around an airfoil with steps on upper and lower edges. 


pump aimed at sucking the slow and retarded fluid near the wall thus helping in forming a fresh 


boundary layer which could be used for avoiding or delaying flow separation. Blowing / Injection 


flow control technique was used as a part of this investigation. 


11. LITERATURE REVIEW 


Richard L.Kline and Floyd F.Fogleman introduced a breakthrough concept in airfoil 
design with their stepped airfoils developed in early 1960s. This heralded a new chapter in the 
aerodynamics with the break-through design of high performance airfoils with extended stalling 
capabilities and improved lift and drag characteristics. The Ultimate Paper Airplane’ by Richard 
L.Kline & F.Fogleman (1985, Simon & Schuster) describes that the object of this design was to 
develop an improved airfoil with enhanced lift, drag and stability characteristics and adaptability 
over a wide range of speeds, achieved by the generation of vortical flow that alters the flow field 
resulting in favorable effects. Several other articles published later had also stated that Kline- 
Fogleman (KF) airfoils were capable of combining the best features of conventional airfoils 1.e., 
better lift with thick ones and higher speeds with thinner ones and further that they worked 
extremely well for achieving higher lift as well as forward speed. These statements are supported 
by the world record that Dick Kline still holds for the farthest flying paper plane equipped with 
stepped airfoils. In addition to the above mentioned claims, in an email that Kline sent to an RC 
modeling group on issues concerning KF airfoils, he highlighted the advantages of KF airfoils. 
Most noteworthy ones being the capability of KF airfoils to handle a wide range of speeds; the 
much greater range for its center of gravity which its case could be moved as much as 40% chord 
location from the leading edge thus allowing it to carry a heavier load; better air penetration based 
on the flight experiences of model planes built with KF airfoils; high strength to weight ratio; 
great stability and control. 

Some of the earliest citations of airfoils with vortex trapping cavities were made in a 
paper by Ringleb F.O.° (1961). W.A.Kasper’ claimed the first successful use of a trapped vortex 
in a flight experiment in the seventies using a concept so-called as Kasper wing. Experimental 
studies undertaken by Kruppa’' (1977) verified that the Kasper-wing produced vortex shedding as 


against a steady trapped vortex thus resulting in lift enhancement. Some promising results were 


obtained by researchers of Saab-Scania (1974) using a wing with a vortex cavity as reported by 
Kruppa in his paper. 

Through the entire period of this research effort at Missouri University of Science and 
Technology, the biggest question in the minds of researchers was, “Why have the KF airfoils not 
been used for commercial applications till date even several decades after they were first 
developed by Kline and Fogleman?” The answers might be found through a set of developments 
that have taken place over a period of time. According to three independent scientific studies 
conducted in 1979 at the NASA’s Langley Research Center in Virginia, it was concluded that the 
L/D ratios of wing with KF airfoils weren’t encouraging, thus further studies on the wing’s 
resistance to stalling became a low priority. Around the same period of time, Max Davis, of the 
Air Force Flight Dynamics Lab at Wright-Patterson Air Force Base in Dayton, OH, through his 
preliminary study stated the Kline-Fogleman wing was not suitable for a full-size aircraft because 
it had too much drag and not enough lift. In addition to these contradictory theories, the fact that 
KF airfoils are limited to use in radio-controlled model planes might be one of the reasons why 
KF airfoils have not been quite a thriving topic of research or their application was not extended 
to the commercial/defense aviation sectors of aeronautics. One of the objectives of this study to 
reinforce by the way of sound technical results obtained through both numerical and experimental 
research and to encompass the application of KF airfoils to a UAV by conducting a case study. 

Shifting to the review of studies conducted on stepped airfoils, it was in 1994 that the 
benefits of KF airfoils were scientifically proven through experiments and flight testing by 
Demeter G.Fertis” when he compared the results obtained for the stepped airfoil with those for 
conventional NACA 23012 airfoil and confirmed that the airfoils developed by Kline and 
Fogleman were potential designs to obtain better lift characteristics over a broad range of angles 
of attack, improve or eliminate stall at all possible operational airspeeds, increase lift to drag 


ratios over a wider range of operational angles of attack and be adaptable for both fixed and 


rotary wing aircraft. This set the direction for a new domain in flow control research, which till 
then involved focusing on use of flow control techniques on conventional airfoils. 

Aerodynamic studies on stepped airfoils were conducted by Stephen Witherspoon’ and 
Fathi Finaish (1996) for different configurations defined by the step lengths, depths, and the 
location of steps on airfoil chord. Experimental tests and numerical investigation with steps on 
NACA 0012 and 23012 airfoils showed that higher lift coefficients were obtained with lower 
surface step located at half-chord, extending till the trailing edge at all angles of attack ranging 
from 0 to 10 deg. Further, upper surface steps located at half-chord and extending till 62.5% 
chord generated higher L/D ratios when compared with unmodified NACA 0012 airfoil at 
incidence(s) around 10deg. 

BAE SYSTEMS“, through their “AEROMEMS” research program (2000) concluded 
that MEMS based fully developed flow control system could result in a timeframe of 10-15 years. 
Research and Development work was intensified with the launch of “AEROMEMS II” program 
that focused on extensive wind-tunnel and flight testing of MEMS applications for flow control 
on a commercial scale. Quality research was also conducted with the initiatives taken at the Air 
Force Research Laboratory’s Air Vehicle Directorate (AFRL/VA), NASA, DARPA apart from 
the support rendered by technical organizations such as AIAA through The Fluid Dynamics 
Technical Committee (FDTC) and ASME through The Fluid Dynamics Technical Committee 
(FDTC). 

Research at UCLA/Caltech by Ho™ and Tai, 2001 involved flight testing of a smart skin 
attached to an UAV, integrated with several shear stress sensors and balloon actuators distributed 
over the skin. It was aimed at controlling the pitching, rolling and yawing moments by controlling 
the position of leading-edge vortices, achieved by micro-actuators coupled with a delta wing 
boundary layer. T. Crittenden'’, A. Glezer, E. Birdsell and M. Allen used MEMS-based sensing 


devices and pulse jets integrated into the flow boundary to achieve aerodynamic control. 


W.W.H. Yeung’ (2006) conducted flow visualization studies of corrugated airfoils using 
steps to compare them with conventional Joukowsky airfoil incorporating a backward facing step 
based on conformal mapping calculations. The results were in favor of the corrugated airfoils 
confirming that stepped airfoils produced better lift characteristics due to the formation of 
vortices. Triple corrugated airfoils produced as much as 10 % more than their conventional 
counterparts for the same camber, thickness and angle of attack. 

Fabrizio De Gregorio and Giuseppe Fraioli'’ (2008) conducted an experimental study of 
flow control using a trapped vortex cavity on a high thickness airfoil with an objective to apply 
the results obtained to blended wing designs. Data from PIV measurements showed that passive 
Trapped Vortex Control (TVC) flow control is neither an effective separation control mechanism 
nor capable of confining the vortex, and controlling the vortex shedding. On the other hand, 
active flow control is capable of controlling flow separation. 

Masoud Boroomand and Shirzad Hosseinverdi’’ (2009) numerically investigated the 
turbulent flow around a NACA 2412 airfoil with backward facing steps at high Reynolds number 
with the objective of enhancing the aerodynamic performance by trapped vortex lift 
augmentation. All the results obtained through their study were in total agreement with those 
obtained by Stephen Witherspoon® and Fathi Finaish (1996) through their study of NACA 0012 
airfoil with a step. Conclusions from their study conducted in 2009 include increase in drag for all 
stepped airfoil configurations; increase in lift coefficients and lift to drag ratios at some angles of 
attack for upper step configurations; positive effect shown by lower step configurations on 


delaying the stalling angle. 


1.2. SCOPE OF THE CURRENT STUDY 


Vortical flow dominant with swirling vortices resulting from the modification of airfoil 


geometry forms the basis for the desired flow alteration achievable, which is the prime advantage 


of using stepped airfoils while studying them with a goal of enhancement of aerodynamic 
performance. The resulting flow field around a stepped airfoil has improved lift and/ or reduced 
drag and produced better stall characteristics, depending on the airfoil configuration and whether 
or not other flow control techniques are used in conjunction. The study here was aimed at 
identifying the best setting of steps in the airfoil of choice and making the effective use of 
vortices resulting from stepped configurations at various flight conditions so as to enhance the 
aerodynamic performance of airfoil all of which to be achieved using both active and passive 
flow control techniques discussed in the Introduction Section of this thesis. Numerical studies 
involved both active and passive flow control techniques while the experimental studies involved 
the use of only the passive flow control technique. In other words, numerical results were 
obtained for flow over modified airfoil configurations at different flight conditions to find out 
whether the usage of step, 1.e. whether passive flow control using step produced better results 
than the conventional NACA airfoil and if it did, the next goal inline was to identify the best 
configuration while exploring the possibility of use of active flow control on the stepped airfoil 
with the use of jet placed in the step cavity, 1.e. the application of passive and active flow control 
using a jet placed in the step cavity. Computational Studies were conducted by placing a jet at 
two different locations to identify the setting that produces the best results with regard to the 
aerodynamic characteristics of the stepped airfoil configuration studied. Wind tunnel testing was 
conducted to obtain experimental data for the purpose of validating the results obtained from the 


numerical studies. 


1.3. FORMULATION OF THE FLOW PROBLEM 
The flow problem was set up as an airfoil fixed at a certain angle of attack with air 
flowing over it at a given free stream velocity as illustrated in Figure 1-2. The objective was to 


enhance the aerodynamic performance of the airfoil using steps and identify the most favorable 


step configuration for a given flight condition. The configuration of a step is defined by its 
location on the airfoil chord, its length and depth expressed in terms of airfoil chord length. The 
employment of step alone to control the flow field around an airfoil comprises the passive flow 
control technique used in this study. Figure 1-2 shows the typical flow developments on a 
modified airfoil with backward facing steps on both the upper and lower surfaces. A jet placed at 
either of the two locations as mentioned before was used in conjunction with step on the modified 


airfoil. 


14. PARAMETER RANGE 


There are several parameters which can affect the flow field around an airfoil or a wing in 
flight. Both the airfoil and step profiles constitute the geometric parameters. The most important 
ones are the shape of airfoil (symmetric or cambered); chord length; angle of attack; chord-wise 
location of the step; its length and depth on the upper and /or lower surface. Figure 1-3 shows a 
sketch of a sample stepped airfoil configuration defining the main geometric parameters. The 
flow parameters are the free stream velocity, density and viscosity grouped together as the 
Reynolds number (Re). Re ranges from 0.6 million to 2.5 million for this flow problem while a 
was varied from 0 to 10 degrees. Based on the results obtained by Stephen Witherspoon and Fathi 


Finaish (1996), a limited range of parameters were chosen. The airfoil configuration of prime 


Freestream c 
——— “sae. 


=i 
SS 


Figure 1-3: Characterization of geometric parameters associated with stepped airfoil studied. 
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interest was one with the step located on the upper edge at X; =0.5 with L; =0.5 and D, =0.5. 
NACA 0012 airfoil tested with this step configuration produced higher lift coefficients than the 
unmodified airfoil for a range of angles of attack varying from 0 to 10 degrees. Thus cases 
studied were limited to those considering NACA 4415 airfoil as the base airfoil and those with 
the modified airfoil with a lower surface step located at mid-chord position. The table in the next 
few sections lists all the major parameters corresponding to each simulation for a set of cases for 
preliminary study. A jet was used to possibly improve the flow field characteristics and achieve 
an enhancement in the aerodynamic performance. The jet was characterized by the jet momentum 
coefficient, Cu and the jet angle, ¢. The Cu is the ratio of the product of mass flow rate of the 


flow through the jet and the jet velocity to the free-stream dynamic pressure as defined below: 


Cu = (mVj)/ qs (1) 


The Cu was varied from 0.00027 to 0.01731 while the jet angle was varied in increments of 15 


degrees from 0° to 45°. 
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2. METHODS OF INVESTIGATION 


Studies involving investigation of stepped airfoils with/without using jets were carried 
out using both computational and experimental approaches to examine the associated flow field 
developments and the resulting aerodynamic forces. The numerical studies were conducted using 
a commercially available computational fluid dynamics package to simulate flow around the 
airfoils. The latter part of the research was conducted by employing a wind tunnel for running 
tests on the airfoils studied numerically to generate some quantitative and qualitative data for 
validating the numerical results. This approach required building a test wing model using a 
conventional NACA airfoil and redesigning it to obtain the modified airfoil configurations to be 
tested. Force data including lift and drag were recorded during the tests. Experimental studies 
were conducted to complement the results obtained by the computational studies. Besides this, 
they were sought to provide information for a better understanding of the complex flow physics 


involved with the stepped airfoils. 


2.1. COMPUTATIONAL APPROACH 

2.1.1. Governing Equations. The three fundamental principles governing the fluid 
flow characteristics are the conservation of mass, conservation of momentum, and conservation 
of energy. As there is no heat addition or rejection involved in this investigation, the conservation 
of energy principle is not discussed in this section. The conservation of mass principle applied to 


a fluid element in a fixed control surface is given by the following equation 
4-(pV) =0 
at (2.1) 


where V is the gradient operator defined as 
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v= {ictictic} 2.2) 


The momentum equation is obtained by applying the Newton’s second law of motion to a fluid 


element passing through a fixed control surface. It is expressed as below. 


(pV) 
at 


The first term in the above equation defines the rate of change of momentum in the control 
surface; the second term defines the rate of loss of momentum by convection through the control 


surface. The first term on the right-hand side of (2.3) represents the body force while the second 


term represents the surface forces resulting from the normal and shear stresses defined as 


Ijj = —Pdij + Tij 


(2.4) 
ty = w| (Su + Se) — Foy Se i,j,k =1,2,3 (2.5) 
where 6;; is the Kronecker Delta function (6;, = 1ifi=J);( 6, =OUft #7). 
u,; and xX; represent the three components of the velocity vector V and the position vector 
respectively. Applying (2.4) and (2.5) to (2.3), it can be expressed as below. 
p— = pf — VP +2 |u (S422) — 2 5yu oe (2.6) 


The Navier—Stokes equations discussed in the momentum equation above are complex partial 
differential equations which by themselves are difficult to solve without making any sort of 
approximations. Thus in modeling a turbulent flow, time-averaged flow variables are used in the 
form of the Reynolds Averaged Navier-Stokes (RANS) equations to obtain approximate 
solutions to flow problems. To obtain the RANS equations the flow variables in the conservation 


equations are split into the mean (constant) and fluctuating (time-varying) component and the 


entire equation is averaged with respect to time. In the turbulence-modeling of a fluid flow, the 
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flow variables of velocity, pressure, and temperature can be expressed as 


P=P+P'’ T=T+T' (2.7) 
where u and v represent the x and y components of the velocity vector respectively. The 
continuity equation can now be modified as 

Op — 
a, +1 V pV = 90 (2.8) 


and the modified momentum equation can be expressed as 


p— = —-VP+— ale (FE a) — puiu purus (2.9) 


Ox; 


An additional transport equation is required to solve the RANS equations discussed above. The 


transport equation used in this investigation is the Spalart-Allmaras model defined below. 





2 
O(pv ) i 
oO ME = G+ Aix (u+ pv) or} Coop (= “| -14 Sp (2.10) 
This one-equation model solves for the kinematic eddy or turbulent viscosity. G,, represents the 
production of turbulent viscosity; Y, represents the destruction of turbulent viscosity that occurs 
due to wall blocking and viscous damping; v is the kinematic viscosity; V is a variable identical to 


the turbulent kinematic viscosity accept in the near-wall region which is affected by viscosity. 


S>5 is the user-defined source term while oj and Cj, are constants. 


2.1.2. Tools Used. All of the computational work was done using commercial 
software packages. All the Pre-processing comprising mesh generation, quality control and 
setting up of the boundary conditions was done using GAMBIT. “GAMBIT 2.4” was used for 


generating meshes for all the airfoil configurations. The meshes were then imported into ANSYS 
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12 - “FLUENT”, a finite volume method based CFD tool which solves the governing equations of 
conservation of mass and momentum was the flow solver used for the entire numerical analysis. 


Post processing including generation of plots and figures was done using FLUENT and Tecplot. 


2.1.3. Computational Details. The problem was set up as a two dimensional flow 
problem in single phase. The Cartesian co-ordinate axes were fixed at the leading edge of the 
airfoil in each case. The computational domain comprises of four blocks for all base airfoil 
configurations, while the single step configurations had eight mesh blocks. The blocks 
encompassed a region of length three chords above and below the X-axis which coincides with 
the chord line of the airfoil, 2.5 chords ahead of the leading edge and 4 chords behind the trailing 
edge as shown in Figure 2-1. For the most basic mesh or the coarsest mesh with the base airfoil, 
the grid density within the blocks was 45 cells above and below the X-axis normal to the airfoil 
surface distributed in a successive ratio of 1.15, concentrated near the airfoil surface. In the 
chord-wise direction, the flow domain was meshed with 60 nodes concentrated near the airfoil 
trailing edge. The mesh becomes coarser towards the end of the flow domain to take advantage of 
the property of fully developed flow farther away from the airfoil. The upper and lower edges of 
the airfoil are split to distribute the mesh points resulting in a mesh of better quality. Both the 
edges of the airfoil meshed with 76 nodes. Figure 2-2 shows the grid distribution around a NACA 
4415 airfoil with a step. 

The flow was modeled as steady, incompressible in the flow domain comprising several 
blocks. The fluid chosen was air at standard atmospheric conditions which were set as the 
reference values along with the free-stream velocity set corresponding to the flow Re for each 
simulation run for the preliminary cases of study. The inlet was set as “Velocity Inlet” which 
includes all the front, top and bottom edges of the flow domain. The x and y-velocities are “u’’, 


which is defined by the Reynolds number used and “v”’ that equals 0 m/s. “Spalart-Allmaras” one 
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Figure 2-1: Block structure of the flow domain over an airfoil with a step on the lower surface. 


equation turbulence model was used in FLUENT for all computations. This computational model 
is well suited for modeling wall driven flows which transpire most commonly over aerodynamic 
bodies. 


All the exit boundaries combined were set as “Pressure Outlet”. All the airfoil edges 





together were set as “Wall” with no-slip condition. The convergence criteria was set at le-06 for 
residuals, continuity, x and y velocities uniformly. The flow solver used the SIMPLE algorithm 
for pressure-velocity coupling. Least Squares Cell Based method was used for gradient 
computations. Pressure was set as second order, Momentum, and Modified Turbulent Viscosity 


were set as Second Order Upwind for Spatial Discretization computations. 
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Convergence Study. Since the grid densities normal to parallel to the 
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flow direction and fineness of the mesh near the airfoil surface can affect the accuracy of the 


did not deviate much as the 


solution, the effect of these parameters was studied so that results 


mesh size was increased. The mesh size in terms of number of nodes was increased in “‘n”’ 


multiples and the simulations were run for each case till the converged results were close to those 


obtained with previous mesh size. This way study was done till the grid convergence was reached 


and the best mesh configuration was identified beyond which the results do not vary with the grid 


density of the mesh points. Also during the study the results obtained in each case were compared 


with the experimental data available to identify the mesh generating the most accurate results 


bearing in mind the grid quality in each case. Figure 2-2 illustrates the mesh around a NACA 
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4415 airfoil with a step. As shown by the figure, the grid is much finer near the edge of the airfoil 
and grid points are concentrated in the step cavity to obtain more accurate results as well as better 
resolution of the flow field near the airfoil edge and in the step cavity. The final refined meshes 
around the base airfoil had around 95,000 grid points, while the meshes for stepped airfoil cases 
had about 120,000 grid points. The other meshes for cases of jet placed in the step cavity around 


150,000 points each. Figure 2-3 shows the grid convergence for the lift and drag coefficients. 
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Figure 2-3: Illustration of Grid Convergence for the lift and drag coefficients. 


2.1.5. Preliminary Results. Some initial numerical results obtained were compared 
with the experimental data available in Reference 10 in order to evaluate the validity of the results 
as tabulated in Table 2-1. As the Reynolds numbers used were of the order of millions, all the 
computations were performed considering the flow to be turbulent. The initial results generated 
especially in the case of lift computations for the Reynolds number of three million were in 
descent agreement with the experimental data. Figure 2-4 compares experimental data available 
with the preliminary numerical results for the cases studied. The error lies within 5-10 % as 


compared with the experimental values of lift, while the corresponding error for drag values is 
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Table 2-1: Summary of NACA 4415 airfoil configurations studied for validation. 


Case No Airfoil Configuration poke U,, [m/s] a. [deg. | 


3e6 | 4361 | 0 
| Base | 306 | or | 8 
| _airfoil with step | 3e6 | 4361 | 0 
| _airfoil with step | 3e6 | 4361 | 8 
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Figure 2-4: Comparison of numerical results with experimental data. (a) Lift; (b) Drag 





higher lying between 23-65 %. As there is high error in the drag results obtained, the author 
would like to caution researchers that the accuracy of the quantitative results presented in this 
thesis may not be highly reliable. On the other hand if the error in the drag results were to be 
much less of the order of the error in the lift data, then the values of lift to drag ratio presented in 
this thesis would be much higher which is beneficial. Use of the results for future studies may be 


made while being conscious about these observations. Preliminary results were obtained for 
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NACA 4415 airfoil at three different angles of attack (0°, 5°, and 10°). Also results were also 





obtained for modified airfoil with step at these a(s) to examine whether use of step yielded better 
aerodynamic performance measured in terms of the L/D ratio, the higher the better. As expected, 
L/D ratios for the stepped airfoils were higher than those for the base airfoils. All the six cases 


mentioned above constitute the ones studied for validating the computational results. 


2.2. EXPERIMENTAL APPROACH 


The second part of this research involved experimental investigation of stepped airfoils to 
validate some of the results obtained with numerical studies conducted as a part of this research 
effort. All the experimental studies were conducted in the Aerospace Flow Laboratory located in 
the Department of Mechanical and Aerospace Engineering [MAE] at the Missouri University of 
Science and Technology. The flow lab at Missouri S & T houses the wind tunnels used for 
aerodynamic testing along with the test model fabrication facilities. This section presents the 
approach followed for the wind tunnel testing of the modified NACA 4415 airfoil with lower 


surface step. 


2.2.1. Wind Tunnel Facility. The Aerospace Flow Laboratory at Missouri S & T 
houses a subsonic wind tunnel which was used for all the testing done as a part of the current 
research to measure the airfoil force data This facility has an open return type wind tunnel. The 
tunnel is equipped with three interchangeable test sections that can easily be installed downstream 
of a convergent section. A computer station equipped with an experimental data acquisition 
system is available for recording data. The aerodynamic forces on the test models were measured 
for a range of tunnel speeds corresponding to Re’s ranging from 0.4 million to 0.8 million on the 
model that was tested based on the conditions of pressure and temperature in the wind tunnel that 


were measured during the tests. The tunnel consists of a bell-mouth inlet, a honeycomb structure, 
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a settling chamber, and a 6:25:1 fiberglass/foam lamination contraction. Flow turbulence is 
controlled by a combination of three anti-turbulence screens and the honeycomb panel installed in 
the tunnel settling chamber. Figure 2-5 depicts a sketch of the wind tunnel and the flow 
visualization system used. These sections are followed by an 18" x 18" x 60" Plexiglas test 
section, and a diffusor. The tunnel, driven by a 20 hp DC motor that provides feedback control for 
a tubular accoustafoil fan, is capable of a maximum flow velocity of 68 mph (or) 30.5 m/s. The 
tunnel is equipped with a smoke rake located upstream of the settling chamber section in the 
tunnel. The rake is connected to a pressurized tank fed by a Rosco fog generator, which produces 
the fog/smoke by heating Rosco fog fluid. A pressure regulator on the tank and a fog volume 
control on the generator allow for optimal adjustment of the smoke sheet as per the test model 
size and/or tunnel speed. A still camera was used to record the images and a basic lighting system 
was available for illuminating the test section during the testing. The test model in the wind 
tunnel was supported using a piece of copper tubing passing through holes made in the test 
section walls and the test model. The ends on the tubing rested on the load cells. One end of the 
tubing was fitted with a mechanism to alter the angle of attack the test model during the testing. 
aerodynamic forces acting on the model were measured using these load cells. The load cells are 


capable of measuring forces between 0.0015 kg and 5 kg. The other end is connected to the data 


pressure tank, 
smoke generator =| 
anti-turbulence screens 
high-intensity light, contraction. 





Figure 2-5: Schematic of the wind tunnel and flow visualization facility. Courtesy: MAE Dept. 
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acquisition system used for recording the force data. 


2.2.2. Test Model. The wing model used for wind tunnel testing was built with a step 
whose depth could be varied remotely during the testing. This mechanized wing model consisted 
of wing sections employing the NACA 4415 airfoil with a lower surface step configuration of X, 
=0.5, L, =0.5, and a variable depth. The chord of the square plan form model was 16 inches while 
the span was 16 inches resulting in a surface area of 1.78 ft”. The span of the model allowed for 
only 1 inch of clearance between the model and the wind tunnel on either side so that the flow 
could be assumed to be entirely two dimensional. The model was fabricated using a 1/8" thick 
sheet of balsa wood for the wing sections, a 1/16" thick balsa sheet for outer sheeting, 1" x 3/8" 
spruce for spars, 1/8" aluminum sheet for the variable lower surface step. During the testing, the 
step was actuated using a remote controllable mechanism involving a servo motor fitted within 
the model using a set of plastic linkages. The various step depths were marked in black ink on the 
wing-tip airfoil made of Plexiglas for easy detection during the testing. The servo motor was 
fitted onto the balsa framework inside the wing. To facilitate easy access to the motor and the 
wiring, an access plate was designed and fabricated out of a 1/8" balsa sheet and then filed to give 
it a smooth finish so its upper surface matches with the contour profile of the rest of the wing 
surface. It was then screwed in place onto the framework. Figures 2-6 (a), (b), (c), and (b) show 
the set of NACA 4415 airfoils made, wing framework, servo and step surface linkage mechanism, 
perspective view of the model with the access plate and the servo motor mounted on a set of wing 
ribs. The step-activating mechanism was operated using a transmitter remotely. The model was 
mounted with the step fully retracted so that the model could be assumed to be corresponding to 
the conventional NACA 4415 airfoil considered the base airfoil for all the cases. During the 
testing, the step depth was varied gradually to record the experimental data for each modified 


airfoil configuration of X1=0.5, Li=0.5, and the particular step depth setting. 
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Figure 2-6: Illustrations of the test model. (a) NACA 4415 airfoils; (b) wing framework; (c) servo 
and step surface linkage; (d) perspective view of the wing model; (e) model mounted in the test 
section; and (f) lighting system inside the test section. 
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3. RESULTS AND DISCUSSION 


Through numerical studies and experimental investigation, useful results were generated 
for airfoil configuration comprising of conventional NACA airfoil and the corresponding 
modified airfoil with step introduced on the lower surface. This section presents the 
comprehensive description and analyses of all the usable results obtained. Results are presented 
for airfoils by varying some geometric and aerodynamic parameters along with discussion of the 
results with regard to the flow structures discussed in the first chapter of this thesis, pressure 
distribution, and aerodynamic characteristics associated with the modified NACA 4415 airfoil 
configuration set at a predefined attitude and a given set of inlet conditions. The numerical results 
obtained were used to compare the various stepped airfoil cases with the corresponding cases 
using jet by varying the jet parameters. Further, this section presents the results comprising the 


experimental force measurements leading to aerodynamic coefficient data. 


3.1. COMPUTATIONAL RESULTS 


Results showing the flow field developments are presented as velocity vector, static 
pressure contour and pressure coefficient plots using FLUENT and Tecplot as post processors. 
Further, aerodynamic characteristics such as lift, drag and pitching moment coefficients are also 
presented as obtained from the resulting pressure and shear stress distributions at different flight 
conditions. Conventional NACA 4415 airfoil and the modified airfoil configuration were 
evaluated for a given set of inlet conditions. On the basis of the results generated by Stephen 
Witherspoon” and Fathi Finaish (1996) according to which airfoil with step on the lower edge 
with X; =0.5, L; =0.5, and D,; =0.5 is the best in terms of aerodynamic performance, further 
analysis was sought for the cambered NACA 4415 airfoil. Table 3-1 enlists the various cases 


investigated numerically as a part of this research. The operating conditions matching the 
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standard atmospheric values of pressure, temperature, and density were kept the same for all the 


cases during the study. 


3.1.1. Passive Flow Control. In this study, this flow control technique involves the 
use of step to modify the flow field around a NACA 4415 airfoil. The resulting airfoil 
configuration brings about some flow field developments which might help control the flow so as 


to enhance the aerodynamic performance. 


Table 3-1: Summary of NACA 4415 airfoil configurations studied. 


Airfoul Configuration niet U, [m/s] 


= X=0.5; D=0.5; L=0.5 
2 | _Hate gg WW! G06 | 8.36 | 4 
| 10 | X=0.5; D=0.5; L=0.5 


| 12 | X=0.5; D=0.5; L=0.5 
1s | Bawe | t8e6 | 2640 | 4 
| 14 | X=0.5; D=0.5; L=0.5 
pas [pe __ [ses [oss [8 
| 16 | X=05;D=0.5;L=05 | 2506 | 3753 | 8 





3.1.1.1. Velocity contours and streamlines. In order to explore the flow field 
developments around a NACA 4415 airfoil with a step, let us start by considering a sample 
velocity contour and streamline plot. Figure 3-1 shows a global view of the streamlines and 
velocity contour field around airfoil with step configuration of X; =0.5, L; =0.5, and D,; =0.5 on 
the lower edge. Figure 3-2 presents the pressure contours for the same configuration. The legend 


Figure 3-1 shows the velocity contours colored by magnitude of velocity in meters per second, 
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where the free stream velocity is 26.4 m/s corresponding to a Re of 1.8 million. The angle of 
attack is 4 degrees. In the figure it can be seen that as the flow progresses, it accelerates over the 
front portion of the airfoil, slightly decelerating past the quarter chord leaving a wake behind the 
trailing edge of the airfoil. Figure 3-3 shows the magnified view of the flow inside the region of 
stepped airfoil shown in Figure 3-1 along with the plots for cases of Re = 0.6 million and 1.2 
million. As can be noticed, the flow separates at the leading edge of the step forming a vortex 
behind the step face and then reattaches itself some distance downstream. A secondary vortex 1s 
formed after the reattachment near the top step corner as shown. The secondary vortex is much 
smaller than the primary one. Recirculation zones are clearly shown in the figure, which are 
similar to the predicted flow developments illustrated in Figure 1-2. Though the step extends till 
the trailing edge, the center of the primary vortex occurs closer to the step face as can be seen in 
Figure 3-1. Figures showing velocity contour and streamline plots for the modified airfoil with 
lower edge step configurations of X; =0.5, L; =0.5, and D, =0.5 at a = 4° and at Re(s) of 0.6, 1.2, 
and 1.8 million are grouped together. Also figures showing velocity contour and streamline plots 
for the same airfoil configurations at a = 8° at Re of 2.5 million and at a = 2° at Re of 1.7 million 
are grouped together. The legend on all the velocity contour and streamline plots defines the 


dimensionless velocities represented by the colored vectors defined as: 


v= V/U.., where V = [(X-Velocity) + (Y-Velocity)’] (3.1) 


With the increase in Re, the overall flow velocities in the flow field will be higher. But the 
strength of the vortices may not be high enough for all the stepped cases for any benefit in terms 
of the aerodynamic characteristics in each and every case which is discussed further in the later 
sections. Higher inlet velocities produce stronger vortices in the step region thus allowing the 


possibility of producing better lift characteristics which to prove is one of the goals in this study. 
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From the plots in Figure 3-3, not much variation can be seen as the Re is increased. There is a 
slight variation in the velocity contour plots shown in Figure 3-4 in comparison with those in the 
Figure 3-3. Figure 3-5 shows the plots illustrating the vorticity distributions for the various cases 
studied at a = 4°. As can be noticed from the plots, the vorticity gradients within the step cavity 
increase as the Re is increased. The increase in the vorticity gradients is also clear from the Figure 
3-6. As there is a consistent rise in drag for the modified airfoils over the base airfoil cases, the 
focus of this study was to identify the benefits in terms of the lift to drag ratios. The 


corresponding results are discussed in the section of aerodynamic coefficients. 


od el By? Pressure distributions. The effect of these flow developments on the 
pressure around the airfoils and the resulting aerodynamic characterisitics can be better 
understood by analyzing the plots of static pressure contours which are curves along which the 
pressures are equal. Figure 3-2 shows a global view of pressure contours around the airfoil 
configuration of Figure 3-1. The legend indicates values of pressure expressed as pressure 
coefficient C, varying from -2.5 to 1. Figures 3-7 and 3-8 contain the pressure contour plots 
corresponding to the velocity vector plots grouped together as mentioned earlier for the same set 
of parameters. The C, varies from -1.3 to 1 on all these plots showing the magnified view of the 
pressure distribution within the step cavity. The recirculation in the step cavity due to the 
formation of vortices can be seen as concentric circles of pressure contours with their center 
coinciding with the center of the vortex. Regions with denser spacing of pressure contours are 
indicative of stronger pressure gradients and vice-versa. These intense regions can be visualized 
downtream of the vorex where there is reattachement of flow and thus a pressure gain. Also 
regions near the airfoil leading edge and the upper side of the front portion have high pressure 
gradients. It can be observed that there is high pressure at the leading edge due to flow stagnation. 


As the flow passes over the convex upper portion, there 1s decrese in pressure due to the flow 
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Figure 3-1: Global View of Velocity Contours and Streamlines around a stepped NACA 4415 
airfoil at a = 4°; Re = 1.8 million; X; =0.5; L; =0.5; and D, =0.5. 





Figure 3-2: Global View of Pressure Contours around a stepped NACA 4415 airfoil at a = 4°; 
Re = 1.8 million; X, =0.5; L,; =0.5; and D, =0.5. 
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Figure 3-3: Velocity Contours and Streamlines in the step cavity of a modified NACA 4415 
airfoil at a = 4°; X, =0.5; L; =0.5; and D, =0.5. (a) Re = 0.6 million; (b) Re = 1.2 million; (c) Re = 
1.8 million. 
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Figure 3-4: Velocity Contours and Streamlines in the step cavity of a modified NACA 4415 
airfoil; X; =0.5; L, =0.5; and D, =0.5. (a) a = 8°, Re = 2.5 million; (b) a = 2°, Re = 1.7 million. 
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Figure 3-5: Vorticity Contours in the step cavity of a modified NACA 4415 airfoil at a = 4°; 
X, =0.5; L; =0.5; and D, =0.5. (a) Re = 0.6 million; (b) Re = 1.2 million; (c) Re = 1.8 million 
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Figure 3-6: Vorticity Contours in the step cavity of a modified NACA 4415 airfoil; X, 
=0.5; L; =0.5; and D, =0.5. (a) a = 8°, Re = 2.5 million; (b) a = 2°, Re = 1.7 million 
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Figure 3-7: Pressure Contours in the step cavity of a modified NACA 4415 airfoil at a = 4°; X; 


=().5; L, =0.5; and D,; =0.5. (a) Re = 0.6 million; (b) Re = 1.2 million; (c) Re = 1.8 million. 





(b). 


Figure 3-8: Pressure Contours in the step cavity of a modified NACA 4415 airfoil; X, 
=0.5; Lj=0.5; and D, =0.5. (a) a = 8°, Re = 2.5 million; (b) a = 2°, Re = 1.7 million. 
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acceleration shown by the Figure 3-1. Further downstream, the pressure builds up till it reaches 
the undisturbed air pressure level. In contrast to the pressure field on the upper surface, there 1s 
not much variation in the pressure on the lower surface except in the step cavity due to the 
formation of vortices and rettachment of flow downstream. From the plots for stepped 
configurations in Figure 3-7, it can be seen that with the increase in Re, there is higher change in 
pressure (AP) around the airfoil though the overall trend remains the same for all the cases shown 
in Figure 3-7. The magnitude of of pressure in the region of flow reattachment in the step cavity 
is higher at higher angles of attack which can be observed from the pressure plots by comparing 
Figure 3-7(c) and Figure 3-8 in the for a(s) of 2°, 4°, and 8° and Re(s) of 1.7, 1.8, and 2.5 million 
respectively, as from the Figure 3-7, it can be seen that Re does not have much effect on the 
pressure in the region of flow reattachment in proportion with the overall changes in the flow 
field pressure around the airfoil. 

In addition to the pressure contour plots, pressure coefficient plots facilitate further 
understanding of the pressure variation along the edges of the airfoil configurations. Figure 3-9 
illustrates the plots of pressure coefficient versus airfoil chord for a range of Re(s) and a = 4°. 
The upper curve in all the plots indicates the variation of pressure coefficient along the airfoil 
lower surface and vice versa. As can be noticed from the plots for base airfoil cases in Figure 3-9 
and Figure 3-10, the lower surface pressure distribution flattens out after an initial curvy portion 
which clearly is not the case with the stepped airfoil configurations. The effect of introducing a 
step is evident from this observation. From the plots for stepped airfoil cases, it is very clear that 
starting from the step face there is a slight drop in pressure due to the presence of trapped 
vortices. As we move further downstream, there is an increase in the pressure on the lower 
surface resulting from the force exerted by the large rotating vortex on the airfoil surface. As can 
be observed from the plots, the pressure on the airfoil edge comprising the step is higher than that 


for the base airfoil cases. This observation is consistent for all the stepped airfoil cases. This 
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explains the higher C, values and the corresponding lift to drag ratios which will be discussed 


further in the following section of this thesis. 


3.1.1.3. Aerodynamic coefficients. All the aerodynamic characteristics 
including lift, drag, and moment coefficients directly result from the pressure distributions around 
the airfoil at a given set of operating conditions. As the present study considers a range of Re(s) 
and airfoil configurations at different a’s, the whole set of flow conditions bring about many 
cases for study. The plots in Figure 3-11 show the % change in lift coefficient (AC) for various 
cases studied at a = 2° without the use of active flow control using jet. AC; here represents the 
difference in C, obtained with the unmodified airfoil and the modified airfoil with step. A positive 
value indicates an increase in C; and a negative value indicates a decrease in C, with respect to the 


unmodified airfoil. Percent change in lift coefficient for any modified airfoil case is defined by 


AC] P 
C; for base airfoil 


% changeinC, = 100 (3.2) 


Figure 3-12 shows the lift to drag ratio plots obtained from the lift and drag data for a = 
2°. Figure 3-13 shows the plots for % change in lift coefficient for cases studied at a = 4° 
followed by the lift to drag ratio plots shown in Figure 3-14. Figures 3-15 and 3-16 present the 
corresponding plots for cases studied at a = 8°. Figures 3-11; 3-13; and 3-15 show that AC, 
obtained with the use of step on the lower surface of a NACA 4414 airfoil is positive for all the 
cases meaning the stepped airfoil produces higher lift than the base airfoil. It is the highest for the 
case with Re = 2.5 million, the trend being the same at all a’s ranging from about 20% to 35% 
increase over the base airfoil. In Figure 3-11 the AC, obtained as Re increases from 0.6 million to 
1.2 million is higher than that obtained as Re increases from 1.2 million to 1.8 million. Figures 3- 


12; 3-14; and 3-16 have plots comparing the lift to drag ratio for the base airfoil and stepped 
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Modified airfoil cases 
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Figure 3-9: Pressure Coefficient versus chord of airfoils at a = 4° for a range of Re(s). Left 
column: NACA 4415 (base) airfoil; Right column: Modified airfoil with step on the lower surface 
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(b). a = 2°; Re = 1.7 million 
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1.0 Pressure 
Coefficient 





OO O81 O<2 O08 O44 Of ODF ODF OB OF 10 


(d). a = 2°; Re = 1.7 million 


Figure 3-10: Pressure Coefficient versus chord of airfoils for different a(s) and Re(s). Left 
column: NACA 4415 (base) airfoil; Right column: Modified airfoil with step on the lower edge. 
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Figure 3-11: Comparison of lift characteristics for modified airfoil 
configurations for o = 2° and a range of Re(s). 
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Figure 3-12: Comparison of Lift to Drag Ratio of base and modified airfoil 
configurations for a = 2° and a range of Re(s). 
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Figure 3-13: Comparison of lift characteristics for modified airfoil 
configurations for a = 4° and a range of Re(s). 
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Figure 3-14: Comparison of Lift to Drag Ratio of base and modified 
airfoil configurations for a = 4° and a range of Re(s).S 
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Figure 3-15: Comparison of lift characteristics for modified airfoil 
configurations for a = 8° and a range of Re(s). 
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Figure 3-16: Comparison of Lift to Drag Ratio of base and modified airfoil 
configurations for a = 8° and a range of Re(s). 
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airfoil for the various cases studied. This trend with the L/D ratios for the stepped airfoils is also 
noticeable in the plots shown in these figures. The values of L/D ratio are higher for the stepped 
airfoil cases than those of the base airfoil cases for all Re’s except for Re = 0.6 million. Though 
there is an increase in C, over the base airfoil cases for all the stepped airfoil cases, there is rise in 
Cy as a consequence of the use of step. The contributions to the higher lift to drag ratios come 


from the significant increments in C, for the modified airfoil cases. 


3.1.2. Active Flow Control Using Jet. The previous section dealt with all the cases 
studied without the use of jet or external energy for flow control. This section deals with the use 
of air injecting jets to alter the flow field by controlling the trapped vortex in order to enhance the 
aerodynamic performance of stepped airfoils which parallels with the title of this thesis. Thus this 
section was titled as such. The effect of using an air injecting jet in the step cavity was studied on 


a NACA 4415 airfoil with a backward facing flat step introduced on the lower surface. 


CP Influence of air injecting jet placed in the step cavity. This study 
focuses on understanding the effect of an air injecting jet placed in the step cavities on the 
aerodynamic characteristics of stepped airfoils with an objective of enhancing their aerodynamic 
performance. The cases were studied with the jets placed at two different locations and the jet is 
injected at four different angles 0°, 15°, 30°, and 45° made with the direction of free stream 
velocity which coincides with the X-axis respectively from each location as shown in Figure 3-17 
thus forming several cases for study. Figure 3-17 shows the locations of the jets placed on the top 
and bottom of the step. Table 3-2 enlists all the cases studied using a jet as a part of this research. 
In all the cases discussed in this section, the passive flow control technique realized as a step 
created on the lower side of a NACA 4415 airfoil is augmented by the use of external energy in 


the form of an air injecting jet to control the flow field within the step cavity. 
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(a). (b). 
Figure 3-17: Jet placed in the step cavity at different locations. (a) on the top; (b) on the bottom 


Table 3-2: Summary of modified NACA 4415 airfoil cases 
studied using jet in the step cavity at a = 2°; Re = 1.7 million. 
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The mass flow rate of injection corresponds to the jet velocity (V;) set for each case 
studied. The other parameter used for characterizing the jet is the angle of the jet (C). The width of 
the jet is about 3.22% of the depth of the step. Since the free-stream density and width of the jet 
are fixed, the mass flow rate is a direct function of V;. When the jet velocity takes the value equal 
to the free-stream velocity of 24.93 m/s, the mass flow rate is about 0.07 kg/s. Use of jets in the 
step cavities produced interesting results. Preliminary results obtained for cases with jet placed at 
each location and for different angles of injection show that jet placed on the bottom of the step 
face produces the best results overall with regard to the aerodynamic characteristics. Kitsios”' et 
al through their research involving numerical study of NACA 0015 airfoil using Zero Net Mass 
Flux (ZNME) jets concluded that jets placed closer to the uncontrolled separation point require a 
lower jet velocity to achieve the desired lift enhancement. In other words, for a given jet velocity, 
higher lift coefficient is obtained when the jet is placed closer to the separation point. In the 
present study, it was observed that the jet placed at the corner of the step where the flow separates 
from the edge / surface of the airfoil which proves the observation made by them. Thus further 
investigation was conducted by placing the jet on the step bottom. Another observation made by 
the same research group mentioned above is that spreading rates of the jet effluxes and decay 
constants of oscillatory jets were higher than those of continuous jets in case of circular jet 
orifices. However, the current study involves a preliminary step in the studies on application of 
active flow control on stepped airfoils. Hence continuous jets are chosen for the entire study. All 
the results for the cases with jet were obtained for fixed inlet conditions of a@ = 2° and Re = 1.7 
million. Since this particular flight attitude condition corresponds to the cruise regime of the 
UAV Pioneer considered for the case study discussed later in this thesis, the focus in this 
investigation with regard to the enhancement of aerodynamic performance of stepped airfoils 
using jets was kept on the modified airfoil cases at a = 2° with the Reynolds number fixed at 1.7 


million and the corresponding U,, at 24.93 m/s. Figure 3-18 illustrates the various jet and step 
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Figure 3-18: Illustration depicting the influence of jet on the stepped airfoil. 


geometry parameters along with the key flow developments in the step cavity of a modified 
airfoil using jet. As can be seen the vortex system existing within the step cavity comprises the 
large, primary vortices rotating counter-clockwise and the small, counter-rotating, secondary 
vortices. There are three main observed effects of jet on the flow field characteristics. First, the 
noticeable effect of jet is the pinching of the vortex system. It can be observed in all the jet cases 
that the jet is squeezing the large primary vortex. This can be easily realized by comparing the 
velocity contour and streamline plots presented in the next section for the modified airfoil cases 
without jet with those for the jet cases in the Figures 3-3, 3-4, and the plots for the jet cases 
discussed in the next section. Table 3-3 quantifies the dependence of the location of the center of 
vortex formed in the step cavity on the jet angle by comparing the jet cases with those of the 
stepped airfoil without a jet. The degree of the squeeze increases with the increase in the angle of 
the jet placed within the step cavity. Secondly, the effect of the jet is to result in an increase in the 


lift due to the increased force exerted by the vortex as illustrated in Figure 3-18. This increase in 
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Table 3-3: The effect of ¢ on the location of center of vortex 
with Cu fixed at 0.01731; V;= 2U.. 


Jet Angle, ¢ Distance of the votex center from 
the step face / step length 
0.180 
wm 0.110 


0.105 
0.100 
0.090 





the force exerted upon the airfoil surface by the vortex is because of the introduction of jet in 
which case the vortex system is strengthened with regions of acceleration visible in the outer 
layers of the large primary vortex in the flow visualization plots which will be discussed further 
in the next section. The third effect of the jet could be explained using the basic principles of 
action and reaction. Reflecting on the fundamental concept of a jet flap’® to better understand the 
influence of jet on the flow field characteristics, a jet flap is a thin, high energy jet that is directed 
downward at some angle with respect to the free stream. The effect of the jet flap is to creat a 
reaction force / lift due to the vertical component of the jet momentum. With a slight modification 
in the purpose of jet which in this research is to influence the flow field and attempt to control the 
trapped vortices in the step cavity, the effect of the jet could possibly be understood as one which 
generates a reaction force that acts in the direction opposite to the drag thus canceling part of the 
drag force acting on the airfoil. The result is reduced drag which 1s evident from the plots 
showing the dependence of the aerodynamic characteristics of stepped airfoil on the jet 


parameters of ¢ and V;. The results will be discussed in detail in the Section 3.1.2.4 of this thesis. 


Sal P2998 Velocity contours and streamlines. Figure 3-19 illustrates the flow 


field as streamlines superimposed on velocity contours colored by velocity magnitude over a 
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modified NACA 4415 airfoil with a step using a jet placed in the step cavity as shown in the 
Figure 3-17(b). Figure 3-20 presents the pressure contours for the same case. The velocity 
contours in Figure 3-19 are colored by dimensionless velocity v as defined in the Section 3.1.1.1. 
The dimensionless velocity ranges from 0 to 2 in steps of 0.25 as shown by the legend. This case 
of study is same as that of the case for a = 2°, Re = 1.7 million discussed earlier in this chapter in 
addition to which active flow control realized using a jet was employed for enhancing the 
aerodynamic performance. The jet is ejected at angle of 45° with a jet coefficient Cu = 0.01731 
corresponding to jet velocity V;= 2U... The effect of jet is to energize the flow field within the 
step cavity which is more obvious from the Figure 3-21. In the figure, regions of acceleration can 
be spotted in the outer layers of the vortex in all of the three jet cases. From the figures, it is 
evident that the jet has an identifiable effect on the flow field by accelerating the flow within the 
vortex system. In other words, the jet is energizing the flow thereby strengthening the vortex. The 
pinching nature of the jet along with the flow acceleration caused by the jet in the step cavity act 
together to strengthen the vortex system prevalent in the step cavity. This explains and is evident 
from the benefits in terms of the lift and drag characteristics obtained from the use of jet at 
different angles over the modified airfoil with a step without jet which will be discussed in detail 
in the following sections. From the vorticity contours shown in Figure 3-22, it can be observed 
that there is a slight variation in the vorticity distribution within the step cavity when Cu 1s varied 
keeping ¢ fixed. The vorticity gradients increase as the jet velocity is varied which can be 


understood by comparing the plots in Figures 3-22 and 3-23. 


341.2.3. Pressure distributions. Plots illustrating the pressure distributions 
facilitate better understanding of the various flow developments associated with the use of jet 
placed in the step cavity. Figure 3-20 presents the global view of pressure distribution around the 


modified NACA 4415 airfoil at a = 2°, and Re = 1.7 million with a step using a jet placed in the 
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Figure 3-19: Global View of Velocity Contours and Streamlines around a stepped NACA 4415 
airfoil using a jet placed in the step cavity with jet parameters ¢ = 45°, Cu = 0.01731 at a = 2°; Re 
= 1.7 million; X, =0.5; L, =0.5; and D, =0.5. 





Figure 3-20: Global View of Pressure Contours around a stepped NACA 4415 airfoil using a jet 
placed in the step cavity with jet parameters C = 45°, Cu = 0.01731 at a = 2°; Re = 1.7 million; 
X, =0.5; L, =0.5; and D, =0.5. 
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Figure 3-21: Velocity Contours and Streamlines in the step cavity of a modified NACA 4415 
airfoil using a jet placed in the step cavity with jet parameter Cu = 0.01731 at a = 2°, Re = 1.7 
million; X; =0.5; L; =0.5; and D, =0.5. (a) step without jet; (b) ¢ = 0°; (c) ¢ = 15° 
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Figure 3-21: Velocity Contours and Streamlines in the step cavity of a modified NACA 4415 
airfoil using a jet placed in the step cavity with jet parameter Cu = 0.01731 at a = 2°, Re = 1.7 
million; X; =0.5; L; =0.5; and D, =0.5. (contd.) (d) ¢ = 30°; (e) ¢ = 45° 


50 


# Vorticity —=120-- — 











4 .' Adesso 5 A et 
120 wet , feee ew a! 
m0 7! pein ee ao 
= 7 \—__—__ a 
See Wer 6 =e 12u-—> =| — 
Se ee et TOE 
— eer Oe 
. 


(c). 


Figure 3-22: Vorticity Contours in the step cavity of a modified NACA 4415 airfoil using 
a jet placed in the step cavity with jet parameter Cu = 0.01731 at a = 2°, Re = 1.7 million; 
X; =0.5; L; =0.5; and D, =0.5. (a) step without jet; (b) ¢ = 0°; (c) ¢ = 15° 
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Figure 3-22: Vorticity Contours in the step cavity of a modified NACA 4415 airfoil using 
a jet placed in the step cavity with jet parameter Cu = 0.01731 at a = 2°, Re = 1.7 million; 
X,; =0.5; L; =0.5; and D, =0.5. (contd.) (d) ¢ = 30°; (e) C= 45 
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(C). 
Figure 3-23: Flow field within the step cavity of a modified NACA 4415 airfoil using a jet placed 
in the step cavity with jet parameters ¢ = 15°; Cu = 0.00108 at a = 2°, Re = 1.7 million; X; =0.5; 
L, =0.5; and D, =0.5. (a) Velocity Contours and Streamlines; (b) Vorticity Contours; (c) Pressure 
Contours 
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step cavity. The jet parameters are ¢ = 45°, and Cu = 0.01731 corresponding to V;= 2U... The 
contours in the plot showing the global pressure distribution are colored by the magnitude of 
pressure coefficient which ranges from -2.5 to 1. From the figure, it can be seen that there is flow 
stagnation in front of the airfoil. The pressure increases on the bottom surface as the flow 
progresses while on the top surface it decreases. As can be observed from the plot, there is higher 
variation in the pressure on the upper surface than that on lower surface. The pressure takes the 
highest values in the stagnation region around the airfoil leading edge while the lowest values 
occur on the upper surface. Figure 3-24 illustrates the magnified pressure contour plots showing 
the variation in pressure within the region of the step cavity. The C, on the legend varies from - 
1.3 to 1 on all these plots. Comparing the case of stepped airfoil using jet at ¢ = 0° with that of the 
modified airfoil case without jet shown in Figure 3-24(a), there is a slight variation in the pressure 
due to the introduction of jet. There is a substantial increase in pressure as C is increased to 30° 
and further increased to 45°. Pressure gradients are high in the region where the large vortex 1s 
stationed and also in the vicinity of the jet indicated by the presence of many pressure contours 
spaced closer. In all the C, plots it can be noticed that the minimum pressure in the step cavity 
occurs at the vortex core thereby proving the fact that a vortex exhibits a pressure minimum at the 
center. As compared with the case of stepped airfoil without jet, there is significant variation in 
the pressure in the step cavities in the jet cases studied due to the influence of jet on the flow 
field comprising the system of contra-rotating vortices. 

Figure 3-25 shows the pressure coefficient plots illustrating the pressure distribution on 
the airfoil surface for the base airfoil, stepped airfoil, and stepped airfoil with jet cases for a = 2°; 
Re = 1.7 million. As can be seen from Figure 3-25(a) for base airfoil, the lower surface pressure 
distribution indicated by the lower curve is smooth with not much variation when compared with 
the plots for various cases. From the plot for stepped airfoil without jet, it is very clear that 


starting from the step face there is an increase in the pressure on the lower surface. This increase 
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Figure 3-24: Pressure Contours in the step cavity of a modified NACA 4415 airfoil using 
a jet placed in the step cavity with jet parameter Cu = 0.01731 at a = 2°, Re = 1.7 million; 
X;=0.5; L, =0.5; and D, =0.5. (a) step without jet; (b) ¢ = 0°; (c) ¢ = 15° 
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Figure 3-24: Pressure Contours in the step cavity of a modified NACA 4415 airfoil using 
a jet placed in the step cavity with jet parameter Cu = 0.01731 at a = 2°, Re = 1.7 million; 
X; =0.5; L, =0.5; and D,; =0.5. (contd.) (d) ¢ = 30°; (e) ¢ = 45° 
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(c). step with jet; C = 0°; Cu = 0.01731 
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(f). step with jet; C = 45°; Cu = 0.01731 


Figure 3-25: Pressure Coefficient versus chord of NACA 4415 airfoil configurations at a = 2°; Re 


= |.7 million for various jet parameters. 
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in pressure results from the force exerted by the vortex on the wall of the airfoil. This explains the 
increase 1n C; due to the introduction of a step on the lower surface. Figures 3-25(c), (d), (e), and 
(f) show the pressure coefficient plots for the cases with jet for different jet parameters as 
mentioned below each plot. The plots for all the cases of step with jet show that the upper surface 
suction peak is unaltered as shown by the lower curves on the plots though there is much 
variation in the pressure on the lower surface as shown by the wavy upper curves depicting the 
distribution of pressure. There is a significant change in the lower surface pressure distribution as 
compared to the case of step without jet. The sudden drop in the pressure at the step face in all the 
jet cases is due to the presence of jet itself as the flow velocities are high in the vicinity of the jet. 
The pressure rises back to normal just past the step face. Moving a little further along the airfoil 
wall there is a drop in pressure at the point on the wall corresponding to the location of the vortex 
core directly beneath it. This feature is noticeable in all the jet cases. Moving further downstream, 
there is a significant rise in the wall pressure due to the force exerted by the vortex on the airfoil 
surface. This variation in the pressure again, can be observed in all the jet cases. The pressure on 
the airfoil edge comprising the step in all the jet cases is appreciably higher than that for the 
stepped case evident from the plots in Figure 3-25. This pronounced variation in the lower surface 
pressure explains the higher values of C, obtained for all the jet cases and the corresponding 


values of L/D ratio which are discussed in detail in the following section. 


3.1.2.4. Aerodynamic coefficients. All the information gathered in terms of 
the lift, drag and moment characteristics results from the pressure distributions discussed earlier 
in this thesis. Figure 3-26 has three plots showing the lift, drag and lift to drag data generated for 
the various cases of stepped airfoil using jet along with the case of step without jet. Looking at 
the figure at a glance to get the broader picture of the influence of jet on the aerodynamic 


performance of the stepped airfoil, it is evident that use of jet produces benefits by enhancing the 
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Figure 3-26: Dependence of the aerodynamic characteristics on ¢ and Cu when compared with 
the NACA 4415 airfoil cases without jet at ao = 2°; Re = 1.7 million. (a) Lift; (b) Drag; (c) Lift to 
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aerodynamic characteristics. The values of lift coefficient for all the jet cases are higher than that 
of the case without jet. Though there is a variation in C, as the jet angle is varied while also 
varying the jet velocity from 0.25U,, to U., thereby increasing Cu from 0.00027 to 0.00108, its 
dependence on jet angle varied from 15° to 45° seems to be diminishing as the jet velocity is 
increased beyond the magnitude of U... As can be seen in the figure, C, for the case of step 
without jet is 0.8324. With the activation of the jet at C= 0°, and Cu = 0.00027 corresponding to 
V,=0.25U.., the C; increases to 0.8460. As the jet velocity is increased to its peak value of V;= 
2U., with Cu = 0.01731 at ¢ = 0°, there is a further rise in C; to 0.8802 which is about 6% higher 
than the value for the step without jet case. There 1s a steady increase in C, as the jet velocity is 
increased keeping C fixed at 0°. For the case of jet at C= 15°, the C, obtained is 0.8427 which is 
lesser than the value for the case of step without jet. There is a slight drop in C, as the jet velocity 
is increased from V, = 0.25U., to 0.5U,, correspondingly Cu from 0.00027 to 0.00108. This 
observation could possibly be understood from the Figure 3-23 which shows the velocity 
contours and streamlines colored by dimensionless velocity in the region of step cavity. The jet 
injected at an angle of 15° hits the outer layers of the large primary vortex. The jet injected at this 
particular jet velocity of 12.465 m/s seems to be ineffective in controlling the trapped vortex 
which is supported by the pressure contour plots shown in Figure 3-23. As can be noticed from 
the plots, there is only a slight variation in the pressure when compared with the pressure plots for 
the cases of step without jet and the jet cases shown by Figure 3-24. By comparison, the C, 
obtained for another jet case for the same angle by increasing the jet velocity Vj from 0.5U,, to 
0.75U,, with Cu = 0.00243 is 0.8443 which is higher than that of the value for the case of step 
without jet. Also, the C; increases consistently with increase in the jet velocity till it takes its 
maximum value of 0.8921 at Cu = 0.01731 corresponding to the peak jet velocity V;= 2U., as 
shown by the corresponding plot in Figure 3-26(a). When the jet angle is set at 30°, the C, 


obtained is 0.8451 at V,; =0.25U.,. With increase in the jet velocity there is a gradual rise in C; 
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until the jet velocity takes the value of U.,.; the corresponding Cu increased from 0.00027 to 
0.00433. As V; is increased further beyond U., till Vj = 2U..; Cu = 0.0173, there is a rapid increase 
in C; consistent with the increment in the jet velocity. Increasing the jet angle to 45° produces 
more interesting results. The jet parameters € = 45°; Vj = 0.25U... Cu = 0.00027 give a C, of 
0.8467, the highest for any jet case studied here at the lowest jet velocity. The C, plot for the case 
of jet angle fixed at 45° follows the same trend as that of the case of ¢ = 30° as the jet velocity is 
increased from 0.25U,, to U..; Cu from 0.00027 to 0.00433 and increased till V; equals 2U., and 
Cu = 0.0173. But beyond U,, the values of C; are higher for the jet case at ¢ = 30°. The best results 
overall are obtained for the jet cases of jet angle equal to 30° and 45°. The case with jet angle ¢ = 
30° produces the highest C; about 8% higher than the value for the stepped airfoil case without 
the use of jet at the maximum value of Cu equal to 0.01731 corresponding to a jet velocity V;= 
2U., for the case of a = 2°, and Re = 1.7 million which is evident from the C, plot shown in 
Figure 3-25(e). The maximum pressure occurring on the airfoil surface downstream of the step 
face is higher in case of the jet case for ¢ = 30° than that for ¢ = 45°. Figure 3-26(b) shows the 
drag data. It can be observed from the plots that the drag for some jet cases is lesser than that of 
the case without jet, the trend is opposite for some other jet cases, while for the rest of the cases, 
there is a slight or no variation as compared to value for the case of step without jet. The least Cg 
is obtained for the case with ¢ = 45° at Cu = 0.01731. For the all cases studied but one with ¢ = 
45°, the Cy curve shows a decreasing trend which 1s quite interesting. In the cases associated with 
jet used in this study, the concept of jet flap could possibly be used to understand the reduction in 
drag in the jet case for ¢ = 45°. A component of the jet momentum acts in the direction opposite 
to the free stream thus canceling a part of the drag resulting from the introduction of the step. 
According to Richard L.Kline as described in his 1985 article titled The Ultimate Paper 
Airplane’, a stepped airfoil traps some of the displaced air molecules, reverses their direction, and 


produces a forward “push” which he called the “drag utilization”. In other words, a force in the 
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direction opposite to drag is created due to the flow field developments or the vortex systems 
energized by the action of jet placed within the step cavity due to which there is an identifiable 
reduction in the drag which is supported by the results shown in Figure 3-26(b). Use of jet with ¢ 
= 45° at Cu = 0.01731 to control the flow field in the step cavity appears to be working the best 
following this concept of “drag utilization” and the result 1s, optimum aerodynamic 
characteristics, in this case the consistent decrease in Cy. 

These changes in the lift and drag resulting from the use of jet might affect pitcihng 
moment considerably. Further investigation is required to rule out the possibility of having to 
expend usable power to obtain the benefits in aerodynamic performance to control the resulting 
changes in the pitching moment when the jet is used in the step cavity which might nullify the 
benefits produced by the jet. Figure 3-26(c) shows the plots showing the variation in the lift to 
drag ratio as the jet parameters are varied. The lift to drag ratio is the highest for the jet case with 
C equal to 45° at the highest value of Cu. Thus the jet case with ¢ = 45° and Cu = 0.01731 
produces the best results in terms of the aerodynamic performance quantified by the L/D ratio of 
the modified NACA 4415 airfoil case with jet which is about 21% higher than that of the case for 


step without jet as illustrated in the figure. 


3.2. EXPERIMENTAL RESULTS 


This section deals with the results and discussion of the experimental data obtained 
through the wind tunnel testing of a NACA 4415 airfoil based wing-model. The experimental part 
of the present research comprises of the measurements of forces. Force measurements were made 
to calculate the aerodynamic forces and the corresponding coefficients about the test model built 
with a panel for varying the step depth on the lower step configuration during the testing. Forces 
were measured over a stepped wing test model for a step length of L; = 0.5, D, = 0.0 (base airfoil), 


D, = 0.1, 0.2, 0.3, 0.4 & 0.5 (stepped airfoil configurations). Due to the limitation of the 
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maximum airspeed that could be achieved in the tunnel, force measurements were feasible only 
for Re as high as 0.6 million. Efforts were made to record some useful flow visualization data 
using the in-house LED based lighting system. However, due to the unavailability of a 
sophisticated flow visualization system, quality images revealing the flow field developments 
including vortical structures, reattachment information for the various tested configurations could 
not be captured. Figure 2-6 shows a sample image capturing the flow over the wing model with 
step fully deployed during the wind tunnel testing. 

Aerodynamic forces acting on the NACA 4415 based wing model were recorded during 
the wind tunnel testing using force balances. The lift and drag data obtained were processed to 
compute the aerodynamic coefficients, C; and Cy. The values of lift to drag ratio were then 
computed using the C; and Cy values. Figure 3-27 presents the plots illustrating the variation of 
C,, Cg, and the lift to drag ratio for the cases of base airfoil (the case with D;= 0.0) and the 
modified airfoil configurations studied (cases formed by varying the step depth from D,= 0.1 to 
0.5). Lift coefficient plots presented in Figure 3-27(a) show that the lift increases slightly as the 
step depth is increased from D,;= 0.1 to 0.5. The C, values obtained are the highest for the case 
with D,= 0.5 amongst all the cases studied. Drag coefficient plots presented in Figure 3-27(b) 
show that the trend is exactly the same as that observed in the C, plots. The drag increases as the 
step depth is increased. The base airfoil case produces the least drag while the stepped airfoil case 

with D,= 0.5 produces the highest. The results shown in this figure follow the same trend as that 
observed in the numerical results. As can be seen from the Figure’s 3-27(a) and (b), the 
experimental values of drag obtained in proportion with the lift are significantly high as compared 
with the numerical lift and drag values for a given step configuration. Hence the experimental 
values of lift to drag ratio obtained are lower than the numerical values. Figure 3-27(c) shows 
various plots illustrating the variation in the value of the lift to drag ratio with change in the angle 


of attack. While analyzing in qualitative terms, it can be noted that the lift to drag ratios decrease 
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Figure 3-27: Variation of aerodynamic characteristics with angle of attack at Re = 0.6 million. 
(a). Lift; (b). Drag; and (c).Lift to Drag ratio 
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with increase in the step depth for the stepped airfoil configurations. 

One of the cases of particular interest is the airfoil configuration set at a = 4°, Re = 0.6 
million. This particular case was tested to serve the purpose of validating the numerical results 
(presented in Figures 3-13 and 3-14) obtained for the same set of aerodynamic parameters and 
inlet conditions. Comparing the experimental and computational results , it 1s apparent that at Re 
= 0.6 million, the base airfoil produces better aerodynamic characteristics, 1.e. higher lift and 
lower drag thus resulting in higher lift to drag ratios than those obtained with the stepped airfoil 
configuration. Thus the experimental result validates the one and only numerical case among all 
the cases studied for which the introduction of step on the lower surface seems to be ineffective in 
delivering better aerodynamic performance than the base airfoil case. The consistency noticed in 
the lift to drag ratio results from the increase in lift coefficient due to the introduction of step on 
the lower surface at about the entire range of attack tested just as predicted by the computational 
results obtained for the base airfoil and stepped airfoil configurations. Referring to the Figures 3- 
3(a) and 3-7(a), it can be concluded that for the case with Re = 0.6 million for the stepped airfoil 
configuration set at a = 4°, the vortex formed in the step cavity is not strong enough to alter the 
flow field characteristics so as to enhance the aerodynamic performance. The influence of step 


and jet on the pitching moments however needs to be investigated. 
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4. CASE STUDY 


This research effort explored the possibility of using a combination of passive and active 
control techniques to investigate the aerodynamic performance of conventional NACA 4415 
airfoil using steps. The case study discussed here was aimed at the application of usable results 
obtained from the research conducted to a reconnaissance plane on a strategic mission 
considering various stages of flight to define our problem. The flight regimes include take-off & 
climb, cruise/reconnaissance, descent & landing, before the mission is completed. The current 
study focused on pointing out the benefits with regard to the aerodynamic performance, arising 
from a stepped airfoil replacing a conventional airfoil on a real life Unmanned Aerial Vehicle 
(UAV). The aircraft of interest was the RQ-2 Pioneer which was operational with the Navy, 
Marine Corps, and Army since 1986. Pioneer uses a NACA 4415 airfoil. This tactical 
reconnaissance and surveillance UAV operates in a Reynolds number ranging from 1.8 million to 
3 million. Figure 4-1 shows the drawings of RQ-2 Pioneer. All the major technical 


specifications” are shown in Table 4-1. Figure 4-2 shows an operational RQ-2 Pioneer in flight. 





Figure 4-1: Schematic drawings’® of RQ-2 Pioneer (RQ stands for Reconnaissance). 
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Table 4-1: Technical Specifications of the UAV RQ-2 Pioneer. 





Figure 4-2: RQ-2 Pioneer in flight. 


Courtesy: http://www.military.com/ 
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The phases of flight envelope studied as a part of this research effort were limited to take- 
off, climb, and cruise which corresponds to the reconnaissance mode, assuming that the descent 
and landing modes of flight are similar to climb and take-off with regard to the flight conditions. 
Amongst the flight phases, take-off and climb are the shortest, cruise being the longest and the 
most crucial phase for a reconnaissance plane. Since endurance is directly affected by the lift to 
drag ratio, it is an important performance parameter for any a reconnaissance plane and any 
reduction in drag achievable with complementary increase in lift can considerably enhance the 
aerodynamic performance of the plane. Otherwise any combination of lift and drag resulting in a 
lift to drag ratio higher than that obtained with a conventional NACA airfoil is beneficial which 
brings the application of passive and active flow control techniques into the current picture. 

The best stepped configuration was identified for each flight mode on the envelope. The 
data was mapped for the mission-specific objectives of a reconnaissance mission. Figures 3-9 
through 3-14 illustrate the summary of the various cases studied from which useful results were 
picked for this case study. The flight regimes associated with the various cases are discussed in 
the next section. Results for the three phases of the entire flight envelope viz., the take-off, climb 


and cruise are discussed in the following sections. 


4.1. WORKING ASSUMPTIONS 

The chord of the NACA 4415 airfoil is 1 meter. All the experimental airfoil data 
available are for airfoils of unit chord length. For uniformity and ease of comparison of results 
obtained, the NACA 4415 airfoil with its actual chord of length 0.77 m as used on the UAV 
Pioneer was scaled to | m. The climb velocity of the UAV RQ-2 Pioneer studied here was 
assumed to be about 1500 ft. per min. The stall velocity was assumed to be around 95 km/h. The 


lift-off velocity was presumed to be equal to 1.3 times the stall velocity. The angles of attack 
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during the take-off, climb, and cruise were assumed to be 4°, 8° and 2° respectively. Based on the 


technical specifications available for the UAV, the wing loading was computed to be 502 N/m’. 


4.2. RESULTS AND DISCUSSION 

4.2.1. Application of Passive Flow Control. This section discusses the 
application of the results obtained from studying the modified airfoil cases for a range of angles 
of attack and Reynolds number’s. Corresponding to the flight conditions for each phase of flight, 


the results were picked for specific values of a and Re. The case study presented below discusses 


the Take-off; Climb; and the Cruise phases of flight of the UAV RQ-2 Pioneer. 


4.2.1.1. Take-off phase. Numerical simulations were conducted using a 
conventional NACA 4415 airfoil considered as the base airfoil and the modified airfoil as an 
airfoil with an intermediate step on the lower surface. Results obtained show that lift coefficients 
are higher for the stepped airfoil than the base airfoil and the magnitude of the change in C, 
increased with increase in the angle of attack. On the other hand, the drag values are higher for 
the stepped airfoil. Based on the C; and C, values obtained for each of the cases, the take-off 


distance is calculated using the following formula: 


Sto = (1.44 Wo’) /[g Po S Cimax (T- (D + u;(Wo-L) ) J (4.1) 


The thrust T was estimated from the rated power of the UAV per the technical 
specifications listed in the Table 4-1 and the average velocity during the take-off which was 
obtained by averaging the velocities corresponding to the Re’s of 0.6, 1.2, and 1.8 million 
occurring during the take-off phase. Also, the lift L and the drag D were computed using the 


average velocity occurring during the take-off phase. The minimum take-off distance for the base 
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airfoil case is about 155m while for the stepped airfoil case it is about 127m. Hence the ratio of 
Sto for the stepped airfoil to the Sto for the base airfoil equals 0.82 which means the use of an 
intermediate step defined by Xs=0.5, Ls=0.5, Ds=0.5 yields an 18% reduction in the minimum 
take-off distance required as supported by the computational results discussed in the previous 
section. This reduction mainly results from the increase in C; due to the step. With the rise in C, 
there is an increase in the lift. Also, the L/D ratio computed from the lift and drag data for the 
stepped airfoil case is higher than that for the base airfoil case. 

Some simulations were run for cases at two intermediate Reynolds numbers lower than 
the value for lift-off. The two Reynolds numbers chosen are 0.6 and 1.2 million, while the 
maximum take-off Re is 1.8 million. Figure(s) 3-13 and 3-14 illustrate the AC; and L/D ratio plots 
for all the take-off cases. At the peak Re which is 1.8 million, the C; values just like the L/D ratios 
for the stepped airfoil are higher than those for the base airfoil as shown in the figures. The trend 
is the same for the Re 1.2 million. When the Re is 0.6 million, the results obtained do not follow 
the same trend as that of the other cases, though the C, values are still higher than those for the 
base case at the same Re number. The L/D ratios for stepped airfoil cases are less than those for 


the base airfoil because of relatively higher drag. 


4.2.1.2. Climb phase. The conditions for the fastest climb/ maximum climb rate 
were assumed for the numerical investigation. The climb velocity at these conditions is 37.53 
m/s. The corresponding Reynolds number is 2.5 million. The climb angle 1s 9°. The angle of 
attack 1s 8°. There is about 20% increase in the C, over the base airfoil as illustrated in Figure(s) 


3-15 and 3-16. Also the L/D ratio increases by 11%. 


4.2.1.3. Cruise phase. The cruise Reynolds number is 1.7 million, the velocity 


being 24.93 m/s. The use of step yields about 34% higher C, than the base airfoil and the L/D 
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ratio increases by 3%. Figure(s) 3-11 and 3-12 show the AC, and L/D ratio plots for the cruise 
case corresponding to a = 2°. This increase in the L/D ratio yields about 20% increase in the 
endurance (E) of the UAV RQ-2 Pioneer with the use of stepped airfoil over the conventional 
NACA 4415 airfoil. The endurance of the UAV with a conventional NACA 4415 airfoil based 
wing is 5 hours. The ratio E stepped airfoil / E base airfoil WaS Computed to be 1.2 meaning the endurance 
could be increased to about 6 hours or a 20% increase in the endurance could be obtained by 
employing the step on the lower surface. The endurance and range calculations were made using 


the equation’s below: 


E = (n/ey)*(Cr"/Cp)*(2p.08) (Wy — Wo”) (4.2) 


R = (1/es)*(Ci/Cp)*In W/W 1) (4.3) 


1) was assumed to be 0.8 for all the calculations while c, was assumed to be 2.27e-7 ft’. Range 
was computed for both the base and stepped airfoil configurations. The ratio R stepped airfoil / R. base 
airfoil WaS Calculated to be around 1.03 which means the activation of lower surface step during the 
cruise results in an increase of about 3% over the conventional NACA 4415 airfoil based wing 


originally used on the UAV Pioneer. 


4.2.2. Step Schedule. The step is activated as the airspeed approaches the value 
corresponding to a Re of 0.6 million during the take-off. Results obtained from simulations at 
higher Re occurring during climb and cruise phases of flight show that the step defined by 
Xs=0.5, Ls=0.5, Ds=0.5 produces better aerodynamic characteristics than the conventional 
NACA 4415 airfoil. Thus the step could be held activated during the flight regimes of climb and 
cruise. Similarly, it could be held activated during the descent and landing. Complementing the 
rise in C, and correspondingly the L/D ratios, there will be increase in the pitching moments 


which will also affect the performance of the airfoil configurations. The current study did not 
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focus on the effect of variation of the pitching moments which needs to be investigated. 


4.2.3. Application of Passive and Active Flow Control. The effect of using an air 
injecting jet into the step cavity was studied on a conventional flat faced step on the lower surface 
of a NACA 4415 airfoil. Figure 3-17(b) shows the location of the jet placed on the bottom of the 
step for all the jet cases discussed in this study. Since cruise is the longest and the prime 
operational phase of reconnaissance planes, this study focuses on the cruise phase. Active flow 
control using jet was employed on the modified NACA 4415 airfoil configuration to enhance the 
aerodynamic performance of the stepped airfoil in the cruise regime. Any benefits yielded by the 
jet would also be applicable to other flight regimes based on the results discussed earlier in this 
case study. Table 3-2 enlists all the jet cases studied along with the associated jet parameters C , 
and Cu. Figure 3-25 shows the plots illustrating the aerodynamic characteristics of C,, Ca, and 
L/D ratio for the various jet cases studied comparing them with the case of stepped airfoil without 
jet. The jet case for ¢ = 45°, and Cu = 0.01731 produces the best results with regard to the 
aerodynamic performance. This case gives about 7.5% increase in C, over the corresponding case 
without jet. Further, there is about 12% reduction in Cy. As a result, this jet case yields about 22% 
increase in the L/D ratio over its stepped counterpart without jet. To emphasize on the fact that 
the jet proves to be enhancing the aerodynamic performance, the results obtained here are 
enhancements over the stepped airfoil without jet which itself delivers enhancement of the L/D 
ratio over the conventional NACA 4415 airfoil. Getting back to the performance calculations, the 
endurance increases by about 26% due to the rise in the L/D ratio resulting from the employment 
of jet in the step cavity over the stepped airfoil case without jet. Employment of jet in conjunction 
with step on the lower surface of a NACA 4415 airfoil used on the UAV RQ-2 Pioneer could 
possibly result in as much as 50% increase in the endurance over the case of the aircraft using the 


conventional NACA 4415 airfoil. With the use of jet, the range increases by about 5% over the 


stepped airfoil case without jet. This is a tremendous benefit to the aircraft in terms of the 


efficiency and could result in a huge money savings and considerable reduction in emissions. 
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5. CONCLUSIONS AND OUTLOOK 


The results presented in this thesis constitute the next step towards understanding the 
effects of a step-induced vortex on the aerodynamic characteristics of a conventional NACA 4415 
airfoil for a range of Reynolds number’s and angles of attack. This study takes on a relatively new 
family of airfoils which are yet to be researched extensively. The objective of the study was to 
enhance the aerodynamic performance of the stepped NACA 4415 airfoil for a chosen range of 
Reynolds numbers and angles of attack, following on the preliminary studies conducted on 
stepped airfoils in the past by other researchers from around the world. The airfoil with a 
backward facing step on the lower surface was shown to have the potential to enhance the 
aerodynamic characteristics by increasing the lift coefficients and lift-to-drag ratios considerably 
in most cases and substantially in some of the cases studied through this research. Use of jets was 


proved to be effective in enhancing the aerodynamic performance of stepped airfoils. 


Based on the results obtained by Stephen Witherspoon’ and Fathi Finaish (1996), the 
airfoil configuration with a step on the lower edge step located at mid-chord, with a step depth 
half of the mid-chord airfoil thickness and extending till the trailing edge was chosen as the 
common configuration for all the cases studied 1n this research effort. Special focus was placed 
on enhancing the aerodynamic performance of this stepped airfoil configuration with the 


application of active flow control using air injecting jet placed in the step cavity. 


Results obtained show that the lift coefficients were higher by as much as 37% for the 
best case among all the stepped airfoil cases studies in the range of a’s 1.e. a = 2°, 4°, and 8° and 
the entire range of Reynolds number’s 1.e. Re = 0.6 million to 2.5 million. The application of 
active flow control proves to be effective in enhancing the lift of the stepped airfoils studied at Re 


= 1.7 million; a = 2°. The lift coefficients were higher for all the jet cases, significantly high in 
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many jet cases and slightly high in some based on the setting of the jet parameters C, the jet angle 
and V,, the jet velocity. The stepped airfoil case using jet with ¢ = 45° and Cu = 0.01731 
corresponding to V;= 2U., produces the best results in terms of the aerodynamic performance 
enumerated by the values of L/D ratio of the modified NACA 4415 airfoil case with jet which are 
significantly higher than that obtained for the case of step without jet and other jet cases, though 
the maximum C; occurs at Cu = 0.01731 for ¢ = 30°. The contribution to the highest L/D ratio 
obtained in the jet case for ¢ = 45° comes from the reduction in Cy which 1s significanlty higher 
than in the jet case for ¢ = 30° at the same Cu as compared with the Cg obtained in the case of 


step without jet. 


Drag-coefficient data indicate that with the introduction of a step, drag increased. This 
observation is consistent in all the modified airfoil cases studied. The use of active flow control in 
the form of an air injecting jet in the step cavity of the modified airfoil configuration produces 
some promising results for the jet cases studied at Re = 1.7 million; o = 2°, with higher benefits in 
terms of drag reduction over the stepped airfoil cases when compared with the drag data for the 
base airfoil cases. For jet cases formed by the combination of various C’s of 0°, 15°, 30°, and 45°; 
Cu’s of 0.00027, 0.00108, 0.00243, 0.00433, 0.00974, and 0.01731 corresponding to V;= 
0.25U,,, 0.5U., 0.75U., U., 1.5U., and 2U., respectively, the drag data obtained are either less 
than those of the case of step without jet or show little variation from those values. However, 
there is a reduction in drag by about 12% over the stepped airfoil in the step with jet case with jet 


parameters ¢ = 45° and Cu = 0.01731 which is the case of interest in this study. 


The lift-to-drag ratios were higher for all the stepped airfoil cases studied except for the 
case of Re = 0.6 million at a = 4°, proving once again that a stepped airfoil is superior to a 
conventional NACA airfoil with regard to the aerodynamic performance. The lift to drag ratio 1s 


maximum for the case of Re = 2.5 million at a = 4°. Higher lift-to-drag ratios are obtained using 
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jet in the step cavity for some of the cases studied at Re = 1.7 million; a = 2°. For the jet case 
with ¢ = 30°, as the jet velocity is increased, the values obtained show little variation from that of 
the stepped airfoil case without jet. For the jet cases with C = 0° and 15°, the lift-to-drag ratios 
obtained by varying the jet velocity are lesser than the value for the case of step without jet in 
most of the cases studied. The case of interest 1s the jet set at 45° in which case the lift-to-drag 
ratios are considerably higher for most cases studied by varying the jet velocity. The best results 
are obtained for the case of ¢ = 45° and Cu = 0.01731 which produces an L/D ratio which is about 
22% higher over the stepped airfoil case without jet that gives a value about 3% higher than the 
NACA 4415 considered as the base airfoil for all the cases studied in this research. As discussed 
earlier, the influence of the jet on the pitching moments due to the changes in the lift and drag 


needs to be investigated. 


Flow control is one of the most promising and the most sought after areas of research in 
the field of fluid mechanics and aerodynamics. Achieving an enhancement in the aerodynamic 
performance correlates to improving the overall efficiency of the aircraft as a significant amount 
of power could be saved due to the reduction in drag and / or increase in the lift resulting in huge 
fuel and money savings and most importantly reduced emissions. Future work in the direction of 
the current research with the application of flow control and the preliminary studies conducted on 
stepped airfoils in the past could possibly involve experimental investigation of a full-scale model 
of RQ-2 Pioneer using the modified NACA 4415 airfoil based wing; and further numerical and 
experimental investigation towards the design, development and testing of feedback control based 
devices aimed at enhancing the aerodynamic performance of the modified airfoils with steps 


operational on the entire flight envelope designed for a particular mission. 
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Department of Mechanical Engineering & Aeronautics, City University London, London, UK 


Abstract 


A triple slotted aerofoil following the Handley Page 44F design was tested at City University London T- 
2 wind tunnel. The model allowed the study of a fixed triple slotted wing as well as investigation of the 
effects of isolated slots at different locations along the chord. PIV measurements were performed 
within the chord Reynolds number range in between approximately 200,000-400,000. The model was 
tested at an angle of attack of 22°. Measurements of mean streamwise velocity, velocity fluctuations 
and shear stress were analysed. The study shows how an isolated slot is more favourable when it is 
placed closest to the leading edge, although slow moving fluid regions can still be found close to the 
trailing edge. Fully attached flow was only achievable by using all three slots. In addition, the fully 
slotted profile is shown to generate channel exit velocities in the order of 1.4U... which highly energise 
the boundary layer on the suction side. 


1 - Introduction 


Flow control is employed to improve the aerodynamic performance of aircrafts’ wings. It can be 
realised via the use of auxiliary aerofoils (Chen [1]), boundary layer injection/suction (Shojaerfard, et al 
[2]) or vortex generators (Johnston [3]). Studies on auxiliary aerofoils (Lachman [4], Handley Page [5], 
Wenzinger, C., et a/ [6]) culminated in the development of high-lift systems (HLSs), namely slats and 
flaps, which are currently used in most commercial passenger airliners. The deployment of these 
auxiliary aerofoils changes the overall wing geometry and the pressure distribution on the constituent 
elements resulting in increased lift coefficient, C, (at a price of increased drag). In addition, the 
convergent shape of the channel formed between the slat/flap and the subsequent aerofoil makes use 
of the pressure differential to drive and accelerate the passing air, which is shown to be beneficial for 
the flow development as it contributes for attached flow and thus prevents separation (van Dam, C.P. 
[7]). A large amount of work with respect to the common HLS is present in the literature (Chen [1], van 
Dam, C.P. [7]). With regards to the deployable slat, not only its geometry and positioning of the latter 
in relation to the main aerofoil is of importance, but also when deployed, the events taking place in the 
cove region necessary to allow these HLSs to be retracted. These complex highly recirculating flow 
regions result in a high-noise signature and also present a challenge for flow simulation, which results 
in difficulties in multi-element aerofoil modelling (Olson, L., et a/ [8], Tung, C., et a/ [9], Savory, E. et al 
[10]). Moreover, complex mechanical actuators need to be designed and implemented to allow 
deployment. The fixed leading edge slat, however, has the benefit of requiring neither a cove region 
nor a mechanical actuator, while positively contributing to the flow development due to the above cited 
reasons. In addition, Katzmayr and Kirste [4] concluded that generally an auxiliary fixed HLS produces 
better results than a retractable slat because of the lack of the cove region. 

Recently, the interest in a fixed leading edge slot has grown due to its potential application for wind 
turbines, as it has been demonstrated efficient in both preventing flow separation and increasing the 
overall aerodynamic efficiency (Ashworth [12], Weick [13]). Also, modern flow field analysis of this type 
of configurations are lacking in the literature as a means to further understand the behaviour of the 
flow over such aerofoils. 

Therefore, the present investigation was intended to determine the effect of the fixed leading edge slot 
on the flow field and to observe the correlation between slot location and separation reduction over an 
aerofoil at a high angle of attack by means of digital flow field capture. Three different slot locations 
along the chord are investigated in this study. In addition, the flow field around a triple slotted wing is 
visualised to understand the impact of the slots. 


2 — Experimental facility and details 


The slotted aerofoil under investigation follows the design of Handley Page. This was achievable, 
thanks to the kind support of the HP foundation, which agreed on make the original model available to 
City University. A sketch of the model can be seen in figure 1. 
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Figure 1. Geometry of the HP 44F triple slotted aerofoil. 
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The model comprises one major section and three smaller front aerofoils (‘main’ and 1, 2 and 3 
respectively in figure 1). Hence, three leading edge slots /, J, K are formed throughout the whole span 
of 500mm. The chord of the aerofoil is c=153.8 mm in length; where the chord c,, of the main aerofoil 
is 84.6mm (0.55 x/c) and the geometric characteristics of the slots can be found in table 1. 


Inlet Exit Inlet/Exit Exit location 
(x/c) (x/c) Ratio (x/c) 


a 0.123 0.016 


0.117 0.017 
0.147 0.013 11.3: 


Table 1 — Geometric characteristics of the experimental model. 





The channel inlet dimensions are taken from the points were the slotted aerofoil deviates from the 
solid contour and the exit dimensions are aligned with the exit orientation (as shown in figure 1). 

The experiments were performed in the T-2 wind tunnel at City University London. The test section of 
the latter measures 1.12 m x 0.81 m x 1.78 m and the freestream turbulence intensity is <0.8%, which 
is deemed appropriate for the current experiment. The tunnel was operated at 20 m/s, 30m/s and 40 
m/s resulting in a Reynolds number Re; in the range of 200,000-400,000. 

In this study, we use x, y, Z to indicate the streamwise, wall-normal and spanwise directions. Mean 
velocity along this axis is indicated by U, V, W, while fluctuating components are denoted as U’, V’, W’. 
In order to generate substantial flow recirculation over the suction side of the aerofoil, the model is 
installed at an angle of attack (AoA) of 22 degrees. Particle Image Velocimetry (PIV) is employed to 
capture velocity fields. The experimental setup with the aerofoil model, laser sheet and camera are 
shown in figure 2. 
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Figure 2. Schematic of experimental setup. AoA 22°. 


The aerofoil was mounted horizontally in the centre of the test section and supported by Perspex 
plates to supress any 3D flow effects. To prevent deflection of the constituent aerofoils under load, 
small plastic clips were used to support them in the centre. The present aerofoil allows the study on 
the slots location effect through opening/closing individual sections. The examined cases are shown in 
figure 3. 

















Figure 3. Studied configurations (a) solid (b) slot 1 open (c) slot 2 open (d) slot 3 open (e) triple or fully slotted. 


A Phantom M310 CMOS high-speed camera with a resolution of 1280x800 pixels’, equipped with a 
Nikon 100 mm lens was used for particle imaging. A Litron LDY300 series dual-pulsed laser was used 
to illuminate the particles. A periscopic laser beam delivery arm with a combination of optics was used 
to form the laser sheet and to direct it into the wind tunnel test section through the bottom wall, which 
offers optical access. A Laskin 9307-6 oil droplet generator was used to generate seeding particles 
(olive oil), which have a nominal diameter of approximately 1 um. A high-speed synchroniser (TSI 
610036) was used to trigger the camera and the laser. 

A 1000 frame set was acquired for each configuration to promote converging statistics. Each image 
pair was captured with a time interval between 5 us and 15 us depending on the flow velocity. 

Given the limitations in optics and camera availability, it was necessary to join two or more field of 
views (FOVs) to obtain the entire flow field around the region of interest. In addition, the camera had to 
be moved for each configuration, which resulted in slightly different resolved FOV for each case. 
These factors resulted in different unresolved regions, as it becomes apparent in subsequent figures. 
The FOVs used for the different configurations can be seen in figure 4. 
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Figure 4. Field of view (FOVs) for (a) solid (b) slot 1 open (c) slot 2 open (d) slot 3 open (e) slotted cases. 


TSI InSight 4G software was used to capture the images and generate the vector fields. Image pre- 
processing entailed the subtraction of minimum intensity background noise. Spurious vectors 
generated by laser light reflection, low seeding density, etc. were removed and replaced by statistical 
generated vectors. 


3 - Results and discussion 


Section 3.1 discusses contours of mean velocity fields, which allowed comparison of the behaviour of 
the flow in the presence and absence of the slots. Section 3.2 considers the mean velocity 
fluctuations, while section 3.3 contains contours of the shear stress. Instantaneous velocity fields are 
analysed in section 3.4, which aids to visualise the flow patterns emerging from the slots. 


3.1 - Mean velocity fields 


Figure 5 shows the flow features of the solid aerofoil (slots closed), here taken as benchmark case. At 
the chosen angle of attack (22 degrees) this case is not able to sustain attached flow over the suction 
side and significant flow separation is observed. The same behaviour was observed at all tested 
velocities. The overlaid velocity vector field (in white) aids visualising the recirculation bubble. This is 
centred close to the aerofoil trailing edge and has height of 0.3c with reversed flow up to a magnitude 
of 0.2 U.. Moreover, the onset of separation is rather early, in correspondence of what would be the 
location of the first slot (see figure 6(a) for comparison). 
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Figure 5. Contour of time-averaged streamwise velocity component U/U.. 
at AoA=22° for solid aerofoil; U.=40 m/s 
Velocity vectors are visualized by white arrows. 


Figure 6 shows mean velocity contours for the remaining test cases. 





Figure 6. Contour of time-averaged streamwise velocity component U/U.. at AoA=22 for 
(a) slot 1 open (b) slot 2 open (c) slot 3 open (d) slotted; U.=40 m/s. 


Opening of the first leading edge slot is expected to alleviate flow separation, as a high-momentum jet 
should entrain the external flow. It is shown in figure 6(a) that this high-momentum flow re-energises 
the boundary layer, resulting in reducing the large-scale separation, when compared to previous case 
in figure 5. It is apparent that the boundary layer re-develops after the first slot and remains attached 
for the most part of the upper surface, despite the existence of a small packet containing reversed flow 
at the trailing edge. However, the re-developed boundary layer is not able to fully overcome the large 
turning angle and only has velocity magnitude of 0.2U. at the trailing edge. This could lead to 
unsteadiness and flow separation if adverse pressure gradient is further encountered or the AOA is 
slightly increased. Further, when tested at 20 m/s, this configuration revealed separated flow at the 
trailing edge, however this result is here omitted. 


Figure 6(b) shows the effect of opening the second slot, which causes the development of attached 
flow over the trailing element with small high-velocity packets exiting the slot. A small separation 
bubble of height 0.05c can be seen in the upper surface prior to the slot exit before re-attachment is 
achieved. Although the slot contributes to keeping the flow attached in the region near the slot exit, the 
instantaneous flow field suggests further separation at the trailing edge under this configuration 
(although not shown here for the sake of brevity). 


The effect of opening the third slot is shown in figure 6(c). It is clear that this position is not able to 
contribute to an attached flow. The flow remains separated although the velocity of the flow adjacent 
to the trailing element is seen to increase to 0.2U... This could denote instabilities caused by the 
interaction of the recirculating flow and the flow exiting the channel. 


Utilisation of the remaining two slots, namely slots 1 and 2 in figure 1, further improves the flow quality, 
as more high-momentum flow is entrained. The resulted flow field in the triple slotted configuration is 
shown in figure 6(d). The downstream flow velocity increases further to 0.7U.. close to the trailing 
edge. Moreover, high velocity is injected within the boundary layer and persists over a larger portion of 
the aerofoil. The slot jet has much more impact on the flow field, and the energised region is more 
apparent than that in flow when only one slot was open (figure 6(a)). To summarise, multiple slots are 
beneficial in consolidating the control outcome and the sole use of one slot (irrespective of its location) 
is shown not to fully succeed in keeping the flow attached. However, it is clear that the further 
upstream a single slot is located, the more beneficial its effect is. This is in accordance with the result 
from Weick and Shortal [13] on the Clark Y wing. 


3.2 - Mean fluctuating fields 


Figure 7 shows mean fluctuations of streamwise velocity for the solid case. These have peak values 
over 0.4U., for Re.=4x10°. This is in accordance with unsteadiness within the large separation region 
observed in the mean velocity contours in figure 5. 





Figure 7. Contour of time-averaged streamwise normalised velocity fluctuation U’/U.. at 
AoA=22 for solid aerofoil. 


Similar analysis is carried out for the slotted cases and it is presented in figure 8. Under the same 
conditions, the cases where the second and third slots are open (Figure 8(b)(c)) also experience high 
velocity fluctuations (of the order of 0.25U..) although these clearly diminish after the slot exit. 
Nevertheless, these two cases are characterised by separated flow close to the trailing edge. The 
velocity fields suggest that the further upstream the slot is located, the smaller the unstable flow region 
becomes. Opening of the first slot (in Figure 8(a)) removes instabilities at the leading edge and 
achieves attached flow. The fluctuations closer to the trailing edge move further away from the surface 
and when the model is fully slotted (Figure 8(d)) these instabilities become yet less severe. The 
maximum mean fluctuating magnitude sits close to 0.2U.. for the slotted case (Figure 8(d)) compared 
to a maximum of 0.25 U.. for the case where the first slot is open (figure 8(a)). This suggests attached 
flow throughout the upper surface for the fully slotted configuration. 
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0.25 





Figure 8. Contour of time-averaged streamwise normalised velocity fluctuation U’/U.. at 
AoA=22° for (a) slot 1 open (b) slot 2 open (c) slot 3 open (d) slotted; U.=40 m/s. 


3.3 - Shear stress fields 


At each slot exit, a shear layer develops in between the channel flow and Its surroundings due to the 
velocity differences. To examine the extent of the latter, shear flow fields are presented. The shear 
layer between the flow originating from the jets and the freestream can be seen becoming thinner as 
the location of the slot is moved towards the leading edge. The shear layer developed with slot 3 
open, seen in figure 9(a) is that of a separated flow and reaches magnitude of 3x10 U’V’/U..°. With 
slot 2 open, a considerable reduction of the shear layer extent and magnitude is visible in figure 9(b). 
In addition, small packets of positive shear are seen close to the slot exit, which indicate the 
interference of the external flow with the flow exiting the slot. 
































Figure 9. Contour of time-averaged normalised shear stress U’V'/U." at 
AoA=22" for (a) slot 3 open (b) slot 2 open; U.=40 m/s. 


Figure 10 shows the shear stress distribution for the cases where the first slot is open and for the fully 
slotted case in (a) and (b) respectively. These are plotted separately from the remaining cases due to 
the large difference in shear stress magnitudes observed. 

A decrease of 50% in the maximum shear stress is obServed upon opening of the first slot, as shown 
in figure 10(a). In addition, the shear layer seems to develop close to the trailing edge, in contrast with 
the cases where the slots were located further downstream, that originated a large shear region close 
to the leading edge. Employment of all three slots further decreases the shear stresses and only a thin 
shear layer, far from the aerofoil surface, is visible (Figure 10(b)). Moreover, the maximum shear 
stress is reduced by 30% when compared to the case where the first slot is open in (a). These findings 
show the favourable impact of implementing a fully slotted configuration where there is a much higher 
momentum transfer to the top surface keeping the flow attached. 





Figure 10. Contour of time-averaged normalized shear stress U’V'/U." at 
AoA=22" for (a) slot 1 open (b) slotted; U.=40 m/s. 


3.4 - Instantaneous flow fields 


lt has been shown so far that the jets exiting the channels are responsible for reattaching the flow at 
the trailing edge of the airfoil. To further explore this matter, instantaneous velocity fields are 
investigated, in the hope of offering further support to this hypothesis. The case where the second slot 
is open is depicted in figure 11, showing three consecutive snapshots (a), (b) and (c) to aid 
visualisation of the flow evolution along the chord. In this case, a separated flow region can be seen 


on the upper surface just before the channel exit, being accelerated to 0.7U.. aft the channel exit. This 
causes the development of a new boundary layer at the trailing element and also the softening of the 
shear layer upstream. Further analysis of instantaneous velocity in this configuration shows the 
development of high fluctuations and vortical structures close to the trailing edge, which suggest 
further separation at that region, despite the beneficial effects of the slot. 


Figure 11. Contour of streamwise instantaneous velocity component U/U.. 
at AoA=22 for slot 2 open U..=40 m/s. 





Similarly, the three consecutive snapshots shown in figure 12 show the evolution of the flow field when 
the first slot is open. Slow moving fluid is seen close to the trailing edge and a large unstable region is 
found there, in accordance to findings shown in section 3.2. Small packets of fluid at velocity U.. are 
seen exiting the channel and contributing to reduce the low-momentum region downstream. 





Figure 12 — Contour of streamwise instantaneous velocity component U/U.. 
at AoA=22° for slot 1 open aerofoil U.=40 m/s. 


Instantaneous velocity contours for the triple slotted case are shown in figure 13. These suggest a 
steady behaviour of the flow on the upper surface. The slots channels are seen to output flow at a 
velocity up to 1.4U.., which energises the fluid in the suction side. Flow deceleration is seen close to 
the trailing edge, although the boundary layer is still attached. This is in accordance to the shear layer 
developed in this configuration (as shown in figure 10(b)). This configuration exhibits the same 
behaviour at all the tested Reynolds numbers, which speaks for the consistency of the results. 





Figure 13 — Contour of streamwise instantaneous velocity component U/U.. 
at AoA=22 for slotted aerofoil U.=40 m/s. 


The slot exit velocity is seen to be higher for the slotted case when compared to the exit velocity when 
only one slot is opened. Hence, utilisation of all slots is shown to be the most favourable configuration 
to fully suppress flow separation. An interesting point is that the exit velocity at the first slot differs 
when subsequent slots are used. Also, we must recall the work of Lungstrom [11], who states that the 
gap between the slat and the main aerofoil must not be smaller than 2% chord to obtain favourable 


results, and the work of Abbot and Von Doenhoff [14], who suggest that the optimum slot opening is in 
the order of 1% chord or slightly higher for flaps. In this experiment, the exit dimensions of 1.6%, 1.7% 
and 1.38% chord are shown to successfully energise the boundary layer despite that any variation to 
these parameters could not be investigated. In addition, for an isolated slot case, the channel exit 
velocity is seen to progressively decrease, as the channel is placed downstream. Again, this entails 
that the slot becomes more favourable as it is placed closer to the leading edge, in accordance with 
the work of Weick [13] on the slotted Clark Y wing. It is also shown in this work, however, that slots 
located further downstream are favourable, especially those located in the region 0.75-0.8c. 


4 — Conclusions and further work 


Wind tunnel experiments were performed to study control separation on a triple slotted aerofoil. These 
revealed the positive outcome of leading edge slots in doing so through PIV measurements. 


In the present work, jet velocity at the slot outlet reaches values up to 1.4U.. and is capable of locally 
re-energising the boundary layer in the fully slotted configuration. When only one of the slots is used, 
the channel exit velocity is seen to reduce as the channel is progressively placed downstream, 
although a relationship between the slot position and effect on exit velocity showed no clear trends. 
For the case of the first slot open, the exit velocity was seen to be of the order of U.. and for the case 
of slot 2 open, the exit velocity was measured to be 0.7U... This discrepancy is seen to have impact on 
the overall flow field given that, as a result, the region of separated flow is seen to grow. Flow 
separation can be prevented using solely the first slot, however, relatively low speed is still present in 
the flow field, which suggests that the flow is bound to separate due to unsteadiness. Nevertheless, 
this study concludes that upon using only one slot, the most beneficial location is that closest to the 
leading edge. Improved control, achieved by using all three slots, results in total separation 
suppression. Only this configuration withstands the pressure gradients characterising the model 
guaranteeing a fully attached flow. 


Despite the exit velocity for the slotted case being the same for all channels (i.e. 1.4U..), additional 
studies considering opening of slots / and J, as well as /and K could shed more light into the influence 
of consecutive slots and the chord-wise distance between them. Further study on the aerodynamic 
performance of the cases presented herein via the use of a 6-component force balance is also 
scheduled in order to link the flow physics to the aerodynamic loads. 
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Abstract 


An extremely simple CFD tool is used to compare the calm-water drags of a series of hull forms and to define ‘optimized’ monohull ships 
for which the total (friction + wave) calm-water drag is minimized. The friction drag is estimated using the classical ITTC formula. The 
wave drag is predicted using the zeroth-order slender-ship approximation. Comparisons of theoretical predictions and experimental measure- 
ments for a series of eight hull forms show that—despite the extreme simplicity of the method that is used here to estimate the friction drag 
and the wave drag—the method is able to rank the drags of a series of hull forms roughly in accordance with experimental measurements. 
Thus, the method may be used, with appropriate caution, as a practical hull form design and optimization tool. For purposes of illustration, 
optimized hull forms that have the same displacement and waterplane transverse moment of inertia as the classical Wigley hull, taken as 
initial hull in the optimization process, are determined for three speeds and for a speed range. © 2002 Published by Elsevier Science Ltd. 


Keywords: Optimization; Hull forms; Calm-water 


1. Introduction 


Although CFD-based hull form optimization is not 
routinely used for ship design, a significant number of 
applications of CFD tools to hydrodynamic optimiza- 
tion—mostly for reducing calm-water drag and wave 
patterns— attest to a growing interest in hydrodynamic opti- 
mization. This growing interest in hydrodynamic optimiza- 
tion is a useful development because—in principle—ship 
design implies optimization, e.g. minimization of a func- 
tional that appropriately weighs payload, ship speed, 
motions, and calm-water drag. 

Optimization of a ship hull form involves a number of 
nontrivial issues, including selection of an appropriate 
objective function, choice of optimization scheme, geo- 
metrical representation of hull surface and choice of 
related design variables and constraints, selection of a 
practical and robust CFD tool to evaluate the objective 
function, and decision to perform optimization for a 
single-point design or for multiple-point design, e.g. for 
a single ship speed or for a range of speeds. These basic 
issues involved in hull form optimization have been 
addressed in various ways in the literature, and are now 
briefly considered in turn. 

Regarding objective functions, it 1s well understood that 
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hydrodynamic optimization needs to take into account a 
number of requirements, including speed, calm-water 
drag, motions and loads, and other considerations like 
payload, propulsion, and wakes. In particular, an integrated 
approach that simultaneously considers motions and calm- 
water drag should be used because minimization of ship 
motions without regard to drag, or of drag without regard 
to seakeeping characteristics, can lead to unacceptable 
designs. However, the much simpler task of minimizing 
objective functions associated with calm-water flow 
features (wave resistance or total drag, and/or wave 
patterns) is considered in all the CFD applications to hull 
form optimization that are listed further on, and in the 
present study as well. 

Several optimization algorithms have been used in the 
literature. Three algorithms (steepest descent, conjugate 
gradient, sequential quadratic programming) are con- 
sidered in Ref. [1], and genetic algorithm techniques 
are used in Refs. [2,3]. A hybrid method based on 
coupling of a genetic algorithm and a hill-climbing tech- 
nique is used in Ref. [4]. Alternative ways, noted in Ref. 
[1], of representing the hull surface modifications that are 
required in a hull form optimization procedure have been 
employed. 

With regard to CFD tools, both extremely simple tools— 
Michell’s thin-ship approximation and the slender-ship 
approximation given in Ref. [5]—and more sophisticated 
tools—potential-flow panel methods based on Rankine 


338 S. Percival et al. / Applied Ocean Research 23 (2001) 337-355 


sources and RANS-based viscous-flow methods—have 
been used. 

Thin-ship theory is used in Refs. [3,6—12,33]. The hull 
form optimization results reported in Refs. [13-15], are 
based on the slender-ship approximation. Ref. [16] relies 
on the slender-ship approximation and the Fourier—Kochin 
flow representation given in Refs. [17,18]. 

A combination of a Rankine-source panel method and 
simpler prediction tools are used in Ref. [1] to speed up 
the optimization procedure. Refs. [1,2,9,19-—22], use 
potential-flow panel methods based on Rankine sources. 
RANS-based viscous-flow methods are used in Refs. 
[23-26]. A combination of flow solvers based on potential 
flow, viscous boundary layer, and RANS is used in Ref. 
[27]. 

The usefulness of many currently-available CFD tools 
for routine applications to hydrodynamic optimization is 
restricted by various factors. In particular, some CFD 
tools can only be used within a limited range of ship 
speed and/or wave frequency. Other tools require highly 
structured discretizations of the ship hull and the free 
surface (and even the fluid domain), and may indeed 
fail to provide reliable results unless a discretization 
that satisfies stringent requirements is used. Lack of 
versatility (due to limited range of applicability) or 
robustness, preprocessing discretization requirements 
that are overly time consuming (and hence too expen- 
sive for routine applications), excessive computational 
requirements, and need for highly-experienced expert 
users are factors that can seriously restrict the practical 
usefulness of a CFD tool for routine practical 
applications to ship hull form design and, especially, 
optimization. 

Thus, an extremely simple CFD tool is used in this study 
to estimate the total (friction + wave) calm-water drag of a 
ship. Specifically, the friction drag is evaluated using the 
ITTC 1957 model-ship correlation line formula. The 
calm-water wave drag is estimated using the zeroth-order 
slender-ship approximation given in Refs. [5,18] because 
the exceptional simplicity of this calculation method (no 
nearfield-flow calculations are required) renders it ideally 
suited for optimization (a one-speed wave drag calculation 
for a ship hull represented by 8000 panels requires less than 
1 s on a gigahertz computer). 

This method is used to compare the drags of eight 
hull forms in the series of hull forms analyzed in Refs. 
[28,29] within the mid-term sealift ship technology 
development program. Comparisons of theoretical calcu- 
lations and experimental measurements of drag for this 
series of eight hull forms show that—despite the 
extreme simplicity of the method used here to estimate 
the friction drag and the wave drag—the method is able 
to rank the drags of the variant hull forms roughly in 
agreement with experimental measurements. These 
results—and those reported in Refs. [14—16]—indicate 
that rough approximations to the friction drag and the 


wave drag may be used, with appropriate caution, as a 
practical hull form optimization tool. Indeed, the 
method is used to determine optimized hull forms that 
have the same displacement and water-plane transverse 
moment of inertia as the classical Wigley hull, which is 
taken as initial hull in the optimization process for 
purposes of illustration. 

The studies previously published in the literature on 
hydrodynamic optimization of ship hull forms only 
consider optimization for a single ship speed. The 
present study presents both hull forms optimized for a 
single speed (point optimization) and hull forms opti- 
mized for several speeds (multi-point optimization). In 
fact, a main conclusion of the study is that optimization 
within a speed-range (multi-point optimization) 1s much 
preferable to single-speed optimization, which appears 
to yield highly-tuned optimized hull forms that perform 
well only within a fairly narrow speed range centered at 
the optimization speed. 


2. A series of eight hull forms 


A series of eight hull forms is considered in this study. 
These hull forms, previously analyzed in Refs. [28,29] 
within the mid-term sealift ship technology development 
program, are variants of the baseline hull form that is 
depicted in Fig. 1 and is identified as DIMB model 5501 
in Refs. [28,29]. The other seven hull forms are identical to 
the baseline hull form shown in Fig. | except for the shape 
of the bow bulb and/or skeg. 

The baseline bulb and six bulb variants are shown in Figs. 
1 and 2, respectively. The seven bows consist of the (ellip- 
tical) baseline bulb shown in Fig. 1, four nabla-type bulbs, a 
producible bulb (also elliptical), and a no-bulb bow. The 
baseline skeg and a producible skeg variant are depicted 
in Fig. 3. 

The series of eight hulls that is considered in this study 
are identical except for the shapes of the bow bulb and/or 
skeg. The series consists of the following bulb and skeg 
combinations: 


BULB SKEG 
baseline baseline 
no bulb baseline 
nabla | baseline 
nabla 2 baseline 
nabla 3 baseline 
producible baseline 
baseline producible 
nabla 6 producible 


These eight bulb/skeg combinations are compared in six 
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Fig. 1. Baseline hull. 


hull-to-hull comparisons. Specifically, the baseline-bulb/ 
baseline-skeg hull is compared to the five hulls 


BULB SKEG IDENTIFIER 
no bulb baseline no bulb 
nabla 1 baseline nabla 1 

nabla 2 baseline nabla 2 

nabla 3 baseline nabla 3 
producible baseline producible 


and the baseline-bulb/producible-skeg hull is compared 
to the hull 


BULB SKEG IDENTIFIER 


nabla 6 producible nabla 6 


The six hull-to-hull comparisons are referred to in the 
remainder of this paper by the identifiers listed in the tables 
(no bulb, nabla 1, nabla 2, nabla 3, producible, nabla 6). 


3. Experimental drag 


The drag D of a ship advancing at constant speed U along 
a straight path in calm-water is expressed as 


2D = pU*S*Cy = pU*L7 Cy (la) 


where p is the water density, S“ is the wetted-hull surface 
area, Lyer 1s a reference length (typically the length of the 
ship), and Cy and Cy are nondimensional drag coefficients. 
We have 


Cp GS iL (1b) 


Models of the eight variant hull forms have been tested in 
the DTMB tow tank at model-scale speeds corresponding to 
full-scale speeds in the range of 10—26 knots. The model- 
scale ratio for the 635 ft Strategic Sealift Ship is A= 
32.495. The ship models were free to pitch, heave and roll 
in these resistance tests, but were restrained in surge, sway 
and yaw. Further information about the resistance tests may 
be found in Refs. [28,29]. 

The results of the model tests reported in Refs. [28,29] are 
summarized in Fig. 4, which depicts the experimental drag 
coefficient 


CT? = (Ca + Cp)S*/Liet (2) 
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for the eight hull forms as a function of the Froude number 
F = U/./gLe¢ In the range 
0.16<F<0.31 


The reference length L,.-¢ 1s taken as the length (between 
perpendiculars) of the ship models, and g stands for the 
acceleration of gravity. Cp in Eq. (2) is the residuary-drag 
coefficient—reported in Refs. [28,29]—and the friction 
coefficient Cp is given by the ITTC 1957 model-ship corre- 
lation line formula 


Cr = 0.075/[logy)(Re) — 21° (3) 


where R, is the Reynolds number of the full-scale ship. 
Fig. 4 shows that differences among the drags of the six 
bulbous-bow models are relatively small, and that the no- 
bulb model has a significantly larger drag (except at low 
speed) than the six bulbous-bow models. Thus, Fig. 4 


provides experimental confirmation of the well-established 
advantage of bulbous-bow ships. 


4. Theoretical estimation of drag 


The experimental drag coefficient C7? given by Eg. (2) 
is compared in this study to the theoretical drag 


Cre = Cw + CpSA/LA (4) 


The wave drag coefficient Cw is evaluated using the 
Havelock formula 


2R ~~ d 
pu- Lin T J—-0o K— VD 


for the wave drag Ry associated with the energy radiated via 
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the farfield waves. Here, 


1 U 
v= — with F = —— (5b) 
2F 7 V aLiret 
Furthermore, the wavenumber x in Eq. (5a) is defined in 
terms of the Fourier variable B as 


K(B) = v+4/ + B? (Sc) 


S, and S; are the real and imaginary parts of the farfield wave 
spectrum function S = S(a, B) where a is defined in terms 
of the Fourier variable B as 


a(B) = Vy K(B)/F (Sd) 


This relation and expression (5c) for the wavenumber k = 
Ja? + B? follow from the dispersion relation F*a* = « for 
steady ship waves. 

The Fourier—Kochin representation of farfield waves 
given in Refs. [17,18] defines the wave spectrum function 
S in the Havelock formula (5a) in terms of the disturbance 
velocity u at the mean wetted hull surface. The disturbance 
velocity u at the ship hull can be evaluated using various 
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Se Bip oes ox. Ae computational methods. A simple, practical method is the 
Stati, Sees 5 Cereal slender-ship approximation given in Ref. [5] and summar- 
RS Sees eo ized in Ref. [18]. This theory provides a remarkably simple 
Se Soo explicit approximation to the velocity u, and hence to the 
wave spectrum function S, that is defined directly in terms of 
Big ;3) ekeg Variaiits. the ship speed and the hull geometry. 
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Fig. 4. Measured resistance for a series of eight hulls. 
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Fig. 5. Computed resistance for a series of eight hulls. 


An even simpler approximation to the spectrum func- 
tion is the zeroth-order slender-ship approximation. This 
approximation to the farfield wave spectrum function corre- 
sponds to the trivial approximation u = 0 for the nearfield 
disturbance velocity. The corresponding approximation to 
the wave spectrum function, given in Refs. [5,18], is 


i | n® eetlaxt BY qa + al (n*yP tar (6a) 
= i 


Here, dA and dZ stand for the differential elements of area 
and arc length of the mean wetted hull surface ¥ and the 
mean waterline J’, and 


(X,Y, 2) = (X,Y, Z)/Lree (6b) 


The x axis is taken along the path of the ship and points 
toward the ship bow (i.e. the ship advances in the direction 
of the positive x axis), the z axis is vertical and points 
upward, and the mean free surface is the plane z= 0. 
Furthermore, n* and ft’ are the x and y components of the 
unit vectors 


n = (n*,n’,n*) and t = (t*, f°’, 0) (6c) 


normal to the ship hull surface S and tangent to the ship 
waterline I’; n points inside the mean flow domain (i.e. 
outside the ship) and # is oriented clockwise (looking down). 

Thus, the wave spectrum function S in the Havelock 
formula (5a) for the wave drag is defined explicitly in 
terms of the ship speed and the hull form in the zeroth- 
order slender-ship approximation (6a). The exceptional 
simplicity of this approximation to the wave spectrum 


function (no nearfield-flow calculations are required) 
renders it ideally suited for hull form optimization. 

The zeroth-order slender-ship approximation (6a) may 
be regarded as a generalization of the classical Michell 
thin-ship approximation 


g thin _ [on eke tax 4 (7) 


Both the thin-ship approximation (7) and the zeroth-order 
slender-ship approximation (6a) involve simple distribu- 
tions of elementary wave functions, with strength equal to 
n’, over the ship hull. The major difference between the two 
approximations is that the wave functions are distributed 
over the ship centerplane y = O in the thin-ship approxima- 
tion, while they are distributed on the actual hull surface in 
the slender-ship approximation. Thus, the shape of a ship 
hull is mostly accounted for by means of the strength n* of 
the wave distribution in the thin-ship approximation, while 
the slender-ship approximation accounts for the hull shape 
via both the strength of the wave distribution and the loca- 
tion of the distribution, which is over the actual ship hull 
instead of over the ship centerplane. Thus, the slender-ship 
approximation may be expected to provide a more realistic 
account of hull form than the thin-ship approximation. 

Figs. 1-3 show the panelizations used to represent the 
eight hull forms for the calculation of the wave spectrum 
function given by Eq. (6). Approximately 8000 unstructured 
flat triangular panels have been used in these calculations. 
The theoretical drag Cy" is depicted in Fig. 5 for the eight 
hull forms in the range 0.16 < F < 0.31. 

Figs. 4 and 5 show that the theoretical and experimental 
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drags differ significantly, as might be expected in view of 
the extreme simplifications embodied in the ITTC friction 
drag formula and the zeroth-order slender-ship wave drag 
approximation. However, the question that is under con- 
sideration here—and is in fact most relevant for practical 
applications to hull form design and optimization—1is 
whether a method based on the ITTC friction drag formula 
and the zeroth-order slender-ship wave drag approximation 
might be able, despite its extreme simplicity, to rank the 
drags of a series of hull forms in approximate agreement 
with experimental measurements. 

In fact, the theoretical and experimental results shown in 
Figs. 4 and 5 may immediately be seen to agree in two 
notable respects: both Figs. 4 and 5 indicate that (1) differ- 
ences among the drags of the six bulbous-bow models are 
relatively small, especially at high speed, and (11) the no- 
bulb model has a significantly larger drag than the six 
bulbous-bow models, except at low speed. Thus, Figs. 4 
and 5 provide both experimental and theoretical confirma- 
tion of the well-known usefulness of bulbs for reducing 
drag. The theoretical and experimental results depicted in 
Figs. 4 and 5 are now analyzed further. 


5. Theoretical and experimental comparison of series of 
hulls 


Fig. 6 provides a comparison of both the experimental 
drag coefficient Ce and the theoretical drag coefficient 
CY for the eight hulls in the series defined in Figs. 1-3. 
Fig. 6 shows that, although large differences exist among 
the theoretical and experimental drag coefficients as already 
noted from Figs. 4 and 5, theoretical and experimental 


results are in agreement in several notable respects: 


(1) Both the theoretical and experimental results shown in 
the upper left corner of Fig. 6 indicate that the no-bulb 
hull has a significantly larger drag than the baseline hull 
over the entire speed range 

(ii) Both the theoretical and experimental results in the 
lower left corner indicate that the modified baseline hull 
(baseline bulb with producible skeg) has an appreciably 
larger drag than the nabla 6 hull (nabla 6 bulb with pro- 
ducible skeg), except at low speed 

(111) Both the theoretical and experimental results in the 
right column show that the baseline bulb yields a larger 
drag than the producible bulb and the nabla 2 and nabla 3 
bulbs in the high-speed range, while the opposite holds in 
the low-speed range 

(iv) The center of the left column also shows agreement 
between the theoretical and experimental results for the 
base hull and the nabla | hull, which are essentially 
comparable (although the theory indicates that the nabla 
1 hull has a slightly larger drag than the base hull). 


Thus, the hull-to-hull comparisons shown in Fig. 6 


indicate that, despite their extreme simplicities, the ITTC 
friction drag formula and the zeroth-order slender-ship 
wave drag approximation appear to be able to rank the 
drags of a series of variant hull forms in approximate agree- 
ment with experimental measurements. 

The procedure used in Fig. 6, based on visual comparison 
of the (theoretical or experimental) drag curves of alterna- 
tive hull forms, necessarily involves some degree of subjec- 
tivity. A complementary way of comparing the drag curves 
of two alternative hull forms, say hull A and hull B, consists 
in defining a ranking coefficient r 3 as 


yH(CR - ¢) 
S Fi(CA + C22 
i=1 


This ranking criterion for comparing hull forms A and B 
essentially compares the power 


pULi¢CyU = pU? LieeCy 0 FP? Cy 


required to overcome the drags of hulls A and B for a 
number of Froude numbers F; (with 1 =i=n) within a 
given speed range. The ranking criterion (8) indicates that 
hull B is better than hull A if re > 0. 

The six ranking coefficients 


feo? Tnabla Toabia3 Tabla? Yprod Tabla 6 

determined from both the experimental and theoretical 
drags depicted in Fig. 6 are now considered. As already 
noted, approximately 8000 unstructured flat triangular 
panels were used to represent the eight hulls considered in 
the calculations reported in Figs. 5 and 6. Calculations 
using approximately 32,000 panels have also been per- 
formed for the purpose of estimating the influence of hull 
discretization. Figs. 7 and 8 present the experimental and 
theoretical ranking coefficients for the six hull-to-hull 
comparisons considered in Fig. 6 in the ‘low-speed’ and 
‘high-speed’ ranges 


0.16 < F < 0.24 0.24<F<031 


Seven ship speeds (equal to 14, 15, 16, 17, 18, 19, and 
20 knots) and six speeds (21, 22, 23, 24, 25, and 26 knots) 
are considered for the low-speed and high-speed ranges, 
respectively. Thus, n in Eq. (8) is taken equal to 7 and 6 
for the low-speed and high-speed ranges, respectively. 
‘Experiment’ in Figs. 7 and 8 refers to the ranking coeffi- 
cients determined from experimental measurements, and 
‘32k panels’ and ‘8k panels’ identify the theoretical ranking 
coefficients obtained using 32,000 and 8000 panels, respec- 
tively. Figs. 7 and 8 show that differences between the 
theoretical ranking coefficients obtained using 32,000 
panels and 8000 panels are negligible for the present 
purpose. 

For the high-speed range, Fig. 8 indicates that theory 
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Fig. 6. Comparison of theoretical and experimental drag coefficients for a series of hulls. 


and experiments agree in predicting that the no-bulb 
hull has a significantly larger drag than the baseline 
hull, and that there is no appreciable difference between 
the drags of the nabla | hull and the baseline hull. 
Experiments and theory also agree that the nabla 3, 
nabla 2, producible, and nabla 6 hulls have smaller 
drags than the baseline hull, although there are appreci- 
able discrepancies between experimental and theoretical 


results for these 4 cases (especially for the nabla 3 
hull). Discrepancies between theory and experiments 
are relatively larger for the low-speed range considered 
in Fig. 7, although both theory and experiments agree in 
predicting that the baseline hull has a smaller drag than 
the no-bulb, nabla 3, producible, and nabla 2 hulls. 
Thus, Figs. 7 and 8 show that the theoretical and 
experimental ranking coefficients are in relatively fair 
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Fig. 7. Experimental and theoretical rankings of drag in the low speed range. 


agreement on the whole, considering the extreme simplicity 
of the theoretical method that has been used to estimate the 
drag. 


6. Hull form optimization example 


For purposes of illustration, an optimization procedure is 
applied to the Wigley hull. This mathematical hull form has 
parabolic waterlines and framelines, and is defined by 


y= +(1 — 4x°)(1 — 2562”)/20 (9a) 
The volume displacement V of the Wigley hull (9a) is 
V/L° = 1/360 ~ 0.00278 (9b) 


which yields a 5080 LT (5160 MT) displacement in sea 
water at a length of 400 ft (122 m). The transverse moment 
of inertia of the waterplane (moment of inertia about the x 
axis) [7 1S 

I,/L* = 1/26, 250 ~ 0.0000381 (9c) 
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Fig. 8. Experimental and theoretical rankings of drag in the high speed range. 





Fig. 9 shows the waterlines (constant-z lines), buttocklines 
(constant-y lines), and framelines (constant-x lines) of a 
slightly modified Wigley hull having rounded forefoot and 
stern. This modified Wigley hull is actually used, in lieu of 
the classical Wigley hull (9a), as the initial hull in the opti- 
mization example considered here. Coordinates in the lines 
drawing shown in Fig. 9, and further on in Figs. 10-13 
for the corresponding optimized hull forms, are nondimen- 
sionalized with respect to the length of the initial hull, 
which is the reference length L,.¢ hereafter. The contour 
interval in each view is specified by 6. 

The nondimensional total-drag coefficient Cy in Eq. (1a) 
is expressed as 
Cp =Cwt+ (1+ k)Cp (10) 
The wave drag coefficient Cy is evaluated using the 
zeroth-order slender-ship approximation (6) to the spectrum 
function S in the Havelock formula (5a). The friction 
drag coefficient Cp is given by the ITTC 1957 model-ship 
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Fig. 9. Lines for the modified Wigley hull taken as initial hull in the optimization. 


correlation line formula (3). The form factor k is taken 


k = 0.6VV/L3 + 9V/L? with 005<k=040 (11) 
where L and V stand for the length and displacement of the 
ship. Expression (11) is a fit of three data points given in 
Ref. [30], p. 28. 

Optimized hull forms that have the same displacement V 
and transverse moment /; as the initial hull and have a 
minimal total drag a oe are determined. Thus, only two 
constraints are considered. A_ real-world optimization 
problem would obviously involve a number of additional 
constraints, notably restrictions about principal dimensions 
(length, draft, beam), and these constraints could be 
included into the optimization process. However, an ideal- 
ized (academic) optimization problem involving a minimal 
number of constraints is purposely considered here to 
test the capability of the optimization method to gener- 
ate very different optimized hull forms. Indeed, the opti- 
mized hull forms obtained further on (and shown in 
Figs. 10-13) for three single-speed optimizations and for 
a three-speed optimization (i.e. four cases in all) are drasti- 
cally different. 

It is important that the wetted-hull surface area S“ be 
included in the objective function S“C; because significant 
changes in wetted area occur as a result of the optimization 


procedure. The downhill simplex optimization method, 
given in Ref. [31] Section 10.4, is used in this study. 

As shown in Figs. 10-13, ship hull form optimization 
may involve large hull form modifications, which must be 
represented effectively. Thus, a representation of a ship hull 
surface that is sufficiently flexible and robust to permit large 
hull form modifications, while involving only a moderate 
number of unknowns, is an important element of hull form 
optimization. 

In this study, a ship hull is defined as a NURBS (non- 
uniform rational B-spline [34]) surface. Specifically, a cubic 
surface with a 5 by 9 control net (5 control points for each of 
9 sections) is used to represent a ship hull form, which is 
assumed to be symmetric about its centerplane y = O. For 
simplicity, the surface is polynomial (control points are 
equally weighted) and has uniform knot vectors. The loca- 
tions of the 5X9 =45 control points that define the 
NURBS surface representing a ship hull are defined in the 
manner explained in Appendix A. 

For the purpose of evaluating the zeroth-order slender- 
ship approximation (6) to the spectrum function S in the 
Havelock wave drag formula (5a), a ship hull surface is 
approximated by a set of flat triangular panels. These panels 
are defined automatically using the panelization for the 
initial hull surface, in the manner now explained. A panel 
vertex is identified by values of the parameters u and v in the 
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Fig. 10. Lines for hull optimized at F = 0.250. 


(u, Vv) parametric representation of the NURBS surface 
corresponding to the initial surface. The x, y, z coordinates 
of a panel vertex in the panelization of the hull surfaces that 
are defined in the course of an optimization are evaluated 
directly from the values of the parameters u and v attached 
to the panel vertex in the initial panelization. The volume 
(displacement) and waterplane-inertia constraints that are 
considered in the optimization procedure are enforced by 
iteratively scaling the hull form along the y and (x,z) axes. 

In the hull form optimization example reported later, the 
hull is defined by 61 independent variables (as explained in 
Appendix A), and is approximated by 11,200 unstructured 
flat triangular panels. The corresponding computing time is 
approximately 1.8 s per evaluation of drag for one Froude 
number on a gigahertz processor. About 12,000—16,000 
evaluations of drag are needed until ‘complete’ conver- 
gence. Thus, 18—24h of computation are required for a 
hull form optimization over three speeds. 


7. Optimized hull forms 


Figs. 10-12 depict the waterlines, buttocklines, and 
framelines of the optimized hull forms obtained by mini- 
mizing the drag coefficient S‘C,(F) for F = 0.25, 0.316, 
and 0.408, respectively. These three Froude numbers corre- 


spond to 16.8, 21.2, and 27.4 knots for a 400 ft ship. Fig. 13 
depicts the optimized hull form determined by minimizing 
the sum 


s*(cr of oes " cr 048) (12) 
of the drag coefficients at F = 0.25, 0.316, and 0.408. Thus, 
the hull form depicted in Fig. 13 corresponds to a 3-speed 
optimization, while the hull forms shown in Figs. 10-12 
correspond to single-speed optimizations. 

Figs. 10—13 show that all three single-speed optimized 
hulls (most notably the hull optimized at F = 0.408) have 
sizable bow and stern bulbs, while the 3-speed optimized 
hull has fine ends. The lengths, wetted-hull areas, and 
displacement to length ratios of the initial hull and the 
four related optimized hulls are shown in Table 1. The 


Table 1 
Length, surface area and displacement to length ratio for the initial hull and 
the four optimized hulls 


Hull L/Liet SEs VIL 

Initial 1.000 0.144 0.0028 
Opt.250 0.793 0.116 0.0056 
Opt.316 1.073 0.137 0.0022 
Opt.408 1.064 0.143 0.0023 
Opt-3F 1.635 0.155 0.0006 
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Fig. 11. Lines for hull optimized at F = 0.316. 


low-speed optimized hull is shorter than the initial hull by 
more than 20%, while the hulls optimized for F = 0.316 
and 0.408 are slightly longer than the initial hull, and the 
3-speed optimized hull is very much longer. The hull area is 
reduced (compared to the initial hull) for all three single- 
speed optimized hulls, most notably for the hull optimized 
at F = 0.25 (for which the friction drag yields the largest 
contribution to the total drag), and increased slightly for the 
3-speed optimized hull. 

Fig. 14 depicts the variation of ou ee, CAC; and CAC; 
with respect to the number of evaluations of the objective 
function in the course of the 3-speed optimization. Here, Cr 
is the sum of the total-drag coefficients, as in Eq. (12), for 
the initial hull. Fig 14 shows that the wave drag is reduced 
first, and quickly becomes quite small compared to the fric- 
tion drag. The friction drag increases during the early phase 
of the optimization process. As the optimization process 
progresses beyond the early phase of rapid reduction in 
total drag, both the wave drag and the friction drag slowly 
decrease. During the early phase of the optimization 
process, the 3-speed optimized hull deforms into a hull 
having large, deep bulbs at both the bow and the stern. 
However, as the optimization process continues, these 
bow and stern bulbs are gradually eliminated, and the hull 
becomes longer and pointed at the ends as shown in Fig. 13. 


Figs. 15 and 16 depict the wave drag coefficient 
Cos Ls and the total-drag coefficient Csr respec- 
tively, of the initial hull, the three single-speed optimized 
hulls, and the 3-speed optimized hull within the speed range 
0.2 =< F = 0.5. Fig. 15 shows that the wave drag coeffi- 
cients Cw for the single-speed optimized hulls are greatly 
reduced at the Froude numbers for which the hulls are 
optimized (at F = 0.25, 0.316, and 0.408 for the low- 
speed, intermediate-speed, and high-speed optimized 
hulls) but increase very rapidly away from the optimization 
speeds. Thus, single-speed optimization can yield highly- 
tuned optimized hull forms that perform quite well only in a 
fairly narrow speed range centered at the optimization 
speed. 

Figs. 15 and 16 also show that the wave drag coefficient 
Cy and the total-drag coefficient Cy; of the 3-speed opti- 
mized hull remain small over a broad speed range, in 
contrast to the drag curves for the three single-speed opti- 
mized hulls. Furthermore, the drag of the 3-speed optimized 
hull is only moderately larger than the drag of the single- 
speed optimized hulls within the narrow regions where these 
hulls are optimized. Thus, optimization for a speed range 
(i.e. multi-speed optimization) appears to be much prefer- 
able to optimization for a single speed. 

Table 2 lists the values of the wave drag coefficient Cy at 
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Fig. 12. Lines for hull optimized at F = 0.408. 


Wave drag and friction drag coefficients 10° CSL. for the initial hull and the four optimized hulls 


Hull 


Initial 
Opt.250 
Opt.316 
Opt.408 
Opt-3F 


Table 3 


Cy at F = 0.25 


14.26 
0.46 


0.22 


Cy at F = 0.316 


22.98 


0.84 


0.46 


Cy at F = 0.408 


37.36 


1.78 
2.64 


Contribution of wave drag to total drag for the initial hull and the four optimized hulls 


Hull 


Initial 
Opt.250 
Opt.316 
Opt.408 
Opt-3F 


Cy/Cy at F = 0.25 (%) 


Cy/Cy at F = 0.316 (%) 
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Fig. 13. Lines for hull optimized at F = 0.25, 0.316, and 0.408. 


F = 0.25, 0.316, 0.408, and of the friction drag coefficient 
(1 + k)Cp for the initial hull, the three single-speed opti- 
mized hulls, and the 3-speed optimized hull. In this table, 
Cy and Cy are multiplied by 10° Me 

The ratio Cy/Cy of the wave drag to the total drag 1s listed 
in Table 3 for the initial hull, the three single-speed opti- 
mized hulls, and the 3-speed optimized hull at F = 0.25, 
0.316, 0.408. This table shows that the contribution of the 
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Fig. 14. Variation of drag in course of 3-speed hull form optimization. 


wave drag is reduced to near insignificance in the optimized 
hulls. Indeed, Table 4 shows that the wave drag CaS Ti 
of the optimized hulls is reduced by 93—98% in comparison 
to the wave drag of the initial hull. 

The corresponding reductions in the friction drag 


Table 4 
Reduction in wave drag for the four optimized hulls 


Hull % Reduction in Cy at F 

0.25 (%) 0.316 (%) 0.408 (%) 
Opt.25 97 
Opt.316 96 
Opt.408 95 
Opt-3F 98 98 93 
Table 5 


Reduction in friction and total drags for the three hulls optimized at a single 
speed 


Hull % Reduction in 

(1 + k)Cp (%) Cr (%) 
Opt.25 17 47 
Opt.316 5 51 
Opt.408 1 60 
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, mized hulls, and in Table 6 for the 3-speed optimized 
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(1+ k)Cp (%) Cr (%) optimized hull is increased by 7%, the total-drag Cy is 
decreased significantly (although by a lesser amount 


ee - - than for the single-speed optimized hulls). The bottom 
0.408 _7 55 row in Table 6 lists the mean of the reductions in the values 
Mean ey AT of (1 + k)Cp and Cy over the three speeds F = 0.25, 0.316, 
rs 0.408. 
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Fig. 16. Computed total drag for the initial hull and the four optimized hulls. 
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8. Conclusions 


The experimental and theoretical drags reported in Figs. 6— 
8 indicate that the extremely simple CFD tool that has been 
used to estimate the total drag of a ship, based on the ITTC 
friction drag formula and the zeroth-order slender-ship wave 
drag approximation, is able to rank the drags of a series of hull 
forms roughly in accordance with experimental measure- 
ments. The method may therefore be used, with appropriate 
caution, as a practical hull form optimization tool. 

This extremely simple CFD tool has indeed been used to 
determine ship hull forms that minimize the total (friction + 
wave) calm-water drag. For purposes of illustration, optimized 
hull forms that have the same displacement and waterplane 
transverse moment of inertia as the classical Wigley hull, 
used as initial hull in the optimization cycle, have been 
determined for three speeds and for a speed range. 

Figs. 15 and 16 show that the wave drag coefficient Cw 
and the total-drag coefficient Cy of the hull optimized for 
three speeds remain small over a broad speed range, in 
contrast to the drag curves of the three hulls optimized for 
a single speed. Thus, speed-range (multi-point) optimization 
has been found to be much preferable to single-speed opti- 
mization, which yields highly-tuned optimized hull forms 
that perform well only within a fairly narrow speed range 
centered at the optimization speed. Furthermore, Figs. 15 
and 16 show that the wave and total drags of the 3-speed 
optimized hull are only moderately larger than the drags of 
the single-speed optimized hulls within the narrow regions 
where these hulls are optimized. 

Figs. 9-13 show that the wave drag calculation method 
and the hull form representation that have been used in this 
study are robust enough to make it possible to explore very 
large hull form modifications from an initial hull (taken here 
as the Wigley hull). Tables 3 and 4 show that the wave drag 
is nearly eliminated as a result of hull form optimization, 
and Tables 5 and 6 show dramatic reductions in total drag. 

The results presented in this study confirm that, despite its 
extreme simplicity, the zeroth-order slender-ship wave drag 
approximation can be a useful tool for routine practical 
applications to hull form design and optimization, as already 
demonstrated in Refs. [14-16]. It will be interesting to 
examine the series of eight hull forms and the optimization 
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example considered in this study anew using more refined 
CFD tools than the extremely simple method that has been 
employed here. In particular, significant improvements 
upon the zeroth-order wave drag approximation can be 
expected from the approximation based on the first-order 
slender-ship approximation to the nearfield flow and the 
Fourier—Kochin representation of farfield waves, as 
explained in Ref. [18]. More refined optimization proce- 
dures can also be used, e.g. Ref. [32]. 

Although optimized hull forms that only take into account 
calm-water drag have been considered here, it 1s important 
to keep in mind that hydrodynamic optimization needs to 
take into account a number of requirements, including calm- 
water drag, unsteady motions and loads, and other relevant 
considerations such as propulsion, cavitation, and wakes. In 
particular, both motions and calm-water drag should be 
considered because design and optimization of a ship hull 
form for calm-water drag without regard to seakeeping char- 
acteristics, or minimization of ship motions without regard 
to drag, can lead to unacceptable hydrodynamic designs. 
Thus, an integrated approach that simultaneously considers 
both calm-water drag and motions should be used. 
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Appendix A. Hull definition 


As depicted in Fig. 17, a ship waterline is defined by 9 
control points, which evidently are all located in the free- 
surface plane z= 0. The locations of the 2 end control 
points, which are only permitted to move along the x axis, 
are defined by 2 unknowns (1 degree of freedom—namely a 
displacement along the x axis—for each of the 2 end 
points). The first and last of the 7 interior control points, 
1.e. the 2 interior control points that are located next to the 
end points, define the curvature of the waterline at the ship 
bow and stern within the NURBS representation. Accord- 
ingly, the first and last interior control points are presumed 


o 
hs 





Fig. 17. Plan view of control points defining a ship waterline. 
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Waterline 





Keel line 
Fig. 18. Profile view of control points defining a ship stem line. 


to have the same x coordinates as the related end points, and 
the locations of these 2 interior points are defined by 2 
unknowns (1 degree of freedom—a displacement along 
the y axis—for each of the 2 points). Finally, the locations 
of each of the 5 remaining interior control points, which are 
allowed to move within the free-surface plane z = O, are 
defined by 5 X 2 = 10 unknowns (2 degrees of freedom— 
displacements along the x and y axes—for each of the 5 
points). Thus, the locations of the 9 control points that define 
a waterline involve 2 + 2 + 10 = 14 unknowns. 

A ship keel line is similarly defined by 9 control points, 
which evidently are all located in the ship centerplane y = 
0. The locations of each of the 7 interior control points, 
which are allowed to move within the centerplane y = 0, 
are defined by 7 X 2 = 14 unknowns (2 degrees of freedom- 







+ 
{Waterline control point 


Section fullness 
control point 


Fig. 19. Body plan view of control points defining an interior station. 


displacements along the x and z axes-for each of the 7 inter- 
ior points). The locations of the 2 end keel line control 
points are defined in the manner explained further on. 
Thus, the definition of a keel line involves 14 unknowns. 
As already noted, a ship hull form is defined here as a 
NURBS surface that involves 9 sections. The 2 end sections, 
1.e. the stem and stern lines, are now considered. As depicted 
in Fig. 18, the stem line is defined by 5 control points. The 
locations of each of the 3 interior control points, which can 
only move within the centerplane y = 0, are defined by 3 X 
2 = 6 unknowns (2 degrees of freedom—displacements 
along the x and z axes—for each of the 3 interior points). 
The end point located at the waterline has already been 
considered. The location of the end point located at the 





Fig. 20. Control net used to define a ship hull surface. 
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Keel line is defined in the manner explained further on. It 
follows that the definition of the stem line involves 6 
unknowns, and similarly for the stern line. Thus, the definition 
of the stem and stern lines involve 12 additional unknowns. 

The first and last interior sections (sections located next to 
the stem and stern lines) are now considered. As already 
noted, these sections define the curvature of the ship hull 
at the bow and stern within the NURBS representation. 
Accordingly, the 5 control points that define each of 
these 2 interior sections are presumed to have the same x 
coordinates as the related control points on the stem or stern 
lines. It follows that the definition of the locations of the 3 
interior control points (the end points attached to the water- 
line and the keel line have already been considered) involve 
6 additional unknowns (1 degree of freedom—a displace- 
ment along the y axis—for each of the 3 interior control 
points that define the first and last interior sections). 

The 5 remaining interior sections are now considered. As 
depicted in Fig. 19, each of these 5 sections is defined by 5 
control points. The 2 end points, attached to the waterline 
and the keel line, have already been considered. The 3 inter- 
1or control points define the flair angle, the section fullness, 
and the deadrise angle, as indicated in Fig. 19. The locations 
of these 3 interior points is defined by 3 unknowns (1 degree 
of freedom for each of the 3 interior points) because the x 
coordinates of the interior points are presumed to vary line- 
arly between the x coordinates of the 2 end points located at 
the waterline and at the keel line. Thus, the definition of the 
5 interior sections now considered involves 5X3 = 15 
unknowns. 

All 45 control points have now been considered except 
for two control points, namely the points that are common to 
the keel line and either the stem line or the stern line. These 
points are taken midway between the neighboring points 
located on the keel line and the stem/stern line. Thus, the 
definition of these 2 special points does not involve addi- 
tional unknowns, and the NURBS representation of a ship 
hull form that is used here involves 14+ 144+ 12+6+ 
15 = 61 unknowns. Fig. 20 shows the initial 45-point 
control net for the initial hull and the corresponding control 
net for a hull form defined in the course of an optimization. 

Several constraints are imposed on the control points to 
prevent unrealistic hull form deformations. The displace- 
ments along the x axis of the interior control points that 
define the waterline and the keel line are restricted so that 
each interior point remains between its logical neighbors. 
The displacements along the z axis of the 3 interior control 
points that define the stem line and the stern line are simi- 
larly restricted to ensure that these interior points are 
between their logical neighbors. 
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W. KASPRZYK AIRFOIL. THE FIRST WIND-TUNNEL TESTS 


Tadeusz Wusatowski 


Recently Technika Lotnicza i Astronautyczna (No. 9, 1974) rie 
published an article about a slotted-flap glider airfoil and 


a glider designed by Witold Kasprzyk. Daniel 
student at the University of California, also 
the unique aerodynamic characteristics of the 
possible glider flight at 32 km/h and 0.5 m/s 
Because these data were controversial and not 





Walton, a senior 
found out about 
Kasper wing, making 
descent speed. 
officially confirmed, 


he decided to study this problem as part of his semester work at 


the University. He conducted the tests inthe Northridge low- 


speed wind tunnel, and was well aware of the fac® that model wind- 


tunnel tests could not be more accurate than accurate measurements 


made during flight. Below we publish the report on his tests and 


their results. (taken from the journal Soaring, No. 11, 1974). 


The purpose of the study was to obtain specific quantitative 


data about an interesting method for increasing aerodynamic lift 


at large angles of attack, which was possible 


thanks to the 


collaboration of a system of flaps, sustaining vortices above 


the upper airfoil surface. A rectangular wing with a 48.3 cm 


span and 15.2 cm chord was produced for the wind-tunnel tests. 


Two edge plates were placed at its ends, which were also used for 


mounting the flaps at the leading edge and the slotted flaps at 


the trailing edge. The latter were obtained by bending steel 


sheet (approximate thickness 1.6 mm) according to the shape 


shown in Fig. 1. The entire wing of the model was carefully 


polished. The tests were carried out in a wind tunnel whose test 


chamber was 71 cm wide and 50.8 cm high, with 
clasping the model. 


three points for 


*Numbers in the margin indicate pagination in the foreign text. 
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Dynamometers and an 
electronic counter were used 
to measure the 1ift and drag. 

5 It turned out that a suf- 
a ra rd ficiently high'speed of 


‘2534 5 6 + 





i flow around the airfoil for 


Fig. 1. Wing model airfoil: the Reynolds number equality 
a) pine wood elements of airfoil; 
b) flaps from 1.6 mm metal sheet; 
c) pivoting points. airfoil could not he used 


for the model and the real 


in the wind tunnel because 
of the dynamometer loads which were calibrated in the range to 
4.5 kg. In order not to overload and damage the expensive measure- 
ment system, the measurements were made at lower=speeds, which 


were 31.4 m/s and 22.2 m/S, respectively. 


A somewhat undesirable feature of the model of the wing was 
the small vibrations of the slotted flap at the trailing edge 
which occurred at small angles of attack. However from a practi- 
cal standpoint these vibrations are of no importance, because 


under normal flight conditions the flaps are closed. 


Visualization of flow was carried out during the wind-tunnel 
tests. The method of filaments giued to the airfoil surface was 
used because of the lack of a smoke generator in the wind tunnel. 
After the model was positioned at large angles of attack at which 
the occurrence of the vortex phenomenon was anticipated, the 
tunnel was put in operation and the velocity of the flow was 
gradually increased. In the range of low velocities the flow 
around the airfoil behaved in a normal manner, and at higher 
velocities, the direction of the flow changed to the opposite 
direction. Initially the point at which the direction changed 
was unstable, however, with increasing velocity, the reverse 
flow stabilized (Fig. 2). This reversed flow indicated the 
existence of a vortex behind the leading edge (as described in 


TLiA, No. 9, 1974, p. 18). Visual observation confirmed the 


existence of a similar vortex behind the trailing edge. 


Subsequent observations 
were made at a number of 
other large angles of attack, 
always with the same positive 
result. Only the velocity at /10 


which the flow changed direc- 





Fig. 2. Model in wind tunnel, visi- tion decreased commensurately 
ble filaments indicate reverse 
flow around the body on upper 

surface of wing with open flaps. attack. Next the lift and 


with increasing angle of 


drag were measured at a con- 
Stant velocity in the entire range of angles of attack (Fig. 3}. 
The model with open flaps was tested at two different speeds, the 
only difference being that at the lower speed it was possible to 
investigate the entire range of angles of attack up to 55° (on 


account of dynamometer loads). 


The results presented in the diagrams (for open flaps) show 
considerable differences in the drag and lift compared with more 
conventional airfoils (Figs. 4 and 5). An airfoil with retracted 
flaps gives graphs which are similar to those of-a conventional 


airfoil presented in Fig. 3. 


From Fig. 4 it is evident that at greater speed and with 
open flaps the wing does not tend to stall in the usual sense of 
the word. In general the drag of the wing turned out to be 
very great and did not increase parabolically with increasing 
angle of attack as in other airfoils. The mentioned region 
which lies in the range of angles of attack from 19° to 30° 
is clearly marked in Figs. 4 and 5. Figure 6 gives the gliding 
ratio of the wing, i.e. C/C,, for closed flaps at a 31.4 m/s 


speed and for open flaps at a speed of 22.2 m/s. This gives rise 


to a difference in Reynolds numbers, however, for smaller speeds 
the variation in c,/C,, could be investigated in a greater range 
of angles of attack. The optimal angle of attack for a smooth 
airfoil lies at smaller angles of attack, whereas for an airfoil 
with open flaps, at greater angles of attack. This indicates 
that a Kasprzyk airfoil meets best the requirements at greater 
angles of attack. Two numbers pertaining to the descent speed at 
great angles of attack were given beforehand, namely 1.02 m/s at 
48.2 km/h and 0.51 m/s at 32.2 km/h. From the equality c,/c,, = 
= v/w, which is important for small angles of attack, it follows 
that the gliding ratio in the given cases is 13.3 and 17.6, 
respectively. These quantities contradict the maximum value of 
the gliding ratio obtained in experiments, which-is equal to 1.6. 
{8 
During the tests Cc. mas 
2.98 was obtained at a 50° 
angle of attack, however the 
Poa fd | drag was always considerable. 
ay ee SZC | During visualization of the 
Serip> | flow it was ascertained that 


the flaps give rise to a vor- 





tex system. Flutter of the 





“04 
“S$ 0 5 10 13 4% 25 30 35 40 


aia a f] buffeting type also occurred. 
Fig. 3. Graph of C, and C, as It was assumed that the tests 
function of angle of attack for would not be extensive and 
system with retracted flaps, 
velocity v = 31.4 m/s, Re = that their purpose was to 
330,000. test a prototype airfoil in 


a wind tunnel and obtain more 
detailed information about the advantages and limitations result- 
ing from the use of an unusual system of flaps. This is how 


D. Walton summarized his study. 


Now we will try to give some thought to what the description 


of the tests which was presented above means. 


| ’ r" 
The obtained Cc. mat 


2.98 comes close to the value 





" HTS | obtained by W. Kasprzyk during 
vr | flight, i.e. 3.15. 
arainiae 

i * The excessive drag, 

2 | which resulted in a small 


see gliding ratio, was due to the 
“— “| measurement method used, 

Fig. 4. Graph of C. and C, as a namely: 

function of angle of attack, 

See eee Ssbecse | 77 Blane flow axound 

; the airfoil was not ob- 

tained in the wind tunnel 

7 because the wingspan did 

not take up the entire 


width of the wind tunnel; 


fb ~- the edge plates 
: that were used, which 


0 stabilized the vortex 
0 a 0 2 30 40 W 5D 


_ a | and are used to obtain 
Fig. 5. Graph of C, and C, as a a plane flow are too 
function of angle of attack, small for this purpose 


open flaps, flow velocity v = 


22.2 m/s, Re = 226,000. and may cause great 


induced drag, especially 
Since the maximum angles of attack at that time are about 
55° and the aspect ratio of the model wing is 3.17. These 
values give an induced drag coefficient Cy, approximately 
equal to 1 (we recall that the aspect ratio of the Kasprzyk 
wing is 15); 


—~ mounting the edge plates of the wing at a distance 
of about 10 cm from the wind tunnel walls must have caused 


increased turbulence 
and consequently also 
increased drag (this 
method is normally not 


used) ; 


-~- the flow around 
a straight wing (model) 





and a backswept wing 
2 
3 0° 5 0 15 20 25 “30°35 “40 45 50.5: (Kasper wing) cannot be 


Fig. 6. Gliding ratio of model as func- compared since a bound 
tion of angle of attack: a) retracted 


flaps; b) open flaps. vortex occurs on them, 


at which time these 
vortices differ somewhat, and the drag of the straight wing 


is greater; 


-~ the condition of equality of Reynolds numbers which 
was not satisfied (the number was too smail for the model) 
increases the drag coefficient Cy. for the model; 

-- at very large angles of attack (the most important 
angles), throttling could have occurred in the wind tunnel 
because the wing of the model set in this position consti- 
tuted a considerable part of the cross-sectional area of 


the chamber in which the measurements were made. 


The obtained results confirm only partially W. Kasprzyk's 
theory, however in no case provide further substantiation of it. 
The problem that was described is still not fully understood 
and requires further, more reliable testing in a wind tunnel 


and during flight. 
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[57] ABSTRACT 


Tail-less airplanes, such as dsciosed in U. S. Pat. No. 
3,438,597 as a stall occurs or is about to occur, may 
be flown at very high angles of attack to generate fa- 
vorable spanwise vorte, flows which augment the 
swept wing profiles creating resultant wing profiles 
having better lift characteristics, i.e., the vortexes cre- 
ated are lift generating. However, the aircraft is then 
uncomfortable to be im during such flights at very high 
angles of attack. Therefore to achieve the benefits of 
this lift generating vortex flow, without maneuvering 
such aircraft mto a very high angle of attack, the 
swept wing is equipped with airfoil structures and ac- 
cessories therefor, which are extended beyond the 
cruising speed contour of the swept wing, at lower 
speeds, to create spanwise vortex air flows which se- 
lectably enlarge the effective overall airfotl] contours 
as sensed by the passing major air flows. Sustaining 
aerodynamic lift forces are thereby created at angles 
of attack well beyond the stall angles of the cruising 
airfoil contour, and by timely use of such airfoil struc- 
tures, the vortex air flows are created soon enough for 


. lift generating, so stable flight conditions may be cre- 


ated without so extensively altering the pitch of the 
landing and/or slow flying tailless aircraft. 


4 Claims, 22 Drawing Figures 
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1 
AFRCRAFT WING WITH VORTEX GENERATION 


CROSS REFERENCE 


The Applicant, Withold A. Kasper, ilustrated and 
described his “Aircraft” in U.S. Pat. No. 3,438,597 
which issued on Apr, 15, 1969, and understandings of 
many structural components and contro] components 
of this first aircraft are pertinent to the aircraft ilus- 
trated and described herein and such information is so 
incorporated herein. 


BACKGROUND OF INVENTION 


In flying his glider embodiment of the aircraft Witold 
A. Kasper illustrated and described in his U.S. Pat. No. 
3,438,597, he observed in flight after reaching and sur- 
passing an angle of attack of 40°, that the air flow re- 
versed immediately adjacent the top surface of the 
wing. Upon such reversal he was still able to control the 
flight, flying the glider much in the same way as one 
mushes in during a parachute flight. Further flights and 
experimentations using wind flow directional devices 
indicated the formation of transverse spanwise vortices 
of air flow. Subsequently, the flights and landings: of 
birds were restudied and observations indicated the 
possible formation of transverse spanwise vortices of 
air flow on their wings. 

As a consequence of the glider flying experience and 
the continued observation of birds flying slowly and 
landing, néw wing designs, changeable in flight, were 
undertaken by him to create and maintain transverse 
spanwise vortices of air flow. After these vortices were 
generated, safe flight operations were undertaken well 
under stall speeds normally associated with fixed con- 
figurations of wings in their cruising crossectional 
forms. When the new swept wing designs were changed 
in flight, the overall aircraft did not have to be flown at 
such a high pitch attitude at the lower speeds and at 
and under stall speeds to create and maintain the favor- 
able transverse spanwise vortices of air flow. 

Fssentially during such favorable attitude of the air- 
craft at lower angles of attack with the swept wing air- 
foil structures adjusted for flight beyond normal stall, 
the created transverse spanwise vortices of atrflows ef- 
fectiveiy changed the overall wing configuration sensed 
by the entire surrounding and passing airflow. The 
changed wing form plus the vortex air flow created an 
effective wing of large profile or cross section in a form 
suitable for creating sufficient lift to permit a well con- 
trolled and safe descent to the ground. | 

This flight result occurring open creation of the span- 
wise vortices has been compared, by analogy, tn refer- 
ence to the magnus effect and explained to some extent 
using the associated studies and formulaes. After con- 
tinued experimentation, different types of changing 
wing structures are now used to start, maintain, and 
control spanwise vortices to create the resultant overal] 
wing profiles which produce the needed lift forces dur- 
ing slow and/or landing flight periods. 

Also during these earlier flights at high angles of at- 
tack, it was realized that air discharging from vortexes 
was being directed past rudders and elevons. However, 
a need remained for more effective aerodynamic flight 
controls. Therefore, in the aircraft illustrated herein, 
the stabilizer and rudder control surfaces at the wing 
tips were slanted to create additional directional air 
flow force components. Moreover, elevons were 
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equipped with tabs which moved to make their control 


_ effectiveness equally sensitive and responsive whether 
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the eleyon was pivoted upwardly or downwardly. By 
using tabs, drag is minimized on the depressed elevon 
and drag is mcreased on the raised elevon, thereby 
avoiding any unwanted yaw condition. 


SUMMARY OF INVENTION 


Aircraft, such as disclosed in U.S. Pat. No, 3,438,597 
and modified as disclosed herein are flown safely, com- 
fortably, rchably, and effectively, at many operating 
speeds and particularly ut very low speeds, even as a 
normal! stail is about to occur or after a normal stall has 
occurred, At the near norma! stall, stal! and/or under 


5 stall speeds, the airfoil structures selectively and espe- 


cially provided in a sweep wing are extended beyond 
the cruising speed contour of the swept wing. When so 
extended, they create and control transverse spanwise 
vortex air flows, one on each side of the aircraft, with 
each vortex discharging over the wing tip. flight control 
surfaces. The spanwise vortex air flows enlarge the ef- 
fective overall airfoil controus as sensed by the passing 
major air flows, thereby creating sustaining aerody- 
namic lift forces at angles of attack just beyond and 
well beyond the stall angles of a cruising airfoil contour 
swept wing. 

These extendable airfoil structures of the swept wing 
include: leading edge airfoil structures mounted so 
their trailing edges are pivoted upwardly, trailing edge 
airfoil structures mounted so their jeading edges or 
trailing edges are pivoted upwardly; and/or trailing 
edge airfoil structures, one above the other, separable 
into two trailing edge airfoil structures, the one above 
pivoting upwardly at its trailing edge and the one below 
pivoting downwardly at its trailing edgc. 

Accessories used with these extendable airfoil struc- 
tures include actuators, linkages and auxiliary air distri- 
bution systems for sclectively injecting supplemental 
air flows to enhance, control, and maintain the trans- 
verse spanwise vortex air flows. Such extendable airfoil 
structures with their accessories effectively create the 
greater lift overall airfoil flow patterns enlarged by the 
vortex formations, without reliance on or in lieu of 
nearly continuous skin airflow extended structures. As 
a consequence, this very desirable slow speed flying ca- 
pability is obtamed without substantially increasing the 
weight of the aircraft, by the otherwise addable amount 
of the weight to be attributed to such nearly continuous 
fully extended airfoil structures and their more elabo- 
rate and extensive actuators. 

In providing this aircraft capable of such safe slow 
flying speeds, additional improvements in control sur- 
faces are included. Each vertical stabilizer and each 
rudder at cach wing tip of the swept wings are slantably 
mounted. Each rudder is mounted on a hinge having a 
hinge line which ts slanted outwardly and rearwardly 
from bottom to top. In such slanted positions these air 
flaw contro] structures create air flow force compo- 
nents which are especially beneficial in controlling the 
very slow flying aircraft. 

Also to equalize the sensitivity of upward and down- 
ward contro! movements of the elevons, each one is 
equipped with an airfoil tab which beyond a neutral 
cruise position, always pivots upwardly. As the elevon 
is pivoted upwardly, the combination of elevon and this 
tab becomes more effective overall, and as the elevon 
is pivoted downwardly, this same combination of ele- 
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von and tab becomes less effective overall than the ele- 
von per se would have been, and may initially notice- 
able yaw drag effects are thereby eliminated. 


DRAWINGS OF PREFERRED EMBODIMENT 


FIG. 1 is a perspective view of the aircraft at rest, 
with all the directional control airfoils and vortex gen- 
erating and control airfoils in their neutral positions; 

FIG. 2 is a perspective view of the aircraft in flight, 
with a left rudder airfoil in its left turn producing posi- 
tion and the balance of the directiona! control airfoils 
retained in their neutral position, and the vortex gener- 
ating and control airfoils in their active positions gener- 
ating and controlling transverse spanwise vortexes; 

FIG. 3 is a perspective view of the aircraft in flight, 
with all the directional control airfoils in their neutral 
positions, and the vortex generating and control airfoils 
in their active positions generating and controlling 
transverse spanwise vortexes, with light curved lines in- 
dicating the circular motion of the air travelimg in the 
vortex and with directional arrows indicating the air 
leaving the vortexes at the respeclive swept wing tips; 


FIGS. 4, 5, and & are schematic sectional views of a 
transverse spanwise vortex to illustrate, respectively: in 
FIG. 4, a simple vortex; in FIG. 5, a step induced vortex 
which varies in size as gusts occur and as the step Is var- 
ied in its hcipht; and in FIG. 6, a confined vortex which 
is located between an inducing step and a shaving bar- 
rier is kept relatively unaffected by gusts and its center 
of pressure location remains relatively stable, 

FIGS. 7 and 8, respectively in section and in perspec- 
tive, with vortex air flow indicating lines being used in 
FIG. 7, illustrate how a wing is equipped with a leading 
edge airfoil structure which is pivoted upwardly at its 
trailing edge to become a step, thereby creating a lift 
vortex, as previously indicated in the schematic sec- 
tional views in FIGS. 4 and 5, and also creating a drag 
vortex beyond the trailing cdge of the wing; 

FIGS. 9 and 10, respectively in section and in per- 
spective, with vortex air flow indicating lines being 
used in FIG. 9, illustrate how a wing is equipped with 
a leading edge airfoil structure which is pivoted up- 
wardly at its trailing edge to become a stcp, thereby 
creating a lift vortex, and with a trailing edge atrfoil 
structure which is pivoted upwardly at its leading edge 
to become a vortex shaving barrier above to keep the 
lift vortex within stable limits and to become an entry 
below for air to beneficially circulate into the trans- 
verse spanwise lift vortex; 

‘ FIGS. 11 and 12, respectively in section and in per- 
spective, with vortex air flow indicating lines being 
used in FIG. 11, illustrate how a wing is equipped with 
a leading edge airfoil structure which is pivoted up- 
wardly at its trailing edge to become a step, thereby 
creating a lift vortex, and with two trailing edge airfoil 
structures, one above the other, the one above pivoting 
upwardly to place its trailing edge as a vortex shaving 
barrier, and the one below pivoting downwardly to pro- 
vide an entry below for air to beneficially circulate into 
the transverse spanwise lift vortex, and also to provide 
a following space in conjunction with the above trailing 
edge airfoil structure in which a drag producing vortex 
is generated with the assist of this above airfoil struc- 
ture serving as a step as tt continues to serve also as the 
shaving barrier, the trailing edge airfoil above having its 
leading edge beveled, and the trailing edge airfot! 
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below having its leading edge equipped with an up- 
standing air flow guiding flange, which both serve to 
better direct the airflow coming in from below to en- 
hance the air flows in both the lift and drag vortexes, 
and, as necessary, to further control the shaving of the 
respective vortexes, the above and below trailing edge 
airfoil structures include tabs, 

FIGS. 13 and 14, respectively in section and tn per- 
spective, with vortex air flow imdicating lines being 
used in FIG. 13, illustrate how a wing equipped with all 
the airfoil structures illustrated in FIGS. Il and 12, is 
further equipped with transverse spanwise air conduits 
to bring air under pressure substantially throughout a 
wing to feed vortexes, such as the lift vortexes; 

FIGS. 15 and 16, respectively in section and in per- 
spective, with vortex air flow indicating lines being 
used in FIG. 15, illustrate how a wing is equipped with 
a leading edge airfoil structure which is ptvoted up- 
wardly at its trailing edge to become a step, thereby 
creating a lift vortex, and with two trailing edge airfoil 
structures, one above the other, the one above being 
shorter and pivoting upwardly to place its trailing edge 
as a vortex shaving barrier, and the one below being 
longer and pivoting downwardly to provide an entry 
below for air to beneficially circulate into the trans- 
verse spanwise lift vortex, and also to provide a follow- 
ing space in conjunction with the above trailing edge 
airfoil structure in which a drag producing vortex ts 
generated, with the assist of this above airfoil structure 
serving as a step as it continues also to serve as the 
shaving barrier, the trailing edge airfoil above having its 
leading edge beveled, and the trailing edge airfoil 
below having its leading edge equipped with an up- 
standing air flow guiding flange, which both serve to 
better direct the airflow coming in from below to en- 
hance the air flows in both the lift and drag vortexes; 


FIG. 17 is a side view of the aircraft indicating with 
arrows and air flow lines how the slantably mounted 
rudder deflects air upwardly and outwardly, as It Is piv- 
oted about its hinge mounted at the angle shown; 

FIG. 18 is a top view of the aircraft indicating at the 
right wing tip with air flow lines, how the right or star- 
board near vertical stabilizer and rudder are pointed 
inwardly and yet tilted or slanted outwardly at their 
tops at their near straightaway cruise operating posi- 
tions, and indicating at the left wing tip with arrows and 
air flow lines, how the left or port near vertical stabi- 
lizer is potted inwardly and yet tilted or slanted out- 
wardly at its top, and how the left or port rudder pivots 
outwardly beyond its slanted hinge mounting to effec- 
tively redirect the oncoming air to create the reactive 
tuming component of the aerodynamic force, and also 
how it pivots inwardly forward of tts slanted hinge 
mounting so its bajance portion also act as a spoiler of 
the air flow over the wing tip inclusive of the horizontal 
stabilizer, and 

FIGS. 19, 20, 21 and 22 illustrate, in partial views, 
perspective and side, how a control surface such as the 
elevon is equipped with a unidirectional tab and its 
linkages to make the teed back sensitivity and the ef- 
fective contro! comparable for like arcuate deflections 
of the elevon either above or below the normal cruising 
contour of the wing, with FIG, 19 showing the cruise 
position, FIG. 20 the elevon up position wherein the 
tab enhances the effectiveness of the combined elevon 
and tab, FIG. 21 the elevon down position wherein the 
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tab decreases the cffectiveness of the combined elevan 
and tab, and FHG. 22 the respective major alternate po- 
sitions of the elevon and the tab wherein the tab pivots 
in the same direction whether the elevon pivots up- 
wardly or downwardly, the faired or generally cruise 
position being shown in solid tines and the up and down 
elevon positions being shown in dotted lines, 


DESCRIPTION OF PREFERRED EMBODIMENT 


|. General introduction 

In flying his aircraft as disclosed in U.S. Pat. No. 
3,438,597, Witold A. Kasper, at extreme angles of at- 
tack found it still controllable yet his positioning was 
unconfortable. Subsequently, the aircraft which he dis- 
closes herein, was developed to fly at these very low 
speeds, experienced by him previously at the extreme 
angles of attack, while still maintaining an angle of at- 
tack wherein his positioning is comfortable. All this ts 
done without detracting from the higher speed, cruise, 
landing and/or take off flight advantages realized dur- 
ing flights of his aircraft disclosed in his U.S. Pat. No. 
3,438,597, wherein the contro! surfaces were mounted 
on the swept back wing and no cxtending fuselage por- 
tions were needed to support any control surfaces. 

His former aircraft, flyable at all times without any 
need from centerline located tail control surfaces, at 
these high angle of attacks created transverse spanwise 
vortexes which he observed as sustaining his slow but 
controllable flight well beyond angles of attack, previ- 
ously and normaliy thought to indicate the loss of all 
controllable lift and/or the loss of immediate overall 
contro] of an aircraft. With this flight information cou- 
pled with the restudy of many vortices, and the restudy 
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of many birds in their flight, inclusive of their take off 35 


and landing, Withold A. Kasper now has a new aircraft 
which flies extremely well throughout all reasonable 
speeds of flight and also at lower speeds previously con- 
sidered to be unsafe, being below the previously desig- 
nated normal stall speeds. To enhance his contro! over 
the aircraft at speeds below these normal stall speeds, 
he has mounted the near vertical stabilizers and rud- 
ders at a slant, with the rudder turning on its slanted 
hinge line to create a reactive air flow dynamic control 
component which is especially useful in the slow mush- 
ing in type flight of the aircraft. Also he has used a uni- 
directional tab on the elevons to make them compara- 
tively responsive, both in control and sensitivity, 
whether moved upwardly or downwardly, avoiding any 
yaw drag effects. 

2. General Configuration of Aircraft 

As observed in FIGS. f, 2,3, 17 and #8, the aircraft 
30, has a compact fuselage 32, not requiring a rearward 
extension solely to position any control surfaces. An 
engine 34 and its propeller 36 are mounted at the rear 
of the fuselage 32. A nose wheel 38 and its support 40 
are positioned at the front of the fuselage 32. Some- 
what beyond the center of the fuselage 32, the main 
landing gear 42 has a continuous U-shaped support 44 
secured to it to pesition the other landing wheels 46, 
completing the three whee! support of the aircraft 30, 


The swept wing 50 is secured to the top of fuselage 
32 and positioned without any dihedral angle. Each en- 
tire left or port section 52 and each entire right or star- 
board section 54 are identical and the port and star- 
board sections are mirror images of one another. 
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Throughout the wing 50, airfoil structures are mov- 
able beyond the cruise control configuration. At each 
wing tip and trailmg behind are horizontal stabilizers 
56, 58. Just inboard of them are elevons 60, 62, each 
having a unidirectionul tab $4, 66. 

Farther indoard are split flaps 76, 78, having trailing 
airfoil structures 80, &2 jocated above and trailing air- 
foil structures 84, 86 located below, which respectively 
are pivoted so their above trailing edges 88, 90 raise 
and their lower trailing edges 92, 94, lower. The trailing 
airfoil structures 80 and 82 located above on the split 
flaps 76, 78, each have a lower slanted front portion 96. 
Also the trailing airfoil structures 84, 86 located below 
each have an upstanding structure 98 at their front por- 
tions. 

Again, farther inboard are additional split flap assem- 
blies 190, 162 of ike components. However as the fu- 
selage 32 is approached the downward motion of the 
airfoil 104 so located is restricted and therefore it is not 
a split flap assembly, instead being a single airfoil ratsed 
on occasions to serve as a spoiler and to shave the lift 
vortex. 

Along the leading portions of each side section 52, 
54, of the swept wing 5Q, are airfoil structures 100, 112 
pivotally mounted so their trailing edges Lid, 116,. 
raise, They are positioned as nose flaps would be; how- 
ever, vortex generation is their purpose when pivoted 
well beyond the cruise contour of the swept wing 50. 


At each wing tip 118, 120, the near vertical stabiliz- 
ers 122, 124 mounted there are pointed longitudinally 
inwardly at their forward ends and also are secured tn 
a transversely and outwardly slanted position at their 
tops. Continuing in this same orientation are the rud- 
ders 128, 130 when in their straight away faired or nor- 
mal cruise configuration. They are mounted on hinges 
which are slanted again and they only swing outwardly. 
However, their balance portions 136, 138 swing tn- 
wardly over the wing tips 118, 120 to also serve as 
spoilers. 

The many internal linkages, actuators, and power 
sources utilized in flying this aircraft 30 are principally 
considered to be derived from conventional products 
and systems. For clarity of the presentation of the 
major aspects of this aircraft 30, they therefore are not 
discussed. 

3, Generation of a Vortex to Create Lift 

After Witold A. Kasper observed wing secured indt- 
cators following air flow patterns transversely spanwise 
across the wing of his aircraft, he concluded that such 
a spanwise vortex existed. He therefore generated such 
a vortex in flight, without reaching such a high angle of 
attack, in the aircraft disclosed herein by operating 
many airflow structures. 

To better understand the respective purposes of 
these airflow structures, the schematic FIGS. 4, 5, and 
6 show in cross section the generation of different vor- 
texes where the source air is coming in tangentially at 
the outer boundaries and the discharge air is leaving ax- 
ially, the latter being indicated by arrows and flow lines 
in FIG. 3. In FIGS. 4 and 5 simple voriexes 144, 146 are 
formed behind respective obstructions 148, 150. Gusts 
are known to effectively reduce or stop them, There- 
fore in addition to regulating the height of the obstruc- 
tions, such as step 150, which aid in the generation of 
a vortex 146, a shaving or sizing barrier or step 182 is 
often used as shown in FIG. 6 to confine the location, 


y 
size, and center of pressure of a vortex 154. Following 
the presentations of these schematic FIGS. 4, 5, and 4, 
their application to the airfoils shown in the following 
figures is better understood. 

Although the aircraft 30 illustrated and described 
herein embodies the wing 5@ and its airfoils indicated 
in FIGS. 15 and 16, the other wings illustrated are used 
to meet different operating specifications and cost ob- 
jectives. For cxample the basic approach to generating 
the vortex is undertaken by using the wing 160 having 
a leading airfoil 162 pivoted so the trailing edge 164 
thereof raises as it is positioned as shown in FIGS. 7 and 
8 to create the lift vortex 166. Also a second vortex Is 
generated and referred to as the drag vortex 168, to dis- 
tinguish it from the lift vortex 166. As indicated by the 
air flow lines these two vortexes 166, 168 combine to 
create an effective cross sectional profile of a much 
larger wing which has better lift characteristics at these 
lower aircraft speeds. This vortex defined larger wing 
160 is acquired by adding only a small portion of the 
weight that would otherwise be required if this enlarged 
profile were to be established completely by adding 
only a small portion of the weight that would otherwise 
be required if this enlarged profile were to be estab- 
lished completely by airfoil structures and their ex- 
oanding actuators. : 

In FIGS. 9 and 10, wing 174 has a leading airfoil 176 
pivoted to create the lift vortex 178 which is both 
shaved and fed by a trailing airfoil 180 having its lcad- 
ing edge 182 pivoted upwardly for the shaving func- 
tion. The lower surface 184 of airfoil 18@ redirects air 
from below the wing #74 to the bottom of the vortex 
178 above. Generally in this arrangement a drag vortex 
dces not form. 

In FIGS. 11 and 12, wing 188 has a leading airfoil 190 
creating the lift vortex 192 and it is shaved and con- 
trolled by trailing airfoil structure 194 arranged above 
on a split flap 196. The trailing edge 198 of airfoil 
structure 194 pivots upwardly and its effectiveness may 
be enhanced by a tap 200 which also acts as a shaving 
device. The trailing airfoil structure 202 arranged 
below on the split flap 196 pivots to move its trailing 
edge 204 downwardly and its effectiveness is enhanced 
by a tap 206 which also acts as a shaving device with 
respect to the drag vortex 208. This drag vortex 208 is 
created upon the separation of the trailing edges 198 
and 204, Both the lft vortex 192 and this drag vortex 
208 are further controlled both by the upstanding 
structure or step 219 on trailing airfoil structure below 
202 and by the slanted portion 242 on the trailing air- 
foil structure above 194. Also the wing 188 has a 
slanted surface structure 214 to aid in guiding air flow- 
ing from below the wing up and into the bottom of the 
lift vortex 192, The overall wing profile created by op- 
erating ali these airfoil structures to establish vortexes 
192 and 208, is very large creating an effective wing of 
very high lift characteristics. 

In FIGS. 13.and 14, all the same airfoil structures are 
used. However, in this wing 214 a transverse spanwise 
tube 216 filled with compressed air distributes this air 
to feed the lift vortex 218 either at all times or at se- 
lected times, With this capability slower flying speeds 
are undertaken. 


In FIGS. 15 and 16, the wing 50 of aircraft 50 is , 


shown, It has a leading airfoil 11@ creating the lift vor- 
tex 222. Also trailing split flap 76 is used to create a 
drag vortex 226. This occurs as the trailing edge 88 of 
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the trailing airfoil structure 89 arranged above is piv- 
oted upwardly to serve both as a shaving barrter for 
controlling the lift vortex 222 and as a step for creating 
the drag vortex 226, Further confinement and control 
of the drag vortex 226 occurs as the trailing edge 92 of 
trailing airfoil structure 84 arranged below is pivoted 
downwardly. At the leading edge of this lower trailing 
airfoi! structure 84, an upstanding guidance structure 
88 is secured to direct air coming from below the wing 
§0 both back into the bottom of lift vortex 222 and on 
into the drag vortex 226. Also to better guide the air, 
the wing 5@ has a slanted surface structure 230 oppo- 
site guidance structure 93. 

4. Flight Control Airfoils to Aid Piloting During 
Slower Flying Speeds When Vortexes Are Being Gen- 
erated 

The capability of the flight control airfoils disclosed 
in U.S. Pat. No. 3,438,597 are further enhanced, as 1]- 
lustrated in FIGS. 17 and 18. In their straight away 
faired or cruise position the rudders 126, 130 with their 
balance portions 136, 138 are in alignment as before 
with the near vertical stabilizers 122, 124. Then tn ad- 
dition to being pointed inwardly as before, this entire 
group is transversely and outwardly slanted at its tops. 
Moreover, the hinges 132 of the rudders 128, 130 are 
mounted on still another slant so upon deflection of a 
rudder, on coming air is redirected upwardly and out- 
wardly as shown by the flow lines and arrows illustrated 
in FIGS. 47 und 18. 

Also as illustrated in FIGS. 19, 20, 24 and 22, the ele- 
vons 60, 62 are equipped with unidirectional tabs 64, 
66. The left side 52 of the wing 50 is shown in these fig- 
ures wherein in FIG. 19, the faired and normal cruise 
position is illustrated. In FIG. 20, the elevon 60 has 
been pivoted upwardly and the unidirectional tab 64 
has pivoted in the same rotational direction as the ele- 
von 6 to enhance the overall effectiveness of their 
combination. In FIG. 21, the elevon 60 has been piv- 
oted downwardly whereas the unidirectional tab 64 
continues to pivot upwardly to reduce the overall effec- 
tiveness of their combination. The scope of overall 
travel of the elevon 66 and its tab 64 is shown in FIG. 
22. By using these unidirection tabs 64, 66 the elevons 
69, 62 are comparably effective and responsive when 
moved either upwardly or downwardly in relation to 
corresponding degrees of elevon control movements, 
avoiding yaw drag. 

These improvements in controls over those controls 
previously set forth in U.S. Pat. No. 3,438,597 aid in 
the piloting of the aircraft 30 throughout all flight 
stages but they are particularly helpful at the very slow 
flying speeds. 

For example, in resect to the slanted or canted 
mounting or positioning of the near vertical stabilizers 
422, 124 and the rudders 128, 130, a dihedral effect is 
supplied. Also during a steep decent with the aircraft 
39 remaining level or near level, the slanted vertical 
stabilizers 122, 124, and the rudders 128, 1390, create 
resultant contro] forces as air is deflected by them. 
When either rudder 128 or 130 is deflected it is effec- 
tive about at least two motion axes of the aircraft 396. 


5. Use of Vortex Discharge to Direct Air Over the 
Flight Contro! Surfaces Located at and Near the Wing 
Tip Trailing Edges 

in FIG. 3, the cumulative effect of the transverse 
spanwise radially fed lift vortexes 222, 224 and the 
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transverse spanwise radially fed drag vortexes 226, 
228, is shown by both the overall radial air flow indicat- 
ing lines and the resultant axial air out flow indicating 
flow lines arranged to present large directional arrows. 
The latter arrows are curved rearwardly to indicate 
now the near vertical stabilizers 122, 124, and the rud- 
ders 128, 130 together with the normal overall air flow 
passing by the wings redirect the axially discharging air 
from all the voriexes. As redirected the discharging 
vortex air Dasses directly over the wing tip flight control 
surfaces: elevons 60, 62, horizontal stabilizers 56, 58, 
and the rudders 128, 130. 

AS a consequence at slower speeds when lift from the 
vortexes 222, 224 is being relied upon, the air passing 
over these wing tip controls is furnished by both the 
axial discharge air from the yortexes and from the nor- 
mal overall air flow passing by the wings. When the for- 
ward speed diminishes so the normal overall air flow is 
minimal or nonexistent, the air discharged from the 
vortexes continues to flow by these wing tip control 
surfaces so they remain effective. 

6. Summary of Some Flight Control Surface Move- 
ments to Obtain Different Fight Directions and Atti- 
tudes | | 

In U.S. Pat. No. 3,438,597 many of the flight charac- 
teristics are piven and none are withdrawn with respect 
to aircraft 36. However, to supplement those flight 
characteristics descriptions, it is now to be remem- 
beted that the slant or canted mountings of the near 
vertical stabilizers 122, 124 and rudders 128, 139, pro- 
vide the added control at very jow speeds and espe- 
ctally during decent as the aircraft remains level or 
nearly so. 

Also at anytime during all flight speeds, the canted or 
slanted rudders 128, 130, with their canted hinges 122, 
when defleted create a side force as well as a down 
force. The side force for turning is the primary func- 
tion, However this down force, being applied back of 
the center of gravity, results in a bencficial pitch up re- 
action which prevents altitudes losses during a turn. 

This rudder 128 movement, as before, places the bal- 
ance portion 136 of a rudder over the wing to serve as 
spoiler. The change in the effective lift moves the cen- 
ter of lift forward or pressure forward supplementing 
the pitch up action generated by the deflection of rud- 
der 128 as just discussed. Also this inward movement 
of the balance portion 136 of rudder 128 creates a drag 
generating a yaw reaction which is beneficial with re- 
spect to turning the aircraft 30. So, as before, all the 
movements of most contro! surfaces are additive to 
each other and often effective in a dual way to aid in 
piloting the aircraft 30. 

At times as flight continucs when lift vortexes are 
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formed, the flow of air over and near the top surface of 
the wing is flowing in the space where previously the air 
was flowing in the opposite direction at the higher 
speeds of the aircraft. Therefore there is an elimination 
of the drag previously existing at this locale and what- 
ever new drag is formed in other locations does not 
seem to match it, so a performance gain is realized as 
the overall drag is reduced. 

I claim: 

A. A wing for an aircraft comprising a main air foil 
member, a first flap structure pivotally sccured to the 
leading edge of suid member for swinging movement 
between a first position in which its outer surface forms 
@ smooth continuation of the upper surface of said 
main member and a second position in which the trail- 
ing edge of said flap structure is disposed a substantial 
distance above the upper surface of said main member, 
said flap structure being so mounted on said main 
member as to prevent flow of air between said flap 
structure and said main member when said flap occu- 
pies said second position, whereby a lift generating vor- 
tex will be formed adjacent the upper surface of said 
main meraber rearwardly of said flap structure, and 
second flap structure pivotally secured to said main 
member adjacent the trailing edge thereof, said second 
flap structure being movable between a first position in 
which it forms a smooth continuation of the trailing 
surtace of said main member and a second position in 
which it ts inclined with respect to the surface of said 
main member, to dispose a portion of said second flap 
structure a substantial distance above the upper surface 
of said main member, and said second flap structure is 
spaced rearwardly of the trailing edge of said main 
member to provide a slot for air to flow from the lower 
surface of said main member upwardly around the trail- 
ing edge of said main member and forwardly into said 
vortex. 

2, A wing according to claim 3 wherein said main air 
forl member is swept back at opposite sides of a center 
line and wherein satd first and second flap structures 
each include a pair of flaps, each flap extending out- 
wardly from the region of said center line toward the 
tip of said main air foil member, a, 

3. The wing according to claim 2 wherein said second 
flap structure at each side of said center linc comprises 
a pair of flaps, one of said pair of flaps being mounted 
for movement upwardly of said main air foil member 
and the other flap being mounted for movement down- 
wardly of said main air foil member. 

4. The wing according to claim 1 together with a duct 
system in said main air foil member fer supplymg atr 


to said vortex. ee 
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ABSTRACT 
Because of the sSweeptng claims made by Kline and Fogelman 1,2,3 
{and the resulting national publicity} for a new wedgeshaped airfoil 
with a sharp teading edge and a spanwise step discovered in paper model 
flights, parametric wind-tunnel tests were conducted on this airfoil to 
determine its aerodynamic performance. The airfoil was tested with 
variations of the following parameters: Reynolds number, step location, 
step shape, apex angle, and with the step on either the upper or jower 
surface. These resuits are compared with a flat plate and with wedge 
airfoils (without step} with the same aspect ratio. A standard NACA 
65-(10)10 airfoil as well as two delta wings were aiso tested. Water 
table tests were conducted for flow visualization, these results show 
that the flow separates from the upper surface at low angles of attack 
which is typical of thin airfoils and flat plates (leading edge sepa- 
ration). 

The wind tunnel results show that for this new airfoil the 
lift/drag ratio is lower than for the flat plate, and the pressure 
data show that the airfoil derives its lift the same way as the inclined 
flat plate. This airfoil offers no discernable advantages over the 
conventional airfoil. The observed stability of the paper model {with 
high swept back angle} is primarily due to the sharp leading edge vartex 
lift phenomenon. Thin airfoils operating at low Reynolds mumbers exhibit 


similar stability characteristics. 


INTRODUCTION 

The Kiine-Fogelman wing first caught the attention of one of the 
authors in a Time Magazine science section article 2 The claims in 
that articte include one made by Kline (an advertising art director) 
of haying somehow violated Bernoulli's principle with an airfail dis- 
covered accidentally while flying paper model planes. The article 
further claimed that Kline had “inadvertently stumbled on a whole new 
field of aerodynamics.” 

It is easy to dismiss these claims as far-fetched since Bernoulli's 
principle can be derived from conservation of energy, and thus violatic. 
of this principle could be included in the category of “perpetual motion 
machines." Curiously, a great deal of favorable national publicity 
has been given to this airfoil’, and a patent was issued”, Recently, 
piastic models of this plane have appeared in department stores, with the 
same claims, After observing a demonstration, Aeronautics Professor 
John Wicolaides of the University of Notre Dame, was quoted as being a 
"believer" in the Kline-Fogelman plane’, 

Figure | shows a chordwise section of the Kline-Fogelman airfoil. 
The claims in the patent include the sharp leading edge, straight upper 
and lower Surfaces, and the step, In Tooking at this mode] it appears 
that the stability is derived from its sharp leading edge (vortex lift) 
and, at low Reynolds numbers leading edge separation is known to be very 
stable. The step appears to serve oniy to increase the drag at low 


angles af attack. Tt is also known that the Jift/drag ratio for this 


tea 


2 


type of stable delta wing is quite low (4,5,6). 


i 


Figure 1 Kline-Fogleman Airfoil Section 


{In a private discussion, Mr. Edward C, Polhamus of Langley research 
Center, who is well-known in the study of vortex lift from delta wing air- 
foiis, revealed that he had made a preliminary study of the Kline-Fogeiman 
airfoil and concluded that the airfoil offers no particular advantage over 
conventional airfoils. 

Near the end of the test program reported herein, results were pub- 
lished of recent tests conducted on this airfoil by Messrs. Delaurier and 
Harris of Battelle’ . The resuits presented here are in agreement with 


their findings though much more extensive. 
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EXPERIMENTAL SETUP 


A. 


Wind Tunnel and Instrumentation 

The wind tunnel was of the standard open type and Figure 2 shows the 
test section with model no. 12 mounted on the balance. The balance incor- 
porates strain gages to measure lift and drag up to a maximum of 2 ibf. The 
balance was equipped with a d-c drive motor for remotely varying the angle 
of attack and a potentiometer for angle readout. In an effort to produce 
two-dimensional flow, plates were employed at the two ends of the airfoil. 
However, the top plate was ineffective due to the clearance which was nec- 
essary for free movement of the airfoil. Removal of the top plate had no 
discernable effect on the resuits. 

Figure 3 shows the instruments employed in the study. Lift, drag and 
angle of attack were read out on digital voltmeters while test section 
pressures were measured with a differential transducer and read on a 8-I¥V 
type output device. The schematic of this instrumentation hookup is 
Shown in Fig, 4. The static pressure distribution over selected models 
waS read out on an inclined manometer. A check test was conducted at The 
University of Tennessee using a 50 pound batance and an airfoii having a 16 
inch span and a 4 inch chord. The results of this test were in good agree- 
ment with those reported herein. 

A total of 14 models were tested in the wind tunne!. Figures 5 and 6 
show 10 of the 14 tested. Table I gives a summary of the geometric para- 
meters of each model. 
water Table 

Fiow visualization using hydrogen bubbles was employed with a standard 
48 in, x 18 in. Scott-Armfie!ld water table, Model 9093. Comparisons were 


made between a Kline Fogleman airfoil with a 20° apex angle and a fiat plate, 
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Test Procedure 

For each model tested, lift and drag were determined as a function of angle 
of attack for angles from 0° to 360°. This produced data with the step on both 
the upper surface and lower surface in addition to other data not reported 
herein. These data were taken for tunnel dynamic pressures of 5, 10, 15 and 20 
PSF which produced a Reynolds number range of 60,000 to 135,000. In additton, 


Static pressures were deternnined over the airfoil surface at various angles of 


attack for modeis 8, 9 and 12 which were equipped with pressure taps. 


DISCUSSION OF RESULTS 

Figure 7 shows the influence of Reynotds number. It appears that in a range 
of 60,000 to 135,000 the lift coefficient is not grossly affected by Reynolds num- 
ber variation. The results of Delaurier and Harris (7) are for a Reynolds number 
of 20,000 which is representative of "paper airplane” conditions. The influence 
of step height as illustrated on Figure 8, shows that increasing the step height 
decreases the lift at high angles of attack. As the step height becomes smailer, 
the jift curve approaches that of a flat plate. Thus, it appears that the step 
merely decreased the lift. The effect of the step on drag can be observed in 
Figure 9 which shows that the step acts to increase the acag. 

If the airfoil is turned upside down (step up position) both C, and C, iC 
increase significantly as shown on Figures 10 and 11. This 18 in agreement with 
some preliminary measurements by Nicolaides (8). If an airfoil ist operate 
continuously at an angle of attack higher than 20 degrees, the upside-down Kline- 
Fogleman airfoi] with a fairly thick step (approximately 0.25 chord) appears to 


have an advantage with a higher C, and a higher C fC, than a flat plate or the 


L 
NACA airfoil. This achievement of maximum lift at high angles of attack jis 


typical of airfoils wnich derive their lift from leading-edge vortex phenomena. 


=5- 


Figures 12 and J3 point out the influence of step location (1/4 chord, 1/2 
chord and without step). Moving the step forward increases the lift at higher 
angles of attack but decreases lift at jower angles of attack. The lift/drag 
ratio of these three airfoils remained about the same indtcating that moving the 
step to increase lift would produce a corresponding drag increase. Some tests 
were conducted with an open slot. These resuits are not presented herein, but 
are substantially the same as with the solid airfails. 

Results from a delta wing with a 30 degree taper are Shown on Figures 14 
and 15, Comparison between the stepped delta model and the flat plate delta 
model show the same behavior as was found in the rectangular planform models. 

Figures 16 and 17 show photos of the vortex patterns around a flat plate and a 
Stepped airfoil at approximately the same angles of attack. [It is quite evident 
that the leading edge separat?on and overalj flow patterns of these two shapes 


are quite similar. 


CONCLUSIONS 

From the scope of this test program it can be concluded that the stepped 
airfoil] offers little or no advantages over the conventional airfoil or flat 
plate, and in most configurations 1s decidedly inferior in terms of both lift 
and lift/drag ratio. Tests involving variations in the step geometry (joca- 
tion, siz@ and shape) indicate quite conclusively that the lift is primarily a 
leading edge phenomenon which reaches a maximum when the step is not present. 
khen the stepped airfoil is turned upside down (with the step on top of the 
airroil} there is a marked increase tn both ‘ift and lift/drag ratio but these 
are Still below the levels for conventional airfoils except at very large 
angles of attack. As noted 1n Reference 7, the pitching moments of this type 
airfoil tend to be slightly positive, which may have advantages in some appii- 


cations. 
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Wind Energy Extraction by Birds and Flight Vehicles 


Peter Lissaman* 
Da Vinci Ventures, Santa Fe, NM 87505 


When a bird or flight vehicle is oriented with a component of its lift vector aligned with 
the natural wind work is done on the flight system. Consequently, by suitable maneuvers, 
variations in wind speed can be used to add energy to the system. These procedures are used 
by albatrosses and many other birds. The equations of motion are simplified by normalizing 
by the minimum drag speed and integrated numerically for control cycle involving angles of 
attack and bank. An energy neutral cycle, by which the vehicle returns to initial velocity 
and height with no power input depends only upon the maximum lift/drag ratio of the 
vehicle and the wind speed variation. The minimum speed difference occurs for a vertical or 
horizontal step in wind speed. For a continuous wind profile a variational method is used to 
find the minimum gradient for a neutral energy cycle. Simple expressions are derived for 
the minimum wind variations for these two cases. The oceanic boundary layer, and the 
shear layer downwind of a ridge are also studied, and neutral energy wind criteria derived 
for them. Birds and small UAV’s, with flight speeds comparable to atmospheric wind 
variations, can profit from wind energy extraction. 


Nomenclature 


= constants for fixed bank trajectory 
drag 
= profile drag at cruise speed 
= induced drag at cruise speed 
= unit vector in direction of relative wind 
= maximum value of L/D 
= acceleration due to gravity 
normalized height above reference 
= maximum circuit height 
= height of wind shadow in lee of ridge 
= reference height for ridge wind flow 
= unit vector in horizontal direction 
= unit vector in vertical direction 
= ratio of lift coefficient to that for maximum L/D 
= lift 
= lift to drag ratio 
= unit vector in direction of lift 
= vehicle mass 
= normal acceleration, normalized 
= dynamic pressure normalized to that for max. L/D 
= initial value of Q 
= value of QatS 
= value of QatB 
= dynamic pressure at cruise speed. 
= normalized distance along flight path 
= inertial speed 
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Vv = normalized inertial velocity vector 


U,V,W == normalized components of velocity in inertial space 
v* = cruise speed 

Vv = velocity vector 

W = normalized wind speed 

Wo = wind speed at reference height 

Ws = wind speed at maximum circuit height 

xX = normalized horizontal distance in U direction 
Y = normalized horizontal distance in V direction 
AW = change in wind speed 

AVp = change in flight speed due to drag 

oO = inclination of drag force to the horizontal 

x = azimuthal angle 

d = angle of bank about axis parallel to wind 


I. Fundamentals of Energy Extraction and Dissipation 


Consider an unpowered vehicle (bird or sailplane) operating with an inertial speed, V, and height, h, with respect to 
some ground reference. In zero wind conditions it experiences an airspeed equal and opposite to V. Its total energy 
per unit mass with respect to the ground reference can be expressed as an equivalent height, H,, defined as 
h+V7/2g. It can climb, turn, dive and execute maneuvers, although these will be performed at the expense of total 
energy, H,. Drag acts in the direction of, and in opposition to, the motion, while lift is normal to the motion. As a 
result, drag always does work on the vehicle while lift forces cannot. Gravitational forces perform conservative 
work on the vehicle, so that if the potential energy, h, is included in the total energy then H, is always reduced by 
drag forces, or dissipated, regardless of the vehicle motion, while lift forces cannot change H,. This is a 
fundamental consideration. 


If the vehicle operates in a wind field so that the airmass has a speed relative to the ground reference, then the 
aerodynamic forces are a function of the local airspeed, the vector sum of the inertial speed and the local wind 
speed. The vehicle response is a function only of the aerodynamic forces but changes only the inertial speed. It is 
now possible for the wind to do work on the vehicle which increases its total energy, H., as described below. 


For the wings level case of climb into wind, the inclination of the relative wind to the horizontal is shallower than 
the actual climb angle. As a consequence, the lift is not orthogonal to the flight path, but inclined forwards, so does 
work on the vehicle, increasing its energy. For a downwind dive the same mechanism does work on the vehicle and 
again adds to its energy. A similar situation occurs in level flight during a banked downwind turn (“downwind turn” 
is defined as one in which flight direction changes from upwind to downwind). The lift vector is inclined towards 
the center of the flight path, and a component of the wind also acts in this direction, so does work on the vehicle. 
In the case of a level upwind turn, however, negative work is done on the vehicle. So, in the course of a general 
maneuver cycle consisting of an upwind climb, downwind turn, downwind dive and an upwind turn to complete the 
cycle work is done by the wind during the first three legs of the trajectory. A simple rule is: “belly to the breeze’, 
indicating that any orientation of the vehicle in which the lift vector is inclined in the direction the wind is blowing 
will enable the wind to do work on the vehicle. 


In order to extract energy from the wind in a circuit the vehicle must fly in regions of different wind speed during 
this cycle. A wind shear profile with wind speed increasing with height is not necessary, although this is what 
usually occurs in the earth boundary layer. 


The simplest manifestation of wind variation for dynamic soaring is that of an infinite gradient, a horizontal shear 
layer above (or below) the vehicle so that the vehicle can enter the high-speed flow with negligible change in height. 
To exploit this, the vehicle enters the wind flow from the calm, flying upwind, executes a 180° downwind turn in the 
wind, drops out of the wind back into the calm and then makes an upwind 180° to return to the start with no 
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significant change in height. This maneuver can be flown by a vehicle operating under (or above) a shear layer of 
very small thickness so that by small changes in height it can immerse itself in either stream. This is the most 
efficient way of extracting energy because no energy is lost in climbing, or diving, to seek out the energetic wind 
flow. The only parameters involved in the flight mechanics are the drag and AW, the wind speed step. 


The energy increase is estimated as follows. The vehicle, with an initial inertial speed of V in calm air enters a head 
wind of speed AW. The airspeed now becomes V+AW. The vehicle completes a 180° level turn to finish flying 
downwind at an airspeed of V+AW - AVp, where AVp 1s the speed loss due to drag. It re-enters the calm with an 
airspeed and inertial speed of V+2AW-AVp. It entered the headwind flow with an inertial speed of V, and departed 
with an inertial speed of V + 2AW- AVp. The difference in kinetic energy between these states is the work done on 
the vehicle by the wind minus that due to drag dissipation. For the return leg of the upwind turn, the loss is again 
approximately AVp so that the final speed change in the cycle is 2; AW-AV)), illustrating that the work done by the 
wind must balance that lost in drag for no energy change in the cycle. 


Another energy extraction case due to flow speed differences occurs when there is a region of higher speed flow 
horizontally adjacent to the calm with a thin shear layer in the vertical plane separating the airmasses. This is a 
vertical shear layer, such as occurs downwind of the sides of an obstacle at the edge of the wake. In this case the 
vehicle flying into the wind from the wind shadow, drifts laterally to enter the headwind, climbs, gaining energy and 
then moves laterally back into the adjacent calm where it can execute 360° turn and dive to return to the original 
state with increased energy. 


A common natural flow state is that of the planetary boundary layer (PBL), where wind speed increases with height. 
In order to enter the energy bearing higher speed flow, the vehicle must climb, which involves penalties in increased 
drag and length of flight path. The simplest model is to postulate a linear wind profile with a uniform shear, s, as the 
driving parameter and the maximum lift/drag ratio of the vehicle as the energy loss parameter and to determine the 
minimum s for a given G. The least restrictive cycle constraint is to require only that the original velocity vector 
and height are regained. Normally the vehicle will not return to its inertial space starting point at the end of a cycle. 
In this case, because the cycle is not closed, we refer to it as a “loop”. 


II. Normalization of Equations of Motion 


An unpowered vehicle is subject to aerodynamic and gravitational forces. The aero forces are a function of the 
angle of attack and airspeed. The response following Newton’s Law can written: 


Mdv/dt =D’d+L’l +Mek (1) 


Where M is the mass, v the vector velocity in an inertial frame, D’ the drag parallel to the relative wind with d the 
unit vector in the direction of airspeed relative wind and L’ the lift normal to the relative wind and in the plane of 
symmetry of the vehicle, with I the unit vector in this direction. The unit vector k is vertical. 


The equations can be normalized by the cruise speed, V*, defined as the speed at which the vehicle has its minimum 
glide angle, or maximum L/D. This maximum is called the glide ratio, and denoted by G. Dynamic pressure for this 
state is q* and at this dynamic pressure the profile drag and induced drag are given by D*,, D*; with D*, = D*;_ 
The minimum drag is given by D*, + D*; . The ratio of dynamic pressure to that at minimum drag, q*, is defined by 
Q (= q/q*), while the ratio of lift coefficient to that at minimum drag is given by L. Time is normalized by V*/g, 
and length by, V*’/g. Dividing by weight, W (= Mg), and normalizing provides for the non-dimensional vector 
acceleration: 


dV/dT ={ D’/W}d +{ L’/W}1 +k (2) 
The drag, D’, consists of profile drag, proportional to dynamic pressure, and given by W{D*,Q}, plus induced drag, 


proportional to dynamic pressure and lift coefficient squared, and given by W{D*,;L’Q}. The lift L’, is proportional 
to dynamic pressure and lift coefficient and given by W{LQ}. Substituting the above provides: 
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dV/dT = Q[{(1+L’)/2G}d+L]] + k (3) 
The normal acceleration, N is given by N = LQ. 


The above equation defines the vehicle motion, and it is seen that the only parameter is G, the maximum lift-drag 
ratio. Controls are exerted by L, which is a function of angle of attack, and controlled by elevator (or similar tail 
aerodynamic mechanism for birds) and 9, inclination of the lift vector, controlled by ailerons or wing twist, which 
dictates the bank angle. 


The normalized relative wind, W, is given by W, = V + Wi assuming the normalized inertial speed of the wind is W 
and its direction is in the horizontal, 1, direction, that is orthogonal to the vertical denoted by k. From this vector the 
direction and magnitude of Q can be calculated. 


Generally, practical limits are imposed on Q by structural factors relating to the never-exceed speed of the vehicle 
(about 2.5 for modern sailplanes), on L by aerodynamic factors relating to maximum lift (about 2.0 for modern 
sailplanes), and on N (=LQ) by structural factors relating to wing bending (about 5 for modern sailplanes). It is 
likely that all these limits are lower for birds than for those of mechanical aircraft. 


III. Fundamental Sachs Solution 


Sachs (1994) has set the fundamental problem of determining the minimum linear shear required for a vehicle to fly 
a cycle which returns to the same height and speed vector, but not the same inertial location. This is called an 
“energy neutral” loop. A variational procedure was employed using the exact equations of motion. This produced 
an optimal bank and lift routine as a function of wing loading and G. This energy neutral trajectory requires extreme 
maneuvers — dynamic pressures vary by a factor of 16, G loads, defined by N, vary by a factor of 8, drag coefficients 
by about 4, while bank angles go from zero to 80°. This loop is of importance in establishing an exact lower bound 
for energy neutral cycle operating in a linear shear. As noted in the following paragraphs, although this loop is 
optimal in requiring the minimum gradient, the cycle requires a long time to execute, a lengthy flight distance, large 
height gain at the upwind end and a significant downwind drift for each loop. 


IV. Development of Equations of Motion 


General Equations in Inertial Frame 


The equations derived above in vector form are re-written in scalar form in an inertial orthogonal Cartesian 
coordinate system fixed in the earth. The normalized inertial horizontal, lateral and vertical velocities, U, V, W, 
modified for various inclinations, become: 


2 ys 1 asa y = Gacy asa sine) (4a) 
aT 2G 
ie e070 cosasiny— nasi cso — cosa) (4b) 
aT 2G 
dW 1 ; , 
— =—_|0+N°/Q)sina—Ncosacos@+l 4 
aT wry 0) ? 


Here the symbols are all non-dimensional with their conventional meanings. 


The non-dimensional airspeed squared, Q, is defined as: 
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OQ=(U+W*) +V?+W° (5) 
where W*(H) is the normalized wind at the appropriate height. 


The orientation of the vehicle is in general three dimensional and defined by three angles, a, y and @. The angle a, 
the inclination of the drag force to the horizontal, identical to the angle of the relative wind to the horizontal, 1s 
defined by: 


sina=W/JO (6) 
The azimuthal angle, y, is given by: 
tany=V/(U+W*) (7) 


while the bank angle, 9, is defined as roll about an axis parallel to the relative wind with zero being wings level. 
The bank is positive in the clockwise sense viewed from the rear (right wing down) using a right hand screw rule 
oriented with this axis. 


The inertial position, X, Y, H is defined by 
dX/dT =U, dY/dT=V, dH/dT =-W (8) 


where H represents height measured positive upwards, in the opposite direction to Z, which is taken downwards for 
this coordinate system. The above equations, 4a,b,c are compact expressions of the dimensional equations used by 
Sachs. 


The above set was solved, using fourth order Kutta-Runge, for an arbitrary lift ratio, L and bank, 0, schedule. The 
lift and bank controls were specified at twenty equidistant time intervals on the cycle and the integration was 
performed using 400 equal time intervals per cycle. It is useful to compute the total energy, H,, at each step to 
check the accuracy of the numerics. Setting the wind speed to zero for any time step must give a negative change in 
H. 


Analytic Solutions 
The equations above can be integrated exactly in the case of no wind for some special cases. Two particular cases 
are those of wings level constant angle climb or dive and of constant bank angle, constant height turn. These are 


useful for checking numerical methods to determine step size in the numerical procedure for acceptable accuracy. 
They can also be patched together as approximations of smooth trajectories. The solutions are listed below. 


Wings level, constant N 


For the case of an unpowered wings level climb or dive at an angle of 0, the normal acceleration, N, is constant and 
given by N =cos@. The equations of motion can be integrated analytically to provide the dynamic pressure Qs after 
a distance S from the initial state of Qp by the expression: 


(OQ, _ a)* (OQ, 7 by’ = (Q, _ a)* (Qo 7 bere. (9) 
Here a, b are defined as the roots: 
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a,b =—Gsin@ +./{Gsin0}" —cos@” (10) 


For the level flight case (0 = 0) the above expression degenerates, but using the basic equations gives the final 
dynamic pressure as: 


(0. +1)=(Q,” ihe 1) 


The final dynamic pressure is a function only of flight path angle, 0, distance flown along the flight path, S, and 
glide ratio, G. 


Constant height, constant N 


For the case of an unpowered constant height turn at constant bank angle, o, and constant N the equations of motion 
can be integrated analytically to provide the dynamic pressure Qg after a turn through the azimuth angle B by the 
expression: 


Qz =N tan{[atan(Qo /N)] - B/[Gsing] } (12) 


Here Qo is the dynamic pressure ratio at 8 = O and Qg the value at B. The final dynamic pressure is a function only 
of normal acceleration, N, turn angle, B, and glide ratio, G. 


V. Validation by Comparison with Sachs Open Loop Optimal Case 


The Sachs (1994) result for optimal trajectory is not a closed circuit in inertial space. As shown in Fig. 1, it is a 
hairpin-like open loop where the vehicle starts flying cross wind horizontally, banks into wind climbing to 
maximum altitude, performs downwind 180° turn, dives towards original height and then banks to return to 
crosswind flight. This figure is drawn to scale for a reduced length scale of unity, and G of 45. 





Fig. 1 Typical trajectory for energy neutral loop in linear wind shear. 
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The vehicle returns to its original kinematic velocity state and height, but not the original position. It covers ground 
in a crosswind direction and drifts downwind. The case analyzed by Sachs for a vehicle of 35 kg/m’ wing loading 
and G = 45 maximum lift to drag ratio was repeated using the present formulation. The bank angle and lift ratio 
schedule quoted by Sachs was used to exercise and validate the present code. Sachs defined the vehicle by wing 
loading and G, and does not quote cruise speed. To check the present code cruise speed was varied until the loop 
matched that given by Sachs. A good match was obtained for a speed of 26.8 m/s. The shear gradient, s, for an 
energy neutral loop was 0.0324 per second, normalized by cruise speed, this gives the non-dimensional shear, S, of 
0.089. The control schedule was that used by Sachs and is shown as Fig. 2. The “clipped” top of the L schedule is 
caused by Sachs imposing a Cy max limitation on the control. Principal characteristics of the loop are a circuit period 
of 8.9 time units, maximum speed of 3.0 speed units, a maximum bank of 68°, a total climb of 4.5 units, a crosswind 
displacement of 4.5 units and a downwind drift of 1.8 units 











Fig. 2 Control schedule developed by Sachs. 


As a consequence of the normalization the critical shear, S, can be a function only of G, so that S = S(G). It is of 
interest that by using this normalization all of Sach’s data, for all vehicles in all linear shear flows, can be 
approximately collapsed to the simple form expressed by the equation: 


S = 4.00/G (13) 


Sachs results are for G = 20 — 80. The above expression is a good approximation for G within this range. A more 
precise determination of the function S(G) could be made by using the variational method to compute S for G values 
below 20, and refining Eq.13. 


Open Loop, G = 25, Linear Wind Shear 


Modern sailplanes have glide ratios in excess of 40. No bird performs as well. A reasonable G value for an 
albatross is about 25 with a cruise speed of 15 m/s. This G value is used in all following calculations so that they 
will be applicable for comparison to observed performance of large birds. For the open loop G = 25 case the bank 
and lift ratio schedule developed by Sachs for G = 45 was used. Surprisingly, even for G = 25, this gave an 
adequate control schedule for an energy neutral open loop at an S value of 0.16, as predicted by the above equation. 
The loop has a shape similar to the G = 45 case. Principal characteristics for this glide ratio, (with the G = 45 result 
is shown in parenthesis) are a circuit period of 9.0 (8.9) time units, maximum speed of 3.0 speed units, a total climb 
of 4.8 (4.5) units, a crosswind displacement of 5.8 (4.5) units and a downwind drift of 3.2 (1.8). 
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The control schedule, shown in Fig.3 can be a function only of G. It is, in fact, almost independent of G, as shown 
in numerical results quoted by Sachs for different G cases. The optimal control schedule is not very sensitive to 
small changes. An ad hoc sensitivity study was made to determine the precision required for the bank and lift 
schedule. It was found that a very simple, but smooth, schedule, arbitrarily selected to consist of Fourier cosine 


series with only a few terms (three for lift ratio, L, two for bank angle, @) gave almost identical final performance. 
The two schedules are shown in Fig. 3. 











Fig. 4 Ground tracks for open loop energy neutral trajectories 
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The ground tracks for two open loop canonical cases, G = 45, 25, with a linear profile are shown in Fig. 4. The loop 
satisfies the energy neutral condition, but it is not necessarily the optimal control routine. 


Closed Circuit, G = 25, Linear Wind Shear 


As has been noted, an alternative, but more demanding, objective is that the cycle should return not only to the same 
inertial speed and height state but also to the same inertial position. We call these closed cycles “circuits”. This is a 
more restrictive constraint, and requires a stronger shear than the simple energy neutral “loop”. The circuit yields 
more options for navigation in any arbitrary direction. For example, the circuit will permit orbiting about a given 
inertial position indefinitely, while the minimal energy neutral loop would involve a steady downwind drift. 


A trial and error procedure was used to define a control schedule for such a circuit. The solution indicates that a 
shear value of 0.17 1s required. The open loop requires 0.16. 


Principal characteristics of the circuit were a circuit period of 8.6 time units, maximum speed of 3.0 speed units, a 
total climb of 4.5 units, a crosswind displacement of 0.0 units and a downwind drift of 0.0. This cycle, which 
provides more range access for the bird, does not require a significantly higher shear than the minimal loop. 


The control schedule was first approximated by adjusting the schedule by hand and observing the result. This gave 
a circuit which matched the closure conditions but involved a somewhat irregular schedule. A solver code with 40 
unknowns (Lift ratio and bank at 20 stations) was then constructed and a solution obtained for eight conditions (U, 
V, W, X, Y, H, L, ) returning to T= O state. Because there are many more variables than constraints, this can be 
satisfied by an infinity of schedules, but the apparently smoothest schedule was selected. 











Fig. 5 Comparison of Control Schedules for two energy neutral circuits 


It is of interest small variations in the schedule will still give a satisfactory solution, as is the property of an 
extremal. The schedule selected cannot be the theoretical optimal, but must be close. The final smooth schedule, 
called Schedule 7, is used here. The differences between it and the first ‘ad hoc’ solution, called Schedule 1, are 
shown in Fig. 5. Both schedules produce energy neutral circuits. 


The fact that an acceptable energy neutral circuit can be obtained from a fairly crudely controlled schedule is of 
interest in dynamic soaring by birds, suggesting that extreme precision, which could not be achieved by the bird in a 
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natural, variable wind shear, is not necessary. Temporal variations in wind profile must be particularly trying for the 
bird to handle. The period of a circuit is about 10 so that for a bird with cruise speed of 15 m/s the period is about 
15 sec., and it becomes necessary for the bird to control its trajectory anticipating a likely wind profile that will 
occur about 7 1/2 seconds in the future. 


Closed circuit, G = 25 in Step Shear 


A fundamental wind shear model is that of step shear, expressed as AW and discussed qualitatively in the first 
section. The existing codes were used to exactly compute a cycle of this nature to give quantitative results. The 
circuit analyzed here is not optimal, but it is a possible flight path and provides an upper bound of the AW required. 
The procedure was as follows. 


The optimal bank was determined for a 180° constant height turn as a function of entry speed. Such an optimum 
exists because a shallow bank will expend less energy per unit distance, due to reduced induced drag, but will 
require a longer distance to complete the turn. The downwind turn takes place in the wind field, starting upwind. 
After the 180° turn the bird enters the zero wind field at the same inertial speed, and selects the optimal bank for a 
180 upwind turn. A short straight and level segment is required to regain the ground lost during the downwind drift, 
to return to the starting point. The wind speed is selected so that the final speed returns to the initial value. This 
process was iterated over a range of flight speeds to determine the minimum wind step, AW, required for a complete 
circuit. For G = 25, the wind step, AW is calculated to be 0.19. For other levels of G the wind step, AW, was 
calculated numerically using the above procedure. The results can be expressed to a good approximation by: 


= 4.75/G (14) 





Fig. 6 Ground tracks for fundamental open and closed trajectories 
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Ground tracks for these loop and circuit trajectories are shown in Fig. 6. Examination of this figure shows the 
striking result that the length of the flight path, and time of cycle is much shorter for the step shear case than the 
period for the linear shear case. 


Circuits with Planetary Boundary Layer Wind Profiles. 


Because of mixing in the lower layers of the atmosphere, the PBL normally has a profile in which the speed 
increases while the shear gradient reduces as height increases. The actual natural profiles vary widely, depending on 
heat flux, surface roughness and wind speed itself. Consequently, there is no particular characteristic profile to 
choose for fundamental calculations. Two possible, but not general, profiles,are chosen for illustration: ridge flow 
and oceanic flow. 


Ridge flow is the profile created by an obstacle to the flow, where the vehicle emerges from a zero speed wind 
shadow into a typical separated PBL. This occurs naturally when a vehicle climbs from downwind and below a 
ridge into a normal wind flow, or in the separated flow in the lee of an ocean wave. This is a practical 
approximation of the ideal step shear case already discussed. The important feature of this type of flow is that very 
extreme wind shears occur, and the vehicle can make its return path sheltered by the wind shadow, thus pays no 
penalty of downwind drift or of energy loss during the upwind turn. This wind model has recently been studied by 
Sachs and Mayrhofer (2002). They used a wind profile of the form: 


W = 0, H<Ho 
W = W,{(H-Ho)/H,}", H>Ho (15) 


where m = 0.2 and Hy = 0.097 and H, = 0.097 was selected. Ho represents the vertical extent of the calm below the 
ridge in which the vehicle makes the upwind turn. At the scale used by Sachs and Mayrhofer this is about 10 m. 
Results are shown in Table I. For the high performance vehicle of G = 45 it is noted that there is a small 
discrepancy between the results of Sachs and Mayrhofer and those of the present paper. The reason for this 1s 
unknown, but may be in numerical round-off errors. Sachs and Mayrhofer do not provide a solution for the G = 25 
case, which has been calculated by the present method and is shown in Table I below. 


Table I Speed Parameters for Ridge Flow 


G H, W. Wax Source Profile 
45 0.097 0.097 0.114 Sachs &M_ Ridge 
45 0.097 0.070 0.104 Present Ridge 
25 0.097 0.133 0.198 Present Ridge 


A possible PBL profile above the ocean for some flow states can be represented by the Sachs model with Ho = 0. 
The present model was exercised for this for the case G = 25, and provided the loop having Ha, = 0.80, with the 
other parameters shown in Table Hl. This provides a trajectory and performance between that for the linear profile 
and the step shear, as is expected. 


Table Il Speed Parameters for Oceanic Flow 


G H, W. W imax Source Profile 
25 0.00 0.163 0.249 Present Oceanic 


Maximum Wind Speed for Basic Profiles 
Four types of profile are modeled to provide basic flow cases: step shear, linear shear, ridge flow and oceanic flow. 
The first two are mathematical ideal cases, the latter represent idealizations of real flows. Energy considerations of 


these types are discussed below. 


Energy addition always occurs through coupling with the wind speed during the downwind turn consequently it is 
the maximum wind speed that is the fundamental criterion for an energy neutral circuit. The vehicle must climb in 
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order to reach this energy bearing flow. But drag energy is expended during the cycle so that the required 
magnitude of the upper level wind depends on how much height (and drag loss) must occur in reaching it. 


Minimum upper level wind speed occurs for the case of a step change in speed, a vertical sharp-edged gust, AW. 
The next most favorable case, ridge flow, is similar, where the return loop occurs in calm conditions, under the wind 
shadow of an obstruction. For a normal PBL, as experienced by oceanic birds, the situation is slightly less 
favorable, since part of the upwind turn is made in lower level wind. For the linear shear the most unfavorable 
condition occurs. These four cases have been calculated, all for G = 25 and for a neutral energy circuit. They are 
summarized in Table III below. 


Table If] Maximum Wind Speed and Height for Basic Profiles 


G H, H. / Profile 

25 0.000 0.00 0.167 Step Shear 

25 0.097 0.81 0.198 Ridge Wind 
25 0.000 0.80 0.249 Oceanic Wind 
25 0.000 4.50 0.757 Linear Shear 


The table shows the height at which the wind flow starts, Hp, the maximum height, H,,,, and the maximum wind 
speed, Winax, required for an energy neutral circuit. The difference between the first two cases is not extreme. For 
linear shear a large height change and wind speed is required to extract the requisite energy. Oceanic flow lies 
between these cases. Since operating conditions for birds and gliders more closely match the first three cases, it is 
apparent that energy extraction from natural winds and ridge conditions requires less extreme profiles than are 
predicted using the linear model. 


re HT. OF TRAJECTORY 
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z 
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Fig. 7 Wind profiles for basic cases 
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The wind profiles for the basic cases are shown in Fig. 7. It is seen that the cases of ridge flow and oceanic flow 
require quite modest wind speed changes, and small changes of height through a cycle. For example for an albatross 
of the characteristics specified to perform in an energy neutral cycle in an ocean boundary layer the wind speed 
would have to rise from zero at sea level to about 3.7 m/s at 18 m. 


VI. Conclusions 


The basic dynamics of energy extraction from wind variations is described. The wind will do work on a vehicle if 
there is a component of the lift in the direction of the wind. For the most extreme wind gradient, a step, the required 
step magnitude, AW, is equal to the speed loss of the vehicle in a 180° constant height turn. The equations of motion 
are normalized using cruise speed (speed for minimum drag) and the acceleration of gravity. In the normalized 
form, the only parameter required to describe the vehicle drag is the maximum lift/drag ratio, G. The control 
variables are the lift coefficient and the bank angle, which, for the classical aircraft, may be selected by elevator and 
aileron and, in the case of birds, by their ornithic flight control system 


For the case of a linear wind profile the trajectory for absolute minimum wind shear is an open loop, called an 
energy neutral circuit, which returns to original speed and height, but to an inertial position displaced crosswind and 
downwind. For this case the minimum normalized gradient, S, is given by 4.00/G, while for the step gradient the 
speed change, AW, is given by 4.75/G. 


By testing various control schedules, it is found that the exact optimal schedule for minimum wind shear, as 
developed by Sachs, need only be approximated. 


Other wind flows are studied, including that of a closed circuit where the vehicle returns to its original height and 
speed and inertial position. This requires a slightly higher gradient than the open loop. Other wind flow cases 
involve the energy neutral loop in a boundary layer-like profile typical of an oceanic flow and in one characteristic 
of the flow downstream of a ridge. In terms of the maximum wind required for an energy neutral cycle the lowest 
wind speed case is the step, followed by the ridge, followed by the oceanic and finally by the linear shear, which 
requires about four times more wind speed variation than the step case. 


The modest wind differentials, and relatively crude control schedules required for the cases of ridge-like flow and 
oceanic flow suggest that these loops and circuits are a fairly simple routine for birds to use in extracting energy 
from the wind, and that natural wind flows of sufficient speed differential are ubiquitous. 


Results from the above conclusions are summarized below: 


1. Fundamentals of wind energy extraction by flight vehicles are described 

2. The equations may be normalized in terms of only maximum L/D, defined as G, and cruise speed, V* 

3. Minimum wind variation for energy neutral cycles, a vertical step in speed, AW, is given by: AW = 

4.00/G 

4. Minimum linear shear gradient for energy neutral cycles, S, is given by: 
S = 4.75/G 

5. Solutions for natural wind profiles characteristic of oceanic and ridge flows are given 

6. Optimal control schedules have been developed, but performance is not sensitive to exact adherence to 
these schedules. 

7. Oceanic wind profiles require quite modest wind speeds for birds with performance characteristics of 
the albatross, and these winds are common over the oceans. 
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Aerodynamic performance of albatross-inspired wing shape for 
marine unmanned air vehicles 
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Abstract 


The wing shape of any flying object has an important role in its flight performance. When studying nature, 
it is found that albatrosses have the highest flight performance which allow these birds to fly thousands of 
miles, even though they are the largest migrating bird. To study the effects of the wing shape on the 
aerodynamic performance of fixed wing drones, at first the wing of an albatross is patterned and compared 
to other inspired wing shapes as well as standard geometrical wing shapes, such as rectangular, elliptical, 
and delta wing shapes. Considering similar wingspan, wing area, and aspect ratio, an aerodynamic analysis 
is performed in the same conditions (airfoil and flight speed). The 3D-Panel and Horseshoe Vortex Lattice 
Methods are applied for all the wings. The results show that for lower angles of attack, the albatross wing 
shape generated better flight performance (lift to drag ratio) than other geometric shapes. Also, in another 
study, an analysis 1s carried-out to compare the albatross wing shape with other migrating birds, such as the 
artic tern, the sooty shearwater, the golden eagle, the great white pelican, and the whooper swan. It 1s found 
that albatrosses that have the ability to fly these long distances through soaring flight because of their higher 
lift to drag ratio, when compared to other migrating birds. It should be noted that this study can provide the 
guidelines for the design of efficient marine drones. 











I. Introduction 


Nowadays, there is a growing need for fixed wing Unmanned Air Vehicles (UAVs) with different 
capabilities for both military and civilian applications'. There is also a significant interest in the 
development of new concepts for UAVs which can perform various missions autonomously in different 
environments and locations, such as marine and space explorations'®. In the past decade, the optimization 
and performance enhancement of these UAVs have received most attention which led to the invention of 
some new wing shapes’. The efficiency of a bio-inspired UAVs or micro unmanned aerial vehicles (uUAV) 
is largely based on the type of flying as well as the wing shape*. The wing shape greatly affects the 
aerodynamic performance. Shape of the wing or planform is the one of the main factor affecting the flight 
performance of the fixed wing drones’. There are various wing planforms that are derived from geometrical 
shapes, such as rectangular, delta, tapered, and elliptical wings. With the increasing number of applications 
of the drones, there is a demanding need of creating a higher performance using the fixed wing drone®. A 
great place to start at creating better performance fixed wing drones is by studying natural flyers’. Since 
birds have a better flight performance than other flying organisms, inspiring from them can be beneficial 
for the design of bio-inspired drones®”. 

One aspect of the birds’ features, is the wing shape. Generally, birds with different flight modes have 
various wing shapes’. The wing shapes of the birds are classified in six configurations: (1) tipped and roll- 
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back wing for hovering (hummingbird), (2) short, broad, and cupped wings that are used for quick takeoff 
and short range of flight (northern mockingbird), (3) short and wide wings with cracked primary feathers 
that are used for soaring (turkey vulture), (4) thin, flat, and triangular wings that are used for high speed 
and maneuverability (swallow), (5) large and arched wings for flapping flight (herons), and (6) tipped, flat, 
high thin wings for soaring long distances (albatross)’’. Albatross which are flying in the boundary layer 
of the oceans are able to fly thousands of miles by performing soaring flight'”'’. Dynamic soaring is 
generally a flying technique which is applied by these birds to gain energy by repeatedly crossing the 
boundary between air layers with significantly different velocities'’. To advantage of this dynamic soaring 
the design efficient marine fixed wing UAVs, the wing shapes of albatrosses are studied to understand the 
high performance of these soaring birds". 

In this study, the wing shape of an albatross is extracted, modeled, and the aerodynamic performance 
of the inspired wing is investigated. 3D-panel method is carried-out to determine the effects of the wing 
shape on the lift and drag forces for various angles of attack. To show the performance of the selected wing 
shape by nature, albatross wing shape is compared to various other shapes, such as rectangular, elliptical, 
delta, and two modified albatross wing shape with different offsets. Also, the wing of the albatross 1s 
compared to other birds’ wing shapes, such as Artic Tern, Great White Pelican, Golden Eagle, Sooty 
Shearwater, and Whooper Swan. 





II. Shape modeling of the albatross’s and geometrical wings 


Since individuals of different sexes and populations of albatrosses have different sizes, in this work, 
the birds with maximum possible wingspans are considered. The shape of Wandering albatross (Diomedea 
exulans) which is considered as one of the largest of the tubenose seabirds with wingspans reaching up to 
3.5m, is patterned'®. These birds have the ability to travel 10000km by taking advantage of dynamic soaring 
and having a long, thin, flat wing that is used for this flight mode’. The low-cost flight of this migrating 
bird has attracted the attention of drones’ designers in order to propose efficient and high performance 
marine drones. Due to the high flight performance of the albatross, the wing shape of these birds are first 
patterned. In Fig. 1, a view of albatross’s wing and the patterned schematic are shown. 





Figure 1. Views of (a) Wandering albatross, (b) albatross’s wing, and (c) patterned albatross’s wing. 


The inspired wing shape shown in Fig. 1 has a wingspan of 3.4m and aspect ratio of 11.5. In order to 
study the impacts of the wing shape of the albatross wing on its performance, five wing shapes including 
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rectangular, delta, elliptical, albatross wing with zero offset, and albatross with offset are defined. It should 
be noted that all the considered wings have similar wingspan, aspect ratio, and consequently same wing 
area. Since the objective of this study is the wing shape, similar conditions are considered for all the wings. 
In Fig. 2, schematic views of the defined wings are shown. 


(seometrical wing shapes Albatross inspired wing shapes 


Elliptical Albatross with offset 


Albatross without offset 


Kectangular 





Figure 2. Views of geometrical and albatross inspired wing shapes. 


HI. Aerodynamic modeling and convergence analysis 


In order to study the aerodynamic performance of the shown wings in Fig. 2, XFLR5 software is used 
to calculate the generated lift coefficient, drag coefficient, and lift to drag coefficient versus angle of attack 
for similar flight conditions. Aerodynamic analyses are carried out based on four thoeries, namely, Lift Line 
Theory (LLT), Horseshoe and Ring Vortex Lattice Methods (VLM), and 3D-Panel method. These methods 
are implemented and compared to each other. 

In the lift line theory, it is assumed that the relation between the lift coefficient and the angle of attack 
is linear and the viscous effects are not taken into account. In this theory, the lift of the wing is replaced by 
a lifting line where the incremental vortices shed along the span trail behind the wing are in straight lines 
in the direction of the freestream velocity. This theory assumes implicitly that all the surfaces lie essentially 
in the X-Y plane, sweep, and dihedral angles are not used in the calculation of the lift distribution. 
Horseshoe and Ring vortex lattice methods are applicable to any usual wing geometry, including those with 
sweep, low aspect ratio or high dihedral angles, including winglets. In the VLM method, the lift distribution 
and the induced angles can be calculated, and the induced drag is inviscid and linear. In other words, it 1s 
independent of the wing's speed and of the air's viscous characteristics. Generally, the principle of VLM 
method is to model the perturbation generated by the wing by a sum of vortices distributed over the wing's 
planform. This method computes the lift coefficient and the other values which may be calculated by 
integration of the surface forces, i.e. the moment coefficients and the center of pressure's position. For the 
VLM, the assumption of small angle of attack 1s considered, therefore the results should not be considered 
around the angle of attack values close to stall angles. In Horseshoe Vortex Lattice Method (VLM1) or 
classic one, a horseshoe vortex is positioned at the panel quarter chord and the non-penetration condition 1s 
set at the three-quarter chord point, but in Ring Vortex Lattice Method (VLM2) only the trailing vortices 
extend to infinity. 3D Panel method can be used to refine the LLT and VLM results by a more sophisticated 
full 3D method, taking the wings’ thickness into account, whereas the VLM only considers the mean camber 
line. This method provides insight on the center of pressure distributions over the top and bottom surfaces 
of a wing. The principle of a 3D panel method is to model the perturbation generated by the wing by a sum 
of doublets and sources distributed over the wing's top and bottom surfaces!®!°. 

Since XFLRS is used for aerodynamic analysis, first a convergence analysis is carried-out for the mesh 
of the inspired albatross wing shape. In this study, as an example to show the convergence analysis of the 
lift and drag as function of number of panels, 3D-Panel method is run for the albatross wing with zero and 
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five degrees angles of attack. The results show that the number of the optimum mesh which should be 
applied in this type of analysis. It should be mentioned that the wing 1s composed of strips in which each 
strip is divided into X and Y numbers of panels. To investigate the convergence of the aerodynamic results, 
with applying the maximum possible value for the X, different values for Y are considered. In Fig.3, an 
example of mesh is indicated. 

















= | | = 
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Figure 3. Views of mesh and the panels for albatross inspired wing. 
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In Figs. 4(a) and 4(b), the convergence analysis for lift and drag coefficients as a function of the number 
of panels are presented. As can be seen in Figs. 4 (a) and 4(b), the results for specified number of panels 
are starting to converge, which shows the minimum number of the required panels in the Y direction. 
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Figure 4. Convergence analysis for (a) lift and (b) drag coefficients versus number of panels for Y when 
considering zero and five degrees angle of attack. 
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IV. Comparison of wing shapes’ performance 


The aerodynamic analysis of the defined wings 1s carried-out through Horseshoe vortex lattice method 
in order to compare them together. The diagrams of coefficients of lift, drag, and lift to drag versus angle 
of attack are shown in Figs. 5(a), 5(b), and 5(c), respectively. As can be seen from Fig. 5(a), for all the 
considered wings, the elliptical wing provides higher values of lift coefficients, followed by rectangular, 
albatross, albatross with offset, albatross without offset, and delta wing, for angles of attack of zero. When 
the angle of attack is increasing, it can be seen that the lift coefficient of albatross wing 1s increasing more, 
and it generates higher values than rectangular. The variation of the drag coefficient versus angle of attack 
shows that for zero angle of attack the albatross wing shape and the albatross with offset are generating less 
drag coefficient followed by the albatross wing shape without offset, delta, rectangular, and elliptical wing 
shapes. As the angle of attack increases, the delta wing shape produces higher coefficients of drag, while 
the albatross with offset still creates the smallest coefficients of drag. For an angle of attack near zero 
degrees, the inspired albatross wing shape has higher performance (lift to drag ratio) than other wings, 
which can be seen in Fig. 5(c). The maximum lift to drag coefficient occurs at an angle of attack of four 
degrees for the standard geometrical shapes, while for the albatross inspired wing shapes maximum occurs 
at five degrees. Comparing all the wing shapes, the elliptical wing shape shows the highest value of lift to 
drag coefficients followed by the albatross, albatross with offset, rectangular, albatross without offset, and 
delta wing shapes. 
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Figure 5. Diagrams of coefficients of (a) lift, (b) drag, and (c) lift to drag versus angle of attack. 
5 


American Institute of Aeronautics and Astronautics 


Downloaded by UNIVERSITY OF GLASGOW on September 17, 2018 | http://arc.aiaa.org | DOI: 10.2514/6.2018-3899 


In Fig. 6, the pressure, lift, and both viscous and induced drag distribution for defined wings are shown 
for an angle of attack of one degree. As can be seen in Fig. 6, the lift and drag distributions depend on the 
wing shape. It should be mentioned that this aerodynamic analysis is performed using the Horseshoe VLM. 





Figure 6. Views of pressure, lift, and viscous and induced drag distribution for (a) delta, (b) rectangular, (c) 
elliptical, (d) albatross without offset, (e) albatross with offset, and (f) inspired albatross wings. 


V. Comparison of albatross wing shape with other migrating birds 


For the second part of the analysis, the aerodynamic performance of albatross wing shape and the wing 
shapes of other birds consisting of the artic tern, the sooty shearwater, the golden eagle, the great white 
pelican, and the whooper swan are compared. The analysis is performed using 3D panel method. AIl the 
birds’ extracted wing shapes are scaled to have the same wingspan, wing surface area, and the same aspect 
ratio. All wings are modeled using the GOE 174. The velocity of the analysis was kept at a constant rate of 
15m/s for all wing shapes. In Fig. 7, views of considered birds and their modeling are shown. 
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Figure 7. Views of considered birds and their patterned wings. 


Considering the shown birds and their patterned wings, an aerodynamic analysis is carried-out which 
results in the shown graphs in Figure 8. In Figs. 8(a), 8(b), and 8(c), the lift, drag, and lift to drag coefficients 
as function of the angle of attack are depicted, respectively. It follows from the plotted curves in Fig. 8(c) 
that albatross has the highest lift to drag coefficient of all the wing shapes that were extracted. The value of 
this peak is 26.8 at an angle of attack of 3.5 degrees. For angles of attack from -2 to 4.4 degrees the albatross 
has the highest lift to drag coefficient. This angle of attack is desired for a soaring bird unmanned aerial 
vehicle. A bird that performed similar to the albatross is the artic tern. The artic tern has the second largest 
lift to drag coefficient for the angle of attack range of -2 to 4.4 degrees, and after 4.4 to 8 degrees has the 
largest lift to drag coefficient. When comparing the lift coefficient to the angle of attack, the wing shapes 
of albatross and the artic tern have a higher value of the lift coefficient for the range of angle of attack. This 
is due to their large aspect ratio of the birds’ wing planform. The albatross has an aspect ratio of 11.5 while 
the artic tern has an aspect ratio of 11.8. The next closest aspect ratio is that of the great white pelican with 
a value of 7.8. The great white pelican has the third highest lift coefficient and the third smallest drag 
coefficient, which can be seen in Fig. 8(b). 
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Figure 8. Diagrams of coefficients of (a) lift, (b) drag, and (c) lift to drag versus angle of attack for studied 
migrating birds. 


VI. Conclusions 


In this study, the wing of an albatross as a migrating bird was patterned and compared to other 
geometrical shapes as well as other migrating birds’ wing shapes from aerodynamics points of view. Two 
different aerodynamic analyses were used using Horseshoe VLM and 3D-Panel methods. A convergence 
analysis was also carried-out to determine the optimum number of panels for the analysis. Considering 
similar wingspan, wing area, and aspect ratio, an aerodynamic analysis was performed in the same 
conditions (airfoil and flight speed). The results in both studies showed that for lower angles of attack the 
albatross wing shape generated better flight performance (lift to drag ratio) compared to geometric and 
migrating birds’ wing shapes. 
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Abstract 


Since the original 1963 Frisbee flying discs have evolved, becoming ubiquitous and far-ranging. Effects of drag, 
lift, pitching moment and spin on range are analysed. With lift, ranges significantly exceed the drag-free ballistic 
range. An upper limit wings level case, optimized for range, is compared with the realistic free case of a spinning, 
banking disc. For 30 m/s launch and 0.033 minimum drag coefficient the wings level range is 186 m, but the free 
range only 157 m. Tests report ranges between 77 - 170 m. The procedure is a powerful tool for designing superior 
discs. 


(©) 2010 Published by Elsevier Ltd. 


Keywords: Flying discs: Frisbee: flight dynamics 


1. Introduction 


From time immemorial men and apes have hurled things (feces, rocks, sticks and spears) at other animals that 
displeased them. As missiles, these illustrate the military principle of action at a distance. Simian feces-slinging is 
reportedly quite accurate, so apes, as well as people, must have noted that range depends on the launch speed and 
angle (V,@) as well as weight, size and shape. For a flattish projectile launched roughly horizontally, aerodynamic 
lift extends range significantly. The flying disc is a prolific and far-ranging example of a lifting projectile, with 
many variants. The original Frisbee, since 1963 a well-known commercial sport product, has evolved into the low 
drag, long-range professional, competition disc-golf version. The Aerobie flying disc, a ring-shaped advanced 
design, in 2003 achieved a range exceeding 400 m, the furthest of any hand thrown projectile. The present work 
gives a theoretical upper bound to range, assuming an optimized, controlled “wings level” trajectory and compares 
this with the predicted flight of a “free” spinning disc, banking in response to spin and pitching moment. 
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2. Fundamental ballistics 


2.1. Ballistic trajectories 
For an object with no external forces except gravity g the trajectory is called ballistic and range given by 


R = (V’/g) sin 2@. Range varies with V’, as expected, since it defines launch kinetic energy. Ballistic range is 
maximized for a 45° launch. The effect of launch angle on range was well known to artillerymen in the 15th century; 
Leonardo drew a cartoon c. 1505 illustrating this for mortar trajectories. 


2.2. Effect of drag 


For a projectile with only aerodynamic drag, the range is less than the ballistic value. The effect of drag is shown 
in Fig. 1, plotting trajectories for a number of non-lifting projectiles launched at 24 m/s and 45°. The cannon ball 
range is close to the normalized ballistic range of 6.25. It is little affected by drag; the ping-pong ball profoundly so. 
For a sport disc (Frisbee is a commercial trademark) at a minimum drag coefficient of 0.080 and zero lift the range 
is about 3.6, much less than the drag-free case. Disc range can increase significantly with lift. 





Range 


Fig. 1. Trajectories of common projectiles 


2.3. Effect of lift 


It was apparent to the original Olympic Greek athletes that the range of a classical symmetric discus can be 
increased by exploiting lift. Calculations for a standard discus with lift by Hubbard and Cheng [1] show a 10% 
increase over the ballistic range. But lift causes induced drag, so that the drag of a lifting system exceeds the 
minimum drag. Yet the lift, even with induced drag, produces longer range than the drag-free case. A near ballistic 
trajectory terminates almost at launch speed, so that little kinetic energy is lost. But with lift this kinetic energy can 
extend range, resulting in a low touch down speed, as described below 

A lifting vehicle has maximum glide range at the lift coefficient corresponding to maximum lift/drag ratio. The 
vehicle glides at constant flight speed and angle, extracting potential energy to offset drag losses. Glide should start 
from maximum height. To maximize height the vehicle should ascend at minimum drag (zero lift) so that most of 
the launch kinetic energy is converted into height (potential energy). This requires a roughly ballistic trajectory 
during ascent. The vehicle then glides at cruise speed until the ground is approached, when lift is increased to 
extend the range at the expense of forward speed. For aircraft landing this is called the flare-out. 
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Fig. 2 Trajectories of sport disc with drag and lift 


Some trajectories typical of a sport disc, described in 2.2, are shown above in Fig. 2. For a drag-free ballistic 
trajectory the normalized range is about 6.25; with drag alone it is only 3.6. Exploiting lift the range is about 7.2, 
exceeding the ideal drag-free case. The fine lines show the three asymptotic regimes, the low drag, almost parabolic 
ascent, the steady glide descent at optimum glide angle and the flare-out to horizontal as the ground is approached. 


3. Controlled wings-level trajectories 


3.1. Ideal dynamic flight equations 


An ideal case of maximum range is obtained assuming that the disc flies wings level (without banking) and that 
the lift coefficient 1s controlled to provide the optimal schedule.. The flight dynamics are described by the equations 
of motion for an unpowered vehicle by Lissaman [2], where the dynamic flight trajectory of the albatross, soaring, 
turning and diving in a planetary boundary layer wind without wing flapping, is modelled and optimized. The case 
here is simpler, with no bank (wings level) and no ambient wind. The non-dimensional equations contain only one 
control variable, the normalized lift coefficient, that can be optimally scheduled to maximize the range. 


3.2. Optimal trajectories 


A typical optimized trajectory, using 50 equal spaced lift-coefficient control inputs, illustrating the above, for a 
sport disc with a minimum drag coefficient of 0.080 launched at 25 m/s is shown in Fig. 3. Normalized plots of the 
control lift, the flight speed, and the trajectory are shown. 
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Fig. 3 Typical wings level sport disc trayectory with optimum lift 
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The lift starts at zero, increases smoothly to the cruise lift (normalized here as unity), holds constant for the 
glide and then rises to the maximum for flare out (the lift plateau represents the stall lift). During the glide portion 
the speed is constant (normalized as unity) at an approximately constant glide angle, as predicted. 

For a low drag competition disc (Cpo = 0.033) launched at 30 m/s the wings level model predicts a range of about 
186 m, implying an augmentation above the ballistic of about 101%. Pozzy (2000) presents data for 26 launches of 
different discs by experts at speeds between 29 and 31 m/s, reporting an average range of about 123 m with a 
maximum of about 170 m. 


3.3. Disc attitude 


The angle of attack (angle of the velocity with respect to the disc plane) for an optimal wings level trajectory is 
that required for optimal lift. The disc ascends in a roughly ballistic parabola, so that, for minimum drag, the angle is 
that for zero lift, approximately parallel to the flight path. The descent is at cruise conditions, requiring a constant 
attitude (disc angle to the horizontal) and lift, while the final stage calls for a high angle of attack to extend the flare- 
out. So the disc must start ascending at an attitude of about 45°, level off at the apex and pitch over to cruise attitude 
for the glide descent, with a final pitch up for the flare. A projectile with stabilizing fins, like a bomb, does not 
execute this trajectory, since the fins force a near zero angle of attack and lift throughout the flight. 

The free disc will not execute this schedule. Rather it pitches and banks according to the coupling between its 
aerodynamic pitching moment and the gyroscopic effects due to spin. This free trajectory is discussed next. 


4. Free trajectories 


4.1. Mechanics of free trajectory 


For the real case, the disc is released with an initial launch speed, spin rate and attitude. Discs are normally 
unstable due to displacement of the center of lift ahead of the center of mass (c.m.), and, to avoid tumbling, must be 
stabilized by spin. The pitching moment couples gyroscopically with the spin to induce a roll rate. This rate is right 
side upwards for a pitch up moment on a disc spinning clockwise viewed from above. Long range is achieved by 
exploiting the gyroscopic terms so that the disc acquires a roughly wings level state near the apex and thereafter 
glides at approximately constant attitude, giving the optimum lift/drag ratio. This requires that the disc be launched 
at the correct ascentangle, usually with a pronounced bank, calling for skill by the thrower. 

It 1s not possible to design the pitching moment curve so that the disc actually follows the ideal orientation 
calculated in the optimal case. In fact, the variation of the pitching moment at disc angles of attack between -5° and 
10° has a very pronounced effect on the free trajectory. To calculate the trajectory, a Matlab code following Hubbard 
and Hummel [3] has been written and is exercised for different launch conditions and pitching moment 
characteristics. The former are controlled by the disc thrower, the latter by the disc designer. The code requires 
input of the disc physical properties and launch conditions as well as fundamental aerodynamic force and moment 
derivatives. The latter are not well known since the disc experiences separated flow over the top, and cavity flow on 
the lower surface; a complicated flow state, beyond the capability of prediction by computational methods. Wind 
tunnel tests on rotating discs by Potts reported by Hubbard and Hummel [3] indicate that the aerodynamic center of 
the disc changes significantly with angle of attack for attack angles below 10°. This is expected from the complex 
upper and lower surface separated flow for disc-like shapes. Computational fluid dynamics methods cannot 
correctly predict this flow, so that testing is required for accurate drag and pitching characteristics. 


4.2. Maximum range cases 


The disc is launched at a speed and spin of 30 m/s and 50 rad/s. Constants are 0.175 kg mass; 0.27 m diameter; 
spin axis moment of inertia, I = 0.0025 kg m’: lift at zero angle of attack, Cy9 = 0.15; lift curve slope, Cla = 2.91; 
minimum drag, Cpo = 0.033; span effectiveness, e = 0.85; pitch damping, Cy, = - 0.10; roll damping, Cy, = - 0.15. 

Bank angles and trajectories for best range for a typical high performance disc for wings level and for free flight 
are shown in Figs. 4 and 5. The wings level and free trajectories are similar except for the flare-out, even though the 
free case is far from wings level, involving large, near vertical, bank angles during the initial ascent. 
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The range increase over ballistic compared with the sport disc is apparent. The free case starts with a large — 70° 
bank (right side up) levels off before mid-flight and does not flare. The pitching moment characteristics for the 
longest range are those of Case 1, close to those measured on a sport disc. 





Range, m 





Range, m 


Fig. 5 Trajectories for ballistic, tree and wings level cases, competition disc 


The pitching moment characteristics have a dominant effect on the flight. Fig. 6 shows the pitching moment 
coefficients used in the cases analyzed in the flight dynamics model. 
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Fig. 6 Representative pitching moment characteristics 
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Four cases have been analysed, each having representative pitching moments. They are: Case 1: from 
measurements on an actual disc, with a slightly unstable moment, as given by Potts wind tunnel tests of a sport disc, 
described by Hubbard and Hummel [3], Case 2: a neutral case, where the moment gradient between 0° and 10° is 
zero, Case 3: where the moment has a slight stable gradient in this range and Case 4, where the moment is linear and 
unstable, corresponding to a fixed aerodynamic center at 6.3% of the diameter ahead of the c.m. The four cases 
were analyzed, optimum initial conditions for each determined, and maximum range calculated, as tabulated in 
Table I. Case 4 gave a very low range and results have not been listed. It is of interest that the Case 4 “normal”, 
unstable linear pitching moment, associated with a shape with a fixed aerodynamic center, does not provide suitable 
moment characteristics for good performance from a free, spinning disc. 

Sensitivity is given by calculating the range reduction due to a 5° change in both launch bank and angle. The 
stable case, 3, is very sensitive to small errors in launch, and would require much skill to throw consistently. 

Five cases for the competition disc are summarized in Table I. Spin is clockwise viewed from above and positive 
bank angle right wing down. Case 3 has a slightly stable pitching moment and initial bank is in the opposite 
direction to that of Cases | and 2. In Case 3 there is a large change associated with a small reduction in initial 
optimum bank angle. For launch at 36° ascent angle and 52° bank, Case 3 range drops by more than 66 m from the 
optimum of 135 m listed in the table. This needs interpretation. 


Table I. Range for varying cases 





Case Stability Ascent Angle, ° Bank, ° Range, m Sensitivity, m 
Free, Case | Unstable 41 -75 157 2.0 
Free, Case 2 Neutral 8 -5 144 1.3 
Free, Case 3 Stable 31 57 135 66.7 
Pozzy NA NA NA 184 NA 
Wings level NA AA 0 186 NA 


5. Field tests 


On Dec. 20, 2009 field tests were conducted using competition disc-golf discs thrown by professional disc 
golfers (G. Lissaman and J. Lissaman) and by amateurs (D. Peterson and R. McMurray) Range and launch speeds 
were measured and trajectories recorded with 30 Hz video for 100 flights. Observed ranges compare well with those 
of Pozzy [4]. Although the professional throwers had mean release velocities slightly, but not significantly, less than 
those of the amateurs (V,=22.9, V,=23.3m/s), their velocities were more repeatable (0,,=0.7, O,,=1.2 m/s) and they 
achieved larger (and again more repeatable) mean ranges (R,=81.8, R,=69.9 m; Op,=10.8, Or,=16.6 m). Detailed 
records of trajectories and velocity will permit estimation of disc aerodynamic properties in the future. 


6. Conclusions 


1. Methods of calculating the range of a disc for the ideal wings level case and for the spinning and banking free 
case have been developed and exercised. 

2. Range depends strongly on the minimum drag coefficient. For competition discs with minimum drag coefficient 
of 0.033 launched at 30 m/s the ideal wings level case range is about 186 m. Free disc range is 157 m. 

3. The free range depends significantly on the pitching moments in the angle of attack range 0° < a < 10°. 

4. The slightly unstable pitching moment characteristics obtaining in Free Case | give the best range for the free 
cases considered. 

5. A field test procedure has been developed and exercised. Test results support the free disc analysis, but the 
theory cannot be exactly correlated with the experiment without accurate values of aerodynamic characteristics . 
The data will be reduced and these characteristics will be reported later. 
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Foreword 


The job of the aeronautical engineer has changed dramatically in recent years and will continue to change. Ad- 
vanced computational tools have revolutionized design processes for all types of flight vehicles and have made it 
possible to achieve levels of design technology previously unheard of. And as performance targets have become 
more demanding, the individual engineer's role in the design process has become increasingly specialized. 

In this new environment, design work depends heavily on voluminous numerical computations. The computer 
handles much of the drudgery, but it can't do the thinking. It is now more important than ever for a practicing 
engineer to bring to the task a strong physical intuition, solidly based in the physics. In this book, Doug McLean 
provides a valuable supplement to the many existing books on aerodynamic theory, patiently exploring what it all 
means from a physical point of view. Students and experienced engineers alike will surely profit from following 
the thought-provoking arguments and discussions presented here. 


John J. Tracy 

Chief Technology Officer 
The Boeing Company 
September 2012 


Series Preface 


The field of aerospace is wide ranging and multi-disciplinary, covering a large variety of products, disciplines 
and domains, not merely in engineering but in many related supporting activities. These combine to enable the 
aerospace industry to produce exciting and technologically advanced vehicles. The wealth of knowledge and ex- 
perience that has been gained by expert practitioners in the various aerospace fields needs to be passed onto oth- 
ers working in the industry, including those just entering from University. 

The Aerospace Series aims to be a practical and topical series of books aimed at engineering professionals, 
operators, users and allied professions such as commercial and legal executives in the aerospace industry, and 
also engineers in academia. The range of topics 1s intended to be wide ranging, covering design and development, 
manufacture, operation and support of aircraft as well as topics such as infrastructure operations and develop- 
ments in research and technology. The intention is to provide a source of relevant information that will be of 
interest and benefit to all those people working in aerospace. 

Aerodynamics is the fundamental enabling science that underpins the world-wide aerospace in- 
dustry—without the ability to generate lift from airflow passing over wings, helicopter rotors and other lifting 
surfaces, it would not be possible to fly heavier-than-air vehicles as efficiently as is taken for granted nowadays. 
Much of the development of today's highly efficient aircraft is due to the ability to accurately model aerodynamic 
flows using sophisticated computational codes and thus design high-performance wings; however, a thorough 
understanding and insight of the aerodynamic flows is vital for engineers to comprehend these designs. 

This book, Understanding Aerodynamics, has the objective of providing a physical understanding of aerody- 
namics, with an emphasis on how and why particular flow patterns around bodies occur, and what relation these 
flows have to the underlying physical laws. It is a welcome addition to the Wiley Aerospace Series. Unlike most 
aerodynamics textbooks, there is a refreshing lack of detailed mathematical analysis, and the reader is encouraged 
instead to consider the overall picture. As well as consideration of classical topics—continuum fluid mechanics, 
boundary layers, lift, drag and the flow around wings, etc.—there is also a very useful coverage of modelling 
aerodynamic flows using Computational Fluid Dynamics (CFD). 


Peter Belobaba, Jonathan Cooper, Roy Langton and Allan Seabridge 


Preface 


This book is intended to help students and practicing engineers to gain a greater physical understanding of aero- 
dynamics. It is not a handbook on how to do aerodynamics, but is motivated instead by the assumption that en- 
gineering practice 1s enhanced in the long run by a robust understanding of the basics. 

A real understanding of aerodynamics must go beyond mastering the mathematical formalism of the theories 
and come to grips with the physical cause-and-effect relationships that the theories represent. In addition to the 
math, which applies most directly at the local level, intuitive physical interpretations and explanations are re- 
quired if we are to understand what happens at the flowfield level. Developing this physical side of our under- 
standing is surprisingly difficult, however. It requires navigating a conceptual landscape littered with potential 
pitfalls, and an acceptable path is to be found only through recognition and rejection of multiple faulty paths. It 
is really a process of argumentation, thus the “arguing” in the title. This kind of argumentation is underemphas- 
ized in other books, in which the path is often made to appear straighter and simpler than it really is. This book 
explores a broader swath of the conceptual landscape, including some of the false paths that have led to errors in 
the past, with the hope that it will leave the reader less likely to fall victim to misconceptions. 

We'll encounter several instances of serious misinterpretations of mathematical theory that are in wide circu- 
lation and of erroneous physical explanations that have found their way into our folklore. In any case where a 
misconception has been widely enough propagated, the “right” explanation would not be complete without the 
debunking of the “wrong” one. I have tried to do this kind of debunking wherever it seemed appropriate and have 
not hesitated to say so when I think something is wrong. This is part of what makes aerodynamics so much fun. 
It's one of those little perversities of human nature that coming up with a good explanation is much more satisfy- 
ing when you know there are people out there who have got it wrong. But debunking bad explanations serves a 
pedagogical purpose as well, because the contrast provided by the wrong explanation can strengthen understand- 
ing of the right one. 

This effort devoted to basic physical rigor and avoiding errors comes at a cost. We'll spend more time on some 
topics than some will likely think necessary. I realize some parts of the discussion are long and are not easy, but I 
hope most readers will find it worth the effort. 

We are now well into what I would call the computational era in aerodynamics, made possible by the ever- 
advancing capabilities of computers. In the 1960s, we began to calculate practical numerical solutions to linear 
equations for inviscid flows in 3D. In the 1970s, 1t became economical to compute solutions to nonlinear equa- 
tions for inviscid transonic flows in 3D and to include viscous effects through boundary-layer theory and viscous/ 
inviscid coupling. By the 1990s, we were routinely calculating solutions to the Reynolds-averaged Navier-Stokes 
(RANS) equations for full airplane configurations. These computational fluid dynamics (CFD) capabilities have 
revolutionized aerodynamics analysis and design and have made possible dramatic improvements in design tech- 
nology. CFD is now such a vital part of our discipline that this book would not be complete if it did not address it 
in some way. While this is not a book about CFD methods or about how to use CFD, there are conceptual aspects 
of CFD that are relevant to our focus, and these are considered in chapter 10. 

I believe that although we now rely on CFD for much of our quantitative work, it is vitally important for a 
practicing engineer to have a sound understanding of the underlying physics and to be familiar with the old sim- 
plified theories that our predecessors so ingeniously developed. These things not only provide us with valuable 
ways of thinking about our problems, they also can help us to be more effective users of CFD. 

The unusual scope of the book is deliberate. The book is not intended to be a handbook. Nor is it intended as 
a substitute for the standard textbooks and other sources on aerodynamic theory, as I have omitted the mathem- 


atical details whenever the physical understanding I seek to promote can be conveyed without them. This ap- 
plies especially to the discussion of the basic physics in the early chapters. Those looking for rigorous deriva- 
tions of the mathematical details will have to look elsewhere. Also, exhaustive scope is not a practical goal. So, 
for the details on many of the topics treated here, and for any treatment at all of the many topics neglected here, 
the reader will have to consult other sources. This book is also not intended as an introduction to the subject. 
Though it would not be impossible for someone with no prior exposure to follow the development given here, 
some experience with the subject will make it much easier. And while I assume no prior knowledge of the sub- 
ject, I do assume a higher level of technical sophistication than is often assumed in undergraduate-level texts. 

An understanding of the physical basics is more secure if it includes an appreciation of the “big picture,” the 
logical structure of the body of knowledge and the collection of concepts we call aerodynamics. I have tried to 
at least touch on all of the topics that are so basic that the overall framework could not stand without them. I 
also devote more attention than most aerodynamics textbooks to the relationships between the parts, to how it 
all “fits together.” Beyond that, several considerations have guided my choice of topics and the kinds of treat- 
ment I've given them. One is my own familiarity and experience. Another is my observation of some common 
knowledge gaps, things that don't seem to be covered well in the usual aero engineering education. But we'll 
also spend a good part of our time on some of the very familiar things that we tend to take for granted. Our 
understanding of these things is never so good that it can't benefit from taking a fresh look. We'll put a different 
spin on some familiar topics, for example, what the Biot-Savart law really means and why it causes so much 
confusion, what “Reynolds number” and “incompressible flow” really mean, and a real physical explanation for 
how an airfoil produces lift. 

As we'll see in chapter 1, the subject matter of aerodynamics consists of physical principles, conceptual mod- 
els, mathematical theories, and descriptions and physical explanations of flow phenomena. Some of this subject 
matter has direct practical applications, and some doesn't. We'll spend considerable time on some topics that 
have no apparent practical import, for example, physical explanations of things for which we have perfectly 
good quantitative theories and esoterica such as how lift is felt in the atmosphere at large. We'll do these things 
because they provide general fluid-mechanics insight and because they serve to expand our appreciation of the 
cognitive dimension of the subject, the processes by which we think about aerodynamic phenomena and the 
practical problems that arise from them. They also help us to see how mistaken thinking can arise and how to 
avoid it. The medical profession in recent years has begun to pay more attention to the cognitive dimension of 
their discipline, studying how doctors think, in an effort to improve the accuracy of their diagnoses and to avoid 
mistakes. Doing some of the same would be good for us as well. 

Aerodynamics as a subject encompasses a wide variety of flow situations that in turn involve a multitude 
of detailed flow phenomena. The subject is correspondingly multifaceted, with a rich web of interconnections 
among the phenomena themselves and the conceptual models that have been developed to represent them. Such 
a subject has a logical structure of course, but it is not well suited to exposition in a single linear narrative, and 
there is therefore no ideal solution to the problem of organizing it so that it flows completely naturally as a 
single string of words. The organization I have chosen is based not on the historical development or on a pro- 
gression from “easy” concepts to “advanced,” but on a general conceptual progression, from the basic physics, 
to the flow phenomena, and finally to the conceptual models. I have tried to organize the material so that it can 
be read straight through and understood without the need to skip forward. I have also tried to provide direct 
references whenever I think referring back to previous chapters would be helpful and to alert the reader when 
further discussion of a topic is being deferred until later. 

The general flow of the book is as follows. First, we take an overview of the conceptual landscape in chapter 
1. Then we consider the basic physics as embodied in the NS equations in chapters 2 and 3. We turn to the 
phenomenological aspects of general flows in boundary layers and around bodies in chapters 10 and 5. We then 
enter the more specific realm of aerodynamic forces and their manifestations in flowfields to deal with drag in 


chapter 6 and lift generation, airfoils, and wings in chapters 7 and 8. All of this sets the stage for a bit of a re- 
gression into theory, with discussions of theoretical approximations and CFD in chapters 9 and 10. 

When I started writing I had something less ambitious in mind, something more on the scale of a booklet 
with a collection of helpful ways of looking at aerodynamic phenomena and a catalog of common misconcep- 
tions and how to avoid them. As the project progressed, it became clear that effective explanations required 
more background than I had anticipated, and the book gradually grew more comprehensive. The first draft in 
something close to the final form was completed in late 2008 and was reviewed by several Boeing colleagues 
(acknowledged below). Their feedback was incorporated into a second draft that was used in a 20-week after- 
hours class for Boeing engineers in 2009. Feedback from class participants and others led to significant revi- 
sions for the final draft. As it turned out, the general argumentative approach I've taken to the subject extended 
to the writing process itself. Many sections saw multiple and substantial rewrites as my thinking evolved. 

I gratefully acknowledge the help of many people in getting me through this long process. First, my wife, 
Theresa, who put up with the many, many weekends that I spent in front of our home computer. Then The Boe- 
ing Company, which allowed me to spend considerable company time on the project, Boeing editors, Andrea 
Jarvela, Lisa Fusch Krause, and Charlene Scammon, who turned my raw Word files and graphics into a present- 
able draft and helped me take that draft through several revisions, and Boeing graphics artist John Jolley, 
who redrew nearly half the graphics. Finally, the friends and colleagues without whose help the book would 
have been much poorer. Mark Drela (MIT), Lian Ng, Ben Rider, Philippe Spalart, and Venkat Venkatakrish- 
nan provided very detailed feedback and suggestions for improvement. Steve Allmaras and Mitch Murray 
made special CFD calculations just for the book. My former Boeing colleague Guenter Brune wrote the ex- 
cellent 1983 Boeing report on flow topology that introduced me to the topic and served as the basis of much 
of Section 5.2.3. Another former Boeing colleague, Pete Sullivan, did the CFD calculations plotted in Sec- 
tion 6.1.5. And many others contributed feedback on various drafts of the manuscript: Anders Andersson, John 
D. Anderson (University of Maryland), Byram Bays-Muchmore, Bob Breidenthal (University of Washington), 
Julie Brightwell, Tad Calkins, Dave Caughey (Cornell University), Tony Craig, Jeffrey Crouch, Peg Curtin, 
Bruce Detert, Scott Eberhardt, Winfried Feifel, David Fritz, Arvel Gentry, Mark Goldhammer, Elisabeth Gren, 
Rob Hoffenberg, Paul Johnson, Wen-Huei Jou, T. J. Kao, Edward Kim, Alex Krynytsky, Brenda Kulfan, Louie 
LeGrand, Adam Malachowski, Adam Malone, Tom Matoi, Mark Maughmer (Penn State University), the late 
John McMasters, Kevin Mejia, Robin Melvin, Greg Miller, Deepak Om, Ben Paul, Tim Purcell, Steve Ray, 
Matt Smith, John Sullivan (Purdue University), Mary Sutanto, Ed Tinoco, David Van Cleve, Paul Vijgen, Dave 
Witkowski, Conrad Youngren (New York Maritime College), and Jong Yu. 

Thanks also to the copyright owners who kindly gave permission to use the many graphics I borrowed from 
elsewhere. They are acknowledged individually in the figure titles. 


Doug McLean, 
April 2012. 


List of Symbols 


Many of the symbols listed below have different meanings in different contexts, as indicated when multiple defin- 
itions are given. When an example of usage (a figure or equation) is listed, 1t is not necessarily the only example. 


English Symbols 


a Acceleration 
Speed of sound 
A Streamtube area 


Amplitude of a disturbance in laminar-flow stability theory 
A1,A2 Coefficients of induced-drag polar (equation 8.3.17) 
Ai Coefficients of sin-series spanloads (equation 8.3.20) 
Ag Wake mone area (equation 6.1.6) 
AR _ Aspect ratio = b 2/5 
b Wingspan 
bo Span between the centers of trailing vortex cores (figure 8.3.3) 
B Constant in law of wall (equation 4.4.10) 
C Airfoil chord 
Cavg Wing average chord 
c Wing average chord (figure 8.3.7) 
Cp Specific heat at constant pressure 
Cy Specific heat at constant volume 
C Cylindrical part of the outer boundary of a control volume (figure 6.1.1) 
Ca Drag coefficient (2D or per unit span) 
Cp Drag coefficient (3D) 
Cpi Induced-drag coefficient (3D) 
CpiminMinimum induced drag coefficient on induced-drag polar (equation 8.3.19) 
Cpio Induced drag coefficient at zero lift (equation 8.3.17) 
Cpo Drag coefficient at zero lift (equation 8.3.15) 
Cr Skin-friction coefficient 
C) Lift coefficient (2D or per unit span) 
CL Lift coefficient (3D) 
CLmax Maximum lift coefficient of a 3D wing 
Cm ‘Pitching moment coefficient (2D or per unit span) 
Cm Pitching moment coefficient (3D) 
Cn Sectional normal-force coefficient (figure 8.3.7) 
CN Total normal-force coefficient (figure 8.3.7) 
Cp _ Pressure coefficient 





Smith's canonical pressure coefficient (equation 7.4.1) 
C* — Attachment-line Reynolds number (equation 8.6.15) 
d Viscous drag vector of a propeller blade section (figure 6.1.17) 
db Diameter of fuselage (equation 8.3.29) 
D Drag 


‘cai <0 


YoZWs 


Induced drag (equation 8.3.5) 
Thermodynamic internal energy 


Base of natural logs 

Induced-drag efficiency factor (equation 8.3.14) 

Induced-drag efficiency factor of untwisted version of a wing (equation 8.3.19) 
Oswald efficiency factor (equation 8.3.15) 


Force 

Function 

Genus of a region of a surface (equation 5.2.2) 
Enthalpy 


Height of a vortex generator 

Height of an excrescence (figure 6.2.1) 

Height above the ground (equation 7.4.2) 

Riblet protrusion height (figure 6.3.6) 

Total enthalpy 

Boundary-layer shape factor 

Unit vector in x direction (equation 6.1.1) 

Square root of minus one 

Index of a region of a surface (equation 5.2.3) 
Cumulative skin-friction integral (equation 6.1.13) 
Enstrophy integral (equation 6.1.15) 

Propeller advance ratio = V/ndp 

Thermal conductivity 

Roughness height 

Unit curvature vector (figure 4.2.4) 

Unit vector in z direction (equation 5.4.1) 
Equivalent sand-grain height 

Length 

Lift per unit span of a wing or propeller blade 
Length 

Lift in 3D (equation 5.4.1) 

Carry-through lift on fuselage (figure 8.3.15) 

Lift on tail or canard (figure 8.3.17) 

Lift on exposed wing (figure 8.3.15) 

Mach number 

Propeller shaft torque (equation 6.1.19) 

Wing-root bending moment (equation 8.3.28) 
Mass 

Exponent in power-law velocity distributions for laminar boundary layers (section 4.3.2) 
Excrescence-drag magnification factor (figure 6.1.13) 
Exponent in Smith's power-law airfoil velocity distributions (figure 7.4.12) 


Mass flux of a source or sink (equation 8.3.3) 
Normal direction 

Propeller revolutions per unit time 

Unit normal vector (figure 3.3.6) 
Nodal-point singularity (figure 5.2.6) 
Pressure 

Propeller shaft power (equation 6.1.20) 
Prandtl number 
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Ideal gas constant 
Reynolds number (any subscript indicates reference length used) 
Radius from airplane to a point on the ground (figure 8.5.3) 


Attachment-line Reynolds number parameter (equation 8.6.16) 

Average of absolute value of roughness height 

Critical Reynolds number, at onset of instability (figure 4.4.3) 

Radius 

Recovery factor (equation 4.6.4) 

Radius to the maximum-velocity point in a trailing vortex core (figure 8.1.8) 

Radius to the point of effectively zero vorticity in a vortex core (figure 8.1.8) 

Radius of fuselage (figure 8.3.14) 

Radius of vortex core (equation 8.3.6) 

Arc length 

Riblet spacing 

Wing area 

Propeller disc area 

Entropy 

Saddle-point singularity (figure 5.2.6) 

Denotes integration over a surface (equation 6.1.1) 

Temperature 

The Trefftz plane (figure 6.1.1) 

Thrust of a propeller (equation 6.1.18) 

Time 

Unit tangent vector 

Cartesian x-velocity component 

Perturbation velocity in the x direction (figure 7.4.1 (a)) 
pment 


ee a a / 
Friction velocity Ur = Viw/!P 
Undisturbed freestream velocity in the x direction 
Cartesian y-velocity component 
Perturbation velocity in the y direction (figure 7.4.1 (a)) 
Velocity vector 
Velocity magnitude 
Volume 
Cartesian z-velocity component 
Perturbation velocity in the z direction (equation 8.3.5) 
Outer function in the law of the wake (equation 4.4.13) 
Cartesian space coordinate 
Cartesian space coordinate 
Cartesian space coordinaten 


Greek Symbols 


Wavenumber in x direction (equation 4.4.1) 

Angle of attack 

Angle of attack at zero total lift (equation 8.3.16) 

Flow direction angle 

Laminar boundary-layer similarity parameter (figures 4.3.5 and 4.3.6) 
Turbulent boundary-layer similarity parameter (equation 4.3.8) 


Wavenumber in z direction (equation 4.4.1) 
0) Boundary-layer thickness 
0) Boundary-layer thickness 
0) Boundary-layer displacement thickness (equation 4.2.13) 


Loc Local 8* integral (equation 4.2.14) 
d Velocity potential (equation 3.10.1) 
y Ratio of specific heats, cp/cy 
I ~— Circulation 
Io Circulation of vortex core (equation 8.3.6) 
n Dimensionless spanwise coordinate on a wing = 2y/b 
n Propeller efficiency = thrust work out/shaft work in (equation 6.1.21) 
ni‘ Propeller induced efficiency 
K Streamline curvature (figure 4.2.4) 
Von Karman constant (equation 4.4.10) 
A Wing sweep (figure 8.6.1) 
rX Mixing length 
u Coefficient of shear viscosity 
Propeller torque coefficient (equation 6.1.19) 


Melt Effective viscosity, sum of viscous and turbulent (equation 6.1.15) 
Vv Kinematic viscosity, u/p 
nm pi=3.14159... 
II Constant in the law of the wake (equation 4.4.13) 
vy) Boundary-layer momentum thickness (equation 4.2.11) 
Angular coordinate around circular cylinder (figure 5.1.3) 
Flow angles entering and leaving a cascade (figure 7.4.23 (b)) 


Dihedral angle (equation 8.3.11) 


p Density 
O Propeller power loading (equation 6.1.20) 
40 Shear stress 


Propeller thrust loading (equation 6.1.18) 
? Vorticity magnitude 
Frequency (equation 4.4.1) 
/ = Vorticity vector 
yw _ Transformed spanwise coordinate (equation 8.3.22) 


Subscripts 


1 In the boundary-layer x direction (equation 4.2.15) 
Denotes conditions upstream of a cascade (figure 7.4.23 (b)) 
Denotes conditions upstream of the shock on a transonic airfoil (figure 7.4.31) 
2 Denotes conditions downstream of a cascade (figure 7.4.23 (b)) 
Denotes conditions downstream of the shock on a transonic airfoil (figure 7.4.31) 
In the boundary-layer z direction (equation 4.2.15) 
Of the boundary-layer coordinate system (figure 4.2.2) 
c Pertaining to a vortex core (figure 8.3.3) 
Cross-flow component (figure 4.1.7) 
Airfoil chord 
ch Chordwise (figures 4.3.11 and 4.3.12) 


Oo WwW 


cut Pertaining to a cut through a wake (figure 6.1.3) 
d___ Based on diameter 
Drag per unit span 


D_ Drag 

e Atthe edge of the boundary layer 
f Friction 

i Incompressible 


Induced, as applied to a propeller 
j Pertaining to a blowing jet (equation 4.5.1) 
K Kinematic (equation 4.6.8) 
/ Lift per unit span 
L_ Based on length L 
Lift 
localDenoting the effective local freestream condition for an excrescence 
m Pitching moment per unit span 
n Connectivity of a2D domain (equation 5.2.5) 
Direction normal to wake cut (equation 8.3.10) 
o Denotes conditions at the start of an airfoil pressure recovery (equation 7.4.1) 
At constant pressure 
Propeller 
ref Reference 
rms Root-mean-square 
s Relative to streamwise direction at boundary-layer edge (figure 4.1.8) 
sep Denotes conditions at separation (discussion of figure 7.4.13) 
sp Spanwise (figures 4.3.11 and 4.3.12) 
sw From steamwise at boundary-layer edge to the wall (figure 4.1.8) 


t Turbulent (equation 3.7.5) 
Total (or stagnation) (equations 3.8.3—6) 


T Thermal (equation 4.6.2) 
v_ Atconstant volume 

w At the wall 

x Based on x 

CO 


At infinite distance, far field 

At infinite height from ground (figure 8.3.12) 

| Perpendicular 

perp Perpendicular to constant-percent-chord lines (figure 8.6.11) 
|| Parallel 


Greek Subscripts 


o*Based on displacement thickness 
i Momentum, as in Cy, (equation 4.5.1) 
§ Based on momentum thickness 
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t Friction velocity, in 


Superscripts 


' Fluctuating part 
Independent variable in integration 
Denotes a half singularity at a boundary of a surface (equation 5.2.5) 
—Time average 
Averaged along the length 
A Nondimensional 
* Denotes conditions at Mach | (equations 3.11.4 and 3.11.5) 
Displacement thickness, when used with 6 
+ Turbulent-boundary-layer wall variables (equations 4.4.3 and 4.4.4) 


= Tensor (equation 5.4.1) 


Acronyms and Abbreviations 


1D One dimensional 
2D Two dimensional 
3D Three dimensional 
BC Boundary condition 


BLC Boundary-layer control 

CFD Computational fluid dynamics 

CF Cross-flow 

CPU Central processing unit 

DES Detached-eddy simulation 

DNS Direct numerical simulation 

ESDU _ Engineering Sciences Data Unit 

GGNS __ General geometry Navier-Stokes 

HLFC — Hybrid laminar flow control 

LES Large-eddy simulation 

LFC Laminar flow control 

LTA Lighter than air 

NACA National Advisory Committee for Aeronautics 
NASA National Aeronautics and Space Administration 
NLF Natural laminar flow 

NS Navier-Stokes 

ODE Ordinary differential equation 

ONERA Office National d'Etude et Recherches Aérospatiales 
PC Personal computer 

PDE Partial differential equation 

RABL _ Reynolds-averaged boundary-layer equations 
RANS _ Reynolds-averaged Navier-Stokes equations 
SA Spalart-Allmaras 

SBVG _— Sub-boundary-layer vortex generator 

SST Shear Stress Transport 

TKE Turbulence kinetic energy 

TS Tollmien-Schlichting 

URANS. Unsteady Reynolds-averaged Navier-Stokes 
VG Vortex generator 


WINGOP Wing Optimization 


Chapter | 


Introduction to the Conceptual Landscape 


The objective of this book is to promote a solid physical understanding of aerodynamics. In general, any under- 
standing of physical phenomena requires conceptual models: 


It seems that the human mind has first to construct forms independently before we can find them in things. 
Kepler's marvelous achievement is a particularly fine example of the truth that knowledge cannot spring 
from experience alone but only from the comparison of the inventions of the intellect with observed fact. 


—Albert Einstein on Kepler's discovery that planetary orbits are ellipses 


Einstein wasn't an aerodynamicist, but the above quote applies as well to our field as to his. To understand the 
physical world in the modern scientific sense, or to make the kinds of quantitative calculations needed in engin- 
eering practice, requires conceptual models. Even the most comprehensive set of observations or experimental 
data is largely useless without a conceptual framework to hang it on. 

In fluid mechanics and aerodynamics, I see the conceptual framework as consisting of four major components: 

1. Basic physical conservation laws expressed as equations and an understanding of the cause-and-effect 
relationships those laws represent, 


2. Phenomenological knowledge of flow patterns that occur in various situations, 


3. Theoretical models based on simplifying the basic equations and/or assuming an idealized model for the 
structure of the flowfield, consistent with the phenomenology of particular flows, and 


4. Qualitative physical explanations of flow phenomena that ideally are consistent with the basic physics 
and make the physical cause-and-effect relationships clear at the flowfield level. 


By way of introduction, let's take a brief look at what these components encompass, the kinds of difficulties they 
entail, and how they relate to each other. 

The fundamental physical conservation laws relevant to aerodynamic flows can be expressed in a variety of 
ways, but are most often applied in the form of partial-differential equations that must be satisfied everywhere 
in the flowfield and that represent the local physics very accurately. By solving these basic equations, we can in 
principle predict any flow of interest, though in practice we must always accept some compromise in the physical 
accuracy of predictions for reasons we'll come to understand in Chapter 3. 

The equations themselves define local physical balances that the flow must obey, but they don't predict what 
will happen in an overall flowfield unless we solve them, either by brute force numerically or by introducing 
simplified models. There is a wide gulf in complexity between the relatively simple physical balances that the 
equations represent and the richness of the phenomena that typically show up in actual flows. The raw physical 
laws thus provide no direct predictions and little insight into actual flowfields. Solutions to the equations provide 
predictions, but they are not always easy to obtain, and they are limited in the insight they can provide as well. 
Even the most accurate solution, while it can tell us what happens in a flow, usually provides us with little under- 
standing as to how it happens or why. 


Phenomenological knowledge of what happens in various flow situations is a necessary ingredient if we are 
to go beyond the limited understanding available from the raw physical laws and from solutions to the equa- 
tions. Here I am referring not just to descriptions of flowfields, but to the recognition of common flow pat- 
terns and the physical processes they represent. The phenomenological component of our conceptual framework 
provides essential ingredients to our simplified theoretical models (component 3) and our qualitative physical 
explanations (component 4). 

Simplified theoretical models appeared early in the history of our discipline and still play an important role. 
Until fairly recently, solving the “full’’ equations for any but the simplest flow situations was simply not feas- 
ible. To make any progress at all in understanding and predicting the kinds of flow that are of interest in aero- 
dynamics, the pioneers in our field had to develop an array of different simplified theoretical models applicable 
to different idealized flow situations, generally based on phenomenological knowledge of the flow structure. 
Though the levels of physical fidelity of these models varied greatly, even well into the second half of the twen- 
tieth century they provided the only practical means for obtaining quantitative predictions. The simplified mod- 
els not only brought computational tractability and accessible predictions but also provided valuable ways of 
“thinking about the problem,” powerful mental shortcuts that enable us to make mental predictions of what will 
happen, predictions that are not directly available from the basic physics. They also aid understanding to some 
extent, but not always in terms of direct physical cause and effect. 

So the simplified theoretical models ease computation and provide some degree of insight, but they also have 
a downside: They involve varying levels of mathematical abstraction. The problem with mathematical abstrac- 
tion 1s that, although it can greatly simplify complicated phenomena and make some global relationships clearer, 
it can also obscure some of the underlying physics. For example, basic physical cause-and-effect relationships 
are often not clear at all from the abstracted models, and some outright misinterpretations of the mathematics 
have become widespread, as we'll see. Thus some diligence is required on our part to avoid misinterpretations 
and to keep the real physics clearly in view, while taking advantage of the insights and shortcuts that the simpli- 
fied models provide. 

We've looked at the roles of formal theories (components | and 3) and flow phenomenology (component 2), 
and it is clear that the combination, so far, falls short of providing us with a completely satisfying physical un- 
derstanding. Physical cause-and-effect at the local level 1s clear from the basic physics, but at the flowfield level 
it is not. Thus to be sure we really understand the physics at all levels, we should also seek qualitative physical 
explanations that make the cause-and-effect relationships clear at the flowfield level. This is component 4 of 
my proposed framework. 

Qualitative physical explanations, however, pose some surprisingly difficult problems of their own. We've 
already alluded to one of the main reasons such explanations might be difficult, and that is the wide gulf in 
complexity between the relatively simple physical balances that the raw physical laws enforce at the local level 
and the richness of possible flow patterns at the global level. Another is that the basic equations define implicit 
relationships between flow variables, not one-way cause-and-effect relationships. Because of these difficulties, 
misconceptions have often arisen, and many of the physical explanations that have been put forward over the 
years have flaws ranging from subtle to fatal. Explanations aimed at the layman are especially prone to this, but 
professionals in the field have also been responsible for errors. Given this history, we must all learn to be on the 
lookout for errors in our physical explanations. If this book helps you to become more vigilant, I'll consider it a 
SUCCESS. 

This completes our brief tour of the conceptual framework, with emphasis on the major difficulties inherent 
in the subject matter. My intention in this book is to devote more attention to addressing these difficulties than 
do the usual aerodynamics texts. Let's look briefly at some of the ways I have tried to do this. 

The theoretical parts of our framework (components | and 3) ultimately rely on mathematical formulations 
of one sort or another, which leads to something that, in my own experience at least, has been a pedagogical 


problem. It is common in treatments of aerodynamic theory for much of the attention to be given to mathem- 
atical derivations, as was the case in much of the coursework I was exposed to in school. While it 1s not a bad 
thing to master the mathematical formulation, there is a tendency for the meaning of things to get lost in the 
details. To avoid this pitfall, I have tried to encourage the reader to stand back from the mathematical details 
and understand “what it all means” in relation to the basic physics. As I see it, this starts with paying attention 
to the following: 

1. Where a particular bit of theory fits in the overall body of physical theory, that 1s, what physical laws 

and/or ad hoc flow model it depends on; and 


2. How it was derived from the physical laws, that is, the simplifying assumptions that were made; 
3. The resulting limitations on the range of applicability and the physical fidelity of the results; and 


4. The implications of the results, that 1s, what the results tell us about the behavior of aerodynamic flows 
in more general terms. 


The brief tour of the physical underpinnings of fluid mechanics in Chapters 2 and 3 is an attempt to set the stage 
for this kind of thinking. 

How computational fluid dynamics (CFD) fits into this picture is an interesting issue. CFD merely provides 
tools for solving the equations of fluid motion; it does not change the conceptual landscape in any fundamental 
way. Still, it is so powerful that it has become indispensable to the practice of aeronautical engineering. As im- 
portant and ubiquitous as CFD has become, however, it is not on a par with the older simplified theories in one 
significant respect: CFD is not really a conceptual model at the same level; and a CFD solution is rightly viewed 
as just a simulation of a particular real flow, at some level of fidelity that depends on the equations used and the 
numerical details. As such, a CFD solution has some of the same limits to its usefulness as does an example of 
the real flow: In both cases, you can examine the flowfield and see what happened, and, of course, a detailed 
examination of a flowfield is much easier to carry out in CFD than in the real world. But in both CFD and real- 
world flowfields, it 1s difficult to tell much about why something happened or what there is about it that might 
be applicable to other situations. 

Before we proceed further, a bit of perspective is in order: While correct understanding is vitally important, 
we mustn't overestimate what we can accomplish by applying it. As we'll see, the physical phenomena we deal 
with in aerodynamics are surprisingly complicated and difficult to pin down as precisely as we would like, and 
it is wise to approach our task with some humility. We should expect that we will not be able to predict or even 
measure many things to a level of accuracy that would give us complete confidence. The best we'll be able to 
do in most cases is to try to minimize our unease by applying the best understanding and the best methods we 
can bring to bear on the problem. And we can take some comfort in the fact that the aeronautical community, 
historically speaking, has been able to design and build some very successful aeronautical machinery in spite of 
the limitations on our ability to quantify everything to our satisfaction. 


Chapter 2 


From Elementary Particles to Aerodynamic 
Flows 


Step back for a moment to consider the really big picture and ponder how aerodynamics fits into the whole body 
of modern physical theory. The tour I'm about to take you on will be superficial, but I hope it will help to put 
some of the later discussions in perspective. 

First, consider some of the qualitative features of the phenomena we commonly deal with in aerodynamics. 
Even in flows around the simplest body shapes, there is a richness of possible global flow patterns that can be 
daunting to anyone trying to understand them, and most flows have local features that are staggeringly complex. 
There are complicated patterns in how the flow attaches itself to the surface of the body and separates from it 
(Figure 2.la, 2.1b), and these patterns can be different depending on whether you look at the actual time-depend- 
ent flow or the “mean” flow with the time variations averaged out. Even in flows that are otherwise steady, the 
shear layers that form next to the surface and in the wake are often unsteady (turbulent). This shear-layer turbu- 
lence contains flow structures that occur randomly in space and time but also display a surprising degree of or- 
ganization over a wide range of length and time scales. Examples include vortex streets in wakes and the various 
instability “waves,” “spots,” “eddies,” “bursts,” and “streaks” in boundary layers. Examples are shown in Figure 
2.1c—f, and many others can be found in Van Dyke (1982). Such features usually display extreme sensitivity to 
initial conditions and boundary conditions, so that their apparent randomness is real, for all practical purposes. 
The “butterfly effect” we've all read about doesn't just apply to the weather; the details of a small eddy in the 
turbulent boundary layer on the wing of a 747 are just as unpredictable. 





Figure 2.1 Examples of complexity in fluid flows, from Van Dyke (1982). (a) Horseshoe vortices in a laminar 
boundary layer ahead of a cylinder. Photo by S. Taneda, © SCIPRESS. Used with permission. (b) Rankine 
ogive at angle of attack. Photo by Werle (1962), courtesy of ONERA. (c) Tollmien-Schlichting waves and spir- 
al vortices on a spinning axisymmetric body, visualized by smoke. From Mueller, et a/. (1981). Used with per- 
mission. (d) Emmons turbulent spot in a boundary layer transitioning from laminar to turbulent. From Cant- 
well, et al (1978). Used with permission of Journal of Fluid Mechanics. (e) Eddies of a turbulent boundary 
layer, as affected by pressure gradients. Top: Eddies stretched in a favorable pressure gradient. Bottom: 
Boundary layer thickens and separates in adverse pressure gradient. Photos by R. Falco from Head and Bandy- 
opadhyay (1981). Used with permission of Journal of Fluid Mechanics. (f) Streaks in sublayer of a turbulent 
boundary layer. From Kline, et al (1967). Used with permission of Journal of Fluid Mechanics 
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How does all this marvelous richness and complexity arise? It is natural to expect that complexity in the flow 
requires complexity in the basic physics and that complex behavior in the flow must therefore have its origin at 
a “low level,” in the statistical behavior of the molecules that make up the gas or in the behavior of the particles 
that make up the molecules. But this natural expectation is wrong. Instead, the complexity we see arises from 
the aggregate behavior of the fluid represented by the continuum equations. In fact, the essential features of 
everything we observe in ordinary aerodynamic flows could be predicted from the equations for the continuum 
viscous flow of a perfect gas, that 1s, the Navier-Stokes (NS) equations, provided we could solve them in suffi- 
cient detail. 

But there are two caveats that must accompany this sweeping claim. The first is that although the NS equa- 
tions are a high-fidelity representation of the real physics, they are not exact. Imagine comparing a real turbulent 
flow with the corresponding exact solution to the NS equations, starting at an initial instant in which the theor- 
etical flowfield is exactly the same as the real one in every detail. We would find that the NS solution matches 
the detailed time history of the real flow only for a short time and then gradually diverges from it. Detailed time 
histories of flows, however, are rarely of much interest in aerodynamics, where a statistical description of the 
flow nearly always suffices. In a statistical sense, we expect that a real flow and the corresponding NS solution 
would be practically indistinguishable. The second caveat is that even this less ambitious claim of statistical 
equivalence is nearly impossible to evaluate quantitatively. For one thing, exact solutions to the NS equations 
are not practically available for anything but the simplest of flows, and agreement for these simple cases doesn't 
prove much. For all other flows, especially turbulent flows, we must settle for numerical solutions. Numerical 
calculations that fully resolve the turbulence down to the last detail have been carried out only for simple flow 
geometries and relatively low Reynolds numbers. Comparisons of such calculations with the real world, at the 
level of detailed time histories, would be extremely difficult, 1f not impossible, and have not yet been attempted. 
Comparisons of statistical quantities have been encouraging, but for some of the most interesting and revealing 
quantities, Reynolds stresses, for example, the uncertainties in the experimental measurements are large. Still, 
in spite of these reservations, I'm confident that the NS equations could in principle predict any phenomenon of 
interest in practical aerodynamics to an accuracy sufficient for any reasonable engineering purpose. This doesn't 


mean that they can do so practically, as we'll discuss later in connection with the computational work involved 
in generating solutions. 


The NS equations are, of course, part of a larger system of theory applicable to a hierarchy of physical do- 
mains. Figure 2.2 illustrates this by listing the levels of physical phenomena and the corresponding levels of 
physical theory that deal with them. Here and in the following discussion I use the word “level” both in a con- 
ceptual sense, as in levels of physical detail, and in a physical sense, as in levels of physical and temporal scale, 
from small to large. As already mentioned, the NS equations are an aggregate-level theory applicable to a phys- 
ical domain separated by several levels from that of the elementary particles that make up the molecules of the 
fluid. Starting at the lowest level shown and moving upward, each domain or level represents a narrowing of 
focus, a specialization to a particular situation or class of conditions. We can follow the same logical sequence 
in deriving the theoretical models, starting with the known properties of elementary particles and eventually 
reaching the NS equations. The historic development of the theories didn't follow this orderly progression, but 
the required steps have now been filled in, and the required assumptions and approximations are now under- 
stood. 


Figure 2.2 Hierarchy of domains of physical theory leading to computational and theoretical aerodynamics 
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I've already made the claim that the potential for complexity in aerodynamic flows arises at the level of 
continuum gas flows in Figure 2.2. Is this potential for complexity really independent of what goes on at the 


lower levels? How can this be? In all of the levels below that of the continuum gas, there is considerable local 
complexity, and this is reflected in the difficulty of the corresponding mathematical theories. However, adjacent 
levels are separated by huge gaps in physical scale and very little of the complexity at one level is felt at the 
next level up. These gaps in scale act a bit like low-pass filters that allow only certain “integrated” effects of the 
structure at lower levels to be felt at higher levels. For example, the structure of a nitrogen molecule is affected 
very little by the detailed internal structure of electrons and atomic nuclei. Then, although the electron cloud of 
a molecule has a complex structure, the details of that internal structure have very little effect on the statistical 
mechanics of a dilute gas made up of large numbers of molecules. Finally, the continuum properties of gases 
that really matter to us in aerodynamics are highly insensitive to the details of the molecular motions. For ex- 
ample, the viscous behavior of the fluid under shearing deformation that we assume in the NS equations does 
not even depend on the fact that the fluid is a gas; it is the same for liquids such as water. As we move up the 
line toward the NS equations, instead of increasing complexity, we encounter a series of natural simplifications, 
and surprisingly, these simplifications cost us little loss of physical fidelity for flows that interest us. 


So the interesting behavior we see in aerodynamic flows is not inherent in any of the lower levels of the 
physics. Instead, it emerges in the behavior of the fluid at the continuum level and can be captured in solutions 
to the NS equations. This is a bit surprising at first, because the physical balances represented by the equations 
at the local level aren't all that complicated. However, in recent years, studies in the field of complexity science 
have identified a broad class of seemingly simple systems that can exhibit complex behavior. The NS equations 
are one example of this kind of system, and solutions (fluid flows) commonly exhibit emergent behavior in 
which great complexity arises from many simple local interactions. The NS equations are, after all, a set of non- 
linear partial-differential field equations in space and time, with multiple, interacting dependent variables, and 
the space of possible solutions is huge. This, combined with the fact that the possible interactions between flow 
variables at different points in space are sufficiently rich, makes the emergence of complexity in the solutions a 
natural outcome. 

There is also a much more mundane way in which fluid flows must be considered complex, and that is that 
even some of the simplest flows are not easy to understand or “explain” even qualitatively in a satisfying way 
without appealing to mathematics. Why is it so difficult? Remember that we are dealing with multiple, inter- 
acting flow quantities (dependent variables) and that if we are to understand a flow properly, the behavior of 
these quantities must be known over some extended spatial domain. Understanding simultaneous behavior over 
an extended spatial domain 1s inherently difficult, and the problem of local physics versus global flow behavior 
contributes to the difficulty. The basic physical laws impose relationships between flow quantities locally, while 
global behavior is constrained by the requirement that these laws be satisfied everywhere simultaneously. A 
flowfield is a global phenomenon in which what happens at one point depends to some extent on what happens 
everywhere else. 


Another major difficulty has to do with assigning cause and effect. Here I'm not referring to the general 
philosophical difficulties of linking causes and their effects. The NS equations are based on Newtonian mech- 
anics and classical thermodynamics, in which clear physical cause-and-effect relationships are assumed. The 
problem is that when these physical laws are combined in the NS equations, they define relationships between 
different flow quantities, but they don't define one-way trains of causation. For example, consider Bernoulli's 
equation, which relates the pressure to the velocity under certain conditions. In trying to explain a flowfield, do 
we consider the behavior of the velocity to be known and then use Bernoulli's principle to deduce the behavior 
of the pressure? Or vice versa? In most situations, the right answer is “neither.” Cause-and-effect relationships 
in fluid mechanics tend to be circular, or reciprocal, in the sense that A and B cause each other and are caused by 
each other at the same time, and often at the same point in space. The upshot of this is that linear explanations 
assigning one-way cause and effect (A causes B, which in turn causes C) are nearly always wrong. Instead, we 
must seek explanations of the type that begin with a hypothesis and eventually come back around to consistency 


with the hypothesis. (If A, then B, then C, which is consistent with A.) We will consider the issue of cause and 
effect in greater detail with regard to basic fluid mechanics in Section 3.5. Later, we will see examples, such 
as nonmathematical explanations of the lift on a wing or airfoil in Chapter 7, where trying to force one-way 
causation has led to errors. 

So far, we have considered some of the generic characteristics of continuum flows governed by the NS equa- 
tions. We've seen how the emergence of complexity in the structure of the flows themselves is a natural outcome 
of the physics, and we've identified some of the reasons why understanding and explaining flows can be dif- 
ficult. We've looked at how the NS equations fit into the overall body of physical theory, and we've seen that 
they represent an aggregate-level theory that involves some simplifying approximations. I've also asserted that 
within their range of validity the NS equations are a highly accurate representation of reality and that they can 
in principle predict anything of interest in practical aerodynamics. 

Given this essentially complete predictive capability of the NS equations, we could say that they represent 
all of the “hard science” we should need in aerodynamics. But, of course, the NS equations by themselves are 
not enough, and there is much more to aerodynamics as a science than what we've seen so far on this brief tour. 
For the foreseeable future, we won't be calculating an NS solution every time we want to predict or understand 
what will happen in an aerodynamic flow, and until recently in the history of our discipline, we couldn't have 
done so anyway. The pioneers in the field had to devise other ways of getting answers: higher level theories 
that provided both intuitive insight and computational tractability. Such theories are usually derivable at least 
in part from the NS equations using additional simplifying assumptions, but often also depend on conceptual 
models of the flows in question. These theories therefore tend to be specialized to particular situations and to 
have more restricted ranges of applicability than the basic equations. The strategies are several and varied, and 
we'll consider them in some detail in later chapters. In any case, aerodynamic theory carries a considerable su- 
perstructure of higher level theoretical models in addition to the basic NS equations. 


Considering this body of aerodynamic theory as a whole, what is its status as science? Like everything else 
on the conceptual or theoretical side of science, it is “just a theory.” This is something the proponents of cre- 
ationism like to say of Darwinian evolution, implying that it is merely a tentative hypothesis. But of course 
evolution is much more than a tentative hypothesis, and so is aerodynamic theory. Both have been tested again 
and again against empirical observations, and have so far always passed the test. 

In principle, of course, any theory could be contradicted tomorrow by new evidence and end up needing to 
be replaced. The NS equations, however, are about as secure as a scientific theory can ever be. In the context 
of science as a whole, the NS equations purport to be valid only for a narrow range of phenomena, and within 
this range the NS equations are unlikely ever to be significantly contradicted. The higher level extensions of 
aerodynamic theory are also not likely ever to be completely overthrown. They, however, typically have even 
more limited ranges of applicability than the NS equations and are known to be seriously contradicted in com- 
mon situations that are outside their ranges but still within the realm of aerodynamics. In using the higher level 
conceptual models of aerodynamics, we must always keep their limitations in mind. 


So aerodynamics is on solid ground as a science. But what can we say of its general character? Where does 
it stand on the deductive/inductive spectrum? In this regard, it has a distinctly split personality. On one hand, 
we have an all-encompassing theory, the NS equations with a no-slip condition, which 1s solidly tied to the rest 
of modern physical theory and from which we can in principle deduce anything of interest in the field. On the 
other hand, the computational intractability of the equations has greatly limited what we can deduce from first 
principles in most situations, and much of what we know comes from empirical observations, that is, the induct- 
ive approach. This is something we've already discussed in connection with the vital role that phenomenological 
knowledge plays in our conceptual framework. 

The actual physics embodied in our theoretical framework consists of Newtonian mechanics and classical 
thermodynamics, combined with a mathematical formalism that enables us to bookkeep material properties, 


forces, and fluxes in a continuous material. Thus when we apply the framework correctly, we are adhering to 
what I call a Newtonian worldview, in which all effects must have causes that are consistent with Newtonian (or 
post-Newtonian) scientific principles. As we'll see, some of the errors that can arise in aerodynamic reasoning 
can be traced to regressions to pre-Newtonian ways of thinking. 


Chapter 3 


Continuum Fluid Mechanics and the Navier- 
Stokes Equations 


The Navier-Stokes (NS) equations provide us with a nearly all-encompassing, highly accurate physical theory 
that can predict practically all phenomena of interest in aerodynamics, including “aerodynamic” flows of liquids 
such as water. In Section 3.1, we briefly consider the general way in which these equations represent the physics, 
the assumptions that had to be made to arrive at them, and their range of validity. Then 1n the sections after that, 
we delve into the specifics of the equations and what they mean. 


3.1 The Continuum Formulation and Its 
Range of Validity 


In the NS formulation, the fluid is treated as a continuous material, or continuum, with local physical properties 
that can be represented by continuous functions of space and time. These continuum properties, of course, depend 
on the properties of the molecules that make up the gas or liquid and on the lower level physics of their motions 
and interactions. However, the continuum properties represent only the integrated effects of the lower level phys- 
ics, not the details. As I noted in Chapter 2, this provides a representation that is not merely adequate, but highly 
accurate over a wide range of conditions. 

The early historic development of the NS formulation followed an ad hoc approach, assuming continuum be- 
havior a priori and developing a model for the effects of viscosity based on experiments in very simple flows. 
Much of the hard work involved in this development was devoted to the development of the mathematical form- 
alism that was required to generalize from simple flows to more general ones. We'll touch again on mathematical 
formalism issues in Section 3.2. 

The NS formulation can also be derived formally from the lower level physics, with simplifying assumptions 
to get rid of the dependence on the details. For gases, the appropriate next lower level to start from is a statistical 
description of the motion of the molecules and the conservation laws that apply to them, as embodied in the 
Boltzmann equations. With reference to this kind of derivation, the statement that continuum properties represent 
only “integrated effects” takes on a literal meaning. We use time-and-space averaging, that is, integration, over 
molecular motions to define the continuum properties of the flow at every point in space and time: the density 
and temperature of the fluid, and its average velocity. For the definitions of these basic flow quantities, we don't 
have to make any simplifying assumptions beyond the averaging process itself and that the fluid must be suffi- 
ciently “dense” for the averages to “converge.” This “convergence” problem is one we'll consider in more detail 
below. 


Although the averaging process gives us rigorous definitions of our basic continuum flow quantities, it 
doesn't get us all the way to the NS formulation. When we apply the averaging process to the basic conserva- 
tion laws for mass, momentum, and energy, we get two different types of terms that represent separate sets of 
phenomena and end up requiring different assumptions: 

1. Terms in which only the simple averages defining the continuum density, temperature, and velocity 
appear explicitly. No further assumptions are needed because these are already the basic variables of the 
NS formulation. Terms of this type represent the local time rate of change of a conserved quantity or the 
convection of a conserved quantity by the local continuum velocity of the flow. 


2. Terms that involve averages of products of molecular velocities or products of a velocity component 
and the kinetic energy. Such terms represent transport of a conserved quantity relative to the local con- 
tinuum motion of the flow. The transport of thermal energy is just the heat flux due to molecular con- 
duction. The transport of momentum has the same effect as if a continuum material were under an in- 
ternal stress and is thus the source of both the local continuum hydrostatic pressure and the additional 
continuum “stresses” due to viscous effects. The averaging process alone leaves these terms in a form 
that still depends on statistical details of the molecular motions, and further simplifying assumptions are 
required to get them into forms that can be expressed as functions of our basic continuum flow variables. 


In the NS equations, the terms representing the above transport phenomena have very simple functional de- 
pendences on local continuum properties. The hydrostatic pressure is given by an equilibrium thermodynamic 
relation (an equation of state). The heat flux and the viscous “stresses” are given by gradient-diffusion expres- 
sions in which the flux of a conserved quantity is proportional to a gradient of the conserved quantity. Fluids 
exhibiting the simple behavior of the viscous stresses described in the NS equations are often referred to as 
Newtonian. To get to these simple forms from the general ones that we get from the averaging process requires 
some simplifying assumptions about the physics. For gases, we must assume the fluid is everywhere locally 
near thermodynamic equilibrium. This means that the probability distribution functions for molecular velocity 
that appear in the full transport expressions must be near their equilibrium forms, which in turn requires that sig- 
nificant changes can take place only over length and time scales that are long compared with the mean-free path 
and time. When these conditions are satisfied, that is, when the local deviations from equilibrium are small, the 
transport-related terms can be represented very accurately by the simple relationships we use in the NS equa- 
tions. 

The main relationships comprising the NS equations are the basic conservation laws for mass, momentum, 
and energy. To have a complete equation set we also need an equation of state relating temperature, pressure, 
and density, and formulas defining the other required gas properties. For aerodynamics applications it is usually 
a good approximation to assume the ideal gas law, along with a constant ratio of specific heats (y) and viscosity 
and thermal conductivity coefficients (u and k) that depend on temperature only. It seems counterintuitive that 
the transport coefficients up and k are well represented as being independent of density at constant temperature, 
but there is a simple way to understand why this is. As density increases, one might think that the transport coef- 
ficients should increase as well because there is more mass per unit volume to transport momentum and thermal 
energy. However, as density increases, the molecular mean free path decreases, which hinders molecular trans- 
port. At the ideal-gas level of approximation, the effects of increasing mass per unit volume and decreasing 
mean free path exactly offset each other. Thus, practically speaking, the effectiveness of molecular transport de- 
pends only on the average speed of the molecules, or the temperature. In some forms of the equations, the local 
speed of sound (“‘a’’) appears, which for an ideal gas also depends only on temperature. 

The NS equations, like any field equations, need boundary conditions (BCs). At flow boundaries, where the 
flow simply enters or leaves the domain, the NS equations themselves determine what combinations of BCs can 
be imposed and what combinations are required to “determine” the solution in various ways. For boundaries 
that are interfaces with other materials, for example, gas-solid or gas-liquid interfaces, the NS equations them- 


selves don't fully define the situation, and we need to introduce additional physics. According to theoretical 
models and experimental evidence, the interaction between most of the liquid and solid surfaces encountered in 
engineering practice and air at most ordinary conditions is such that the continuum velocity and temperature of 
the air accommodate almost perfectly to the velocity and temperature of the surface. Thus assuming no slip and 
no temperature jump at the “wall,” and imposing BCs accordingly, is an extremely good approximation. 

A correct physical interpretation of the no-slip BC requires care. In some popular descriptions, the fluid is 
said to “stick” or “adhere” to the surface. This description is not completely inappropriate, but it is misleading, 
especially in the case of gasses. Saying that something “adheres” conjures up an image of a bond that can with- 
stand tension as well as shear. Of course a gas cannot be put in tension at all, let alone form a tension-resisting 
bond with anything else. But the no-slip condition does assume no sliding between the fluid and the solid, so 
that with regard to shear, the fluid does behave as if it were adhering to the surface. 

The no-slip condition applies to both liquids and gasses. How it comes about is easier to explain in the case 
of a gas. While an occasional gas molecule may adhere temporarily to a solid surface (or react chemically with it 
and remain more permanently), an overwhelming fraction of the molecules that impinge on the surface bounce 
off. The no-slip condition arises from the nature of these bounces. First, imagine the gas molecules as smooth 
spheres bouncing specularly off a smooth surface and not losing any tangential momentum in the process. In 
this case, there would be no shear force exchanged between the surface and the gas, the gas would slip easily 
along the surface, and there would be no no-slip condition. But on the scale of the molecules, no real surface 
acts as a smooth surface. All real surfaces consist of atoms similar in size to the gas molecules, and thus even 
the smoothest is rough on the scale of a gas molecule. And most real surfaces have considerable roughness 
on larger scales as well. The upshot is that gas molecules impinging on real surfaces bounce off in effectively 
random directions, which forces the average tangential velocity of molecules near the surface to be very small. 
Kinetic theory can be used to estimate the effective slip velocity (see White, 1991, Section 1-4.2), showing that 
in practical situations it is practically zero. And this must be true even for surfaces that feel slick to the touch, 
for which our intuition wrongly imagines air being able to slide freely. 

Thus our complete physical model consists of the NS equations combined with the no-slip and no- 
temperature-jump BCs. The range of applicability of this formulation is very broad, and there are only a few 
applications of practical “aerodynamics” interest where it doesn't apply. Some examples of such exceptions are 
gas flows at very low densities (for example, very high altitude) and the detailed internal structure of shock 
waves. Even flows in which ionization, dissociation, or chemical reactions take place are not generally excep- 
tions, because such effects can be incorporated into our continuum formulation by the inclusion of appropriate 
species-concentration variables, reaction-rate equations, and equations of state. Fortunately for us in aerody- 
namics, we don't have to deal with the complexities of non-Newtonian liquids, which are important in biological 
systems and many industrial processes. 

So what is it that causes our NS formulation not to apply in the exceptional situations? Is it that very low 
densities in very high altitude flight, or very small length scales as in the shock-wave problem, cause our av- 
eraging process not to converge, a possibility I alluded to earlier? This can happen, but in many cases it is not 
the cause of “failure.” Of course, at a single instant in time, the convergence of a spatial average would require 
integrating over a large enough volume to include a large number of molecules. Such instantaneous spatial av- 
erages might not resolve the internal structure of a shock wave very well, for example, but many flows are close 
enough to being steady that we can get around this problem and define averages in small spatial volumes by 
averaging over a sufficiently long period of time. I would think that most of the interesting cases of flight at 
extreme altitudes or of detailed shock-wave physics can be resolved in this way. In these cases, then, the “fail- 
ure” of our continuum formulation comes not from the failure of our averaging process to converge, but from 
the failure of the local-thermodynamic-equilibrium assumption behind our modeling of the “transport” effects 
when flow gradients become significant on the scale of a mean-free path. Another thing that tends to happen 


under such conditions is that the errors inherent in the no-slip and no-temperature-jump BC, negligibly small 
under “ordinary” conditions, become much bigger fractions of the differences in flow quantities in the rest of 
the field, and these approximations break down as well. 


3.2 Mathematical Formalism 


Now let's consider some of the issues that arise in casting our formulation of the physics in mathematical terms. 
Our final formulation will take the form of a set of partial-differential field equations (PDEs), along with some 
algebraic auxiliary relations. The variables we use, as well as which variables are independent and which are 
dependent, depend on how we choose to describe the flow. We can choose to describe it in terms of what hap- 
pens as seen at “fixed” points in space and time, the so-called Eulerian description, or we can choose to define 
the trajectories of “fixed” parcels of fluid as they evolve in time, the Lagrangian formulation. In the Eulerian 
description, time and the coordinates in some spatial reference frame, which may or may not be inertial, are 
the independent variables, and the velocity, pressure, and other state variables of the fluid are dependent. In the 
Lagrangian description, the independent variables identify the fluid parcels, for example, in terms of their spa- 
tial coordinates at an initial instant, and the dependent variables include the spatial coordinates of the parcels 
at succeeding instants. These two modes of description are in principle equivalent in the sense that they can be 
used to model exactly the same physics, but they do it in such different ways that they are not practically inter- 
changeable. 

For most purposes, the Eulerian framework is more convenient and is therefore the basis for nearly all quant- 
itative work in theoretical aerodynamics and computational fluid dynamics (CFD). A major reason for this is 
that the Eulerian description provides a much more natural framework for treating steady flows, which are the 
predominant focus of aerodynamics. All of the higher level conceptual modeling we'll encounter involves the 
Eulerian description, but we'll still find it helpful to invoke the Lagrangian description in some of our discussion 
of the basic physical laws. 

The time rate of change of any physical quantity (e.g., velocity and temperature) associated with a Lagrang!- 
an fluid parcel is called the Lagrangian derivative and 1s usually denoted by the upper case D/Dt. This Lagrangi- 
an rate of change is made up of contributions from either or both of two effects, as seen in the Eulerian frame. 
First, the quantity may be changing with time at the points in space through which the parcel is moving, as 
reflected in the unsteady-flow term O/ot, or the Eulerian rate of change. Second, if the parcel is moving with 
velocity V through a nonuniform field, it must experience a rate of change V © V inaddition to the unsteady- 
flow term. In general, then, the Lagrangian derivative is related to derivatives in the Eulerian frame by 

= Z +VeY¥. 

3.2.1 Dt at 

This transformation has interesting consequences when we apply it to the fluid velocity, to determine the 
Lagrangian acceleration. For example, for the simplest case of a 1D steady flow, Equation 3.2.1 applied to the 
velocity reduces to 


Du Ou 
— = i—. 
3.2.2 Dt ax 
From this we see that a given material acceleration Du/Dt requires a large spatial gradient Ou/Ox when u 


is small, but only a small Ou/Ox when u is large. This is a consequence of a Lagrangian fluid parcel's motion 
through the velocity field. In Section 3.4.6, we'll look at how the convective acceleration appears in the con- 


vective terms in the momentum equation, and in Sections 4.1.2 and 4.2.1 we'll consider it again in the context 
of boundary-layer flows. 

A factor that complicates the mathematics is that some of the quantities we must deal with are vectors and 
tensors. The velocity is a vector, and the equation for conservation of momentum is a vector equation. In 3D, 
this results in three variables and three of our equations, which 1s a pretty straightforward thing to grasp intuit- 
ively. 

What is less intuitive is the problem of representing the transmission of forces by “contact” between adjacent 
fluid parcels. Physically speaking, of course, these forces are a result of momentum transfer by molecular mo- 
tions, but in the continuum formulation the integrated effects of the motions of many molecules are represented 
as apparent internal stresses in the fluid, or forces per unit area of the boundary of a parcel. We'll encounter this 
idea again in our discussion of the momentum equation in Section 3.4.2. The mathematical problem we face 
is the general problem of representing the state of stress in a continuous material. First, we have to get used to 
the idea of imaginary boundaries separating adjacent parcels of material. Then we must visualize how any two 
adjacent parcels exert equal-and-opposite stresses on each other across their common bounding surface. Our 
description must be able to define the state of stress at any point in the fluid in such a way that it gets the right 
value for the opposing forces for any orientation of the imaginary boundary. The stress is a force per unit area (a 
vector) that depends on the orientation of an imaginary dividing surface, which can be defined by the direction 
of the normal to the surface (another vector). 

The stress is thus a tensor. An entire field of mathematics, tensor analysis, was developed just to provide 
rigorous means for mathematical manipulation of such quantities, not just in continuum mechanics, but in other 
branches of physics as well, and along with it came powerful shorthand notations for expressing the manipu- 
lations. Tensor notation provides the least error-prone way to deal with the stress terms and convection terms 
in our equations, especially when it comes to deriving the many terms that arise when the equations are trans- 
formed to different coordinate systems. Such manipulations can be done without tensor notation, but avoiding 
errors becomes much more difficult. With or without tensor notation, however, such manipulations quickly be- 
come exercises in manipulating symbols, and it can be difficult to maintain any grasp of the physical meaning. 
We'll try to reach a physical understanding of the most important aspects of the viscous stresses by using very 
simple flow situations as examples in Section 3.6. 

So far, we have talked about the NS equations only in their local or differential form, which 1s the form that 
will relate most directly to most of our succeeding discussions. However, in some applications, a more glob- 
al view of the flow suffices and can be easier to deal with. For these situations, we have the control-volume 
form of the equations, in which the equations have been integrated over a volume and the surfaces bounding the 
volume. The control-volume equations are “exact” in the sense that there is no loss of accuracy relative to the 
differential equations, but they are “simplified” in the sense that they can tell us only what happens to integrated 
quantities and nothing about how the local quantities are distributed over the volume and bounding surfaces. 
We'll use the control-volume approach to calculate viscous drag in Section 6.1.4 and lift-induced drag in Section 
8.3.2, and we'll consider the general approach further in Chapter 9. 

In ordinary solutions to the NS equations, all flow quantities are continuous and differentiable, even through 
shocks (see more about shocks in Section 3.11.2). This brings some powerful mathematical facts to our aid, 
without our having to introduce any “physics” at all, which brings us to the topic of the next section. 


3.3 Kinematics: Streamlines, Streaklines, 
Timelines, and Vorticity 


Kinematic descriptions are basic to all our attempts to understand flowfields. Obviously, we must understand 
the kinematic structure of a flow before we can understand the underlying dynamics. The kinematic structure of 
a flowfield is constrained to have certain characteristics because the velocity field is a continuous vector field. 


3.3.1 Streamlines and Streaklines 


Two kinematic concepts we often appeal to are streamlines and streaklines. Streamlines are simply 3D space 
curves, defined as being everywhere parallel to the velocity vector. A streakline is also a 3D space curve, but is 
defined by the locations of a string of Lagrangian fluid parcels that all passed through a given “point of origin” 
somewhere upstream in the flowfield. (We introduced the Lagrangian description of the flowfield in Section 
3.2, and we'll define what a Lagrangian fluid parcel means more precisely in Section 3.4.) The point of origin 
for a streakline is usually taken to be a fixed point in space, but it needn't be; it can be allowed to move with 
time. A streamline is obviously a mathematical construct that can be defined only by “solving” a mathematical 
problem, that is, “construct a curve everywhere parallel to a given vector field.” A streakline, on the other hand, 
can be “realized,” at least approximately, in real flows that are marked by a passive contaminant such as dye in 
liquids or smoke in air. 

If the flow is steady, streamlines and streaklines from fixed points of origin will coincide and will be the same 
as individual particle paths, that 1s, the paths of individual Lagrangian parcels. Even in steady flows, however, 
interesting issues can arise in the interpretation of flow patterns. Figure 3.3.1, for example, compares a stream- 
line pattern constructed from a steady-flow CFD solution with the corresponding nominally steady streakline 
pattern marked by dye 1n a water tunnel. The two patterns should of course be the same, and on close inspection 
they agree about as well as a CFD calculation and an experiment can be expected to. But at a glance, the two 
images give very different impressions. In the CFD solution, the separation at about 60% chord and the form- 
ation of a closed separation bubble stand out clearly, while in the water-tunnel photo the separation is evident 
only if you look very carefully. Part of the problem is that the field of view of the photo doesn't show the whole 
length of the separation bubble. I must admit that I didn't realize that this flow separates ahead of the trailing 
edge until I saw the CFD streamlines. And I'm in good company: Van Dyke (1982) published this same photo 
with the comment that the flow “appears to be unseparated.” 


Streamlines and streaklines in the entirely laminar steady flow around a NACA 64A015 airfoil at 
zero incidence, R = 7,000. (a) Streamlines in a steady-flow CFD solution showing a separation bubble starting 
at about 60% chord. Laminar RANS calculation by Steven R. Allmaras. (b) Streaklines marked by dye re- 
leased from upstream in a water tunnel. The streaklines closest to the trailing edge apparently consist of dye 
streaming forward from the closure region of the separation bubble, beyond the right edge of the photo. Aft of 
mid-chord there are variations in streakline spacing that are not present in the CFD solution. Photo by Werle, 
1974, courtesy of ONERA 





If the flow is unsteady, the situation is much more complicated, and streamlines, streaklines, and particle 
paths will generally all be different. Looking at the pattern formed by any one of them gives an incomplete and 
usually misleading picture of the flow. shows how different the unsteady flow in the wake of a cir- 
cular cylinder looks in terms of streaklines (a) and streamlines (b). Timelines (c), which we'll define in Section 
3.3.3, also present a very different picture. 


, Unsteady wake (von Karman vortex street) of a circular cylinder at Reynolds numbers in the 
range 136—140. Photos by S. Taneda, © SCIPRESS. Used with permission. (a) Streaklines marked by dye in- 
troduced at the cylinder surface. (b) Streamlines approximated by short time exposure of suspended particles. 
Used with permission. Photos by S. Taneda, © SCIPRESS. Used with permission. (c) Timelines marked by 
hydrogen bubbles from a pulsed wire upstream. Photos by S. Taneda, © SCIPRESS. Used with permission 





3.3.2 Streamtubes, Stream Surfaces, and the Stream 
Function 


The concept of a streamtube is one that is usually applied only to steady flows. The definition of a streamtube 
starts with a closed curve in the flowfield, as illustrated in Figure 3.3.3a. Steady streamlines or streaklines 
passing through all points on the curve define a surface that forms the boundary of a curvilinear tube. Because 
the bounding surface is parallel to the velocity vector, no continuum fluid parcel passes through it. In a steady 
flow, according to the principle of continuity, which we'll discuss in the next section, the mass flux in a 
streamtube is the same at any cross section along its length. In a 2D flowfield, we could still define a streamtube 
the same way we did in 3D, using a closed curve to define the boundary, but a more useful definition is to allow 
the closed curve defining the streamtube to degenerate into two points, so that the streamtube becomes a 2D 
layer of flow defined by one streamline through each point, as illustrated in Figure 3.3.3b. 


Figure 3.3.3 Illustrations of streamtubes. (a) As a compact streamtube defined by a closed contour in a 3D 
flow. (b) As a sheet of flow defined by two points in a 2D flow 





(b) 


The bounding surface of a streamtube 1s a special case of the more general concept of a stream surface, which 
also is usually applied only to steady flows. The space curve from which a stream surface originates needn't be a 
closed curve, and a stream surface needn't form a closed tube. A general stream surface 1s also a surface through 
which no continuum fluid parcel passes. In 3D flows, stream surfaces that start out relatively flat can become 
highly contorted as the flow progresses downstream. 

The stream function is a concept that applies only to 2D flows. Considering any two points A and B in a 2D 
flow, the mass flux across any curve joining the two points depends only on the locations of the points and on 
time, provided the flow is either incompressible or steady. (For example, for the two points in Figure 3.3.3b, the 
mass flux across any contour joining the points is the mass flux in the shaded streamtube.) Thus if we fix point 
A, the mass flux defined in this way for all other points B defines a single-valued function we call a stream 


function. It follows then that the stream function is constant along streamlines and that the difference in its value 
between two streamlines is the mass flux in the streamtube bounded by them. The stream function was used 
more frequently in the past than it is now. It was often used in earlier theoretical discussions of incompressible 
flows (see Section 3.10) and was sometimes used in numerical methods for solving the NS equations in 2D. 


3.3.3 Timelines 


Another useful kinematic concept is that of timelines, which are usually considered most useful in 2D flows, 
though they can be defined in any flow, steady, or unsteady. The definition starts with the marking of a string 
of Lagrangian fluid parcels arrayed across the flow at some initial instant. A timeline is then the space curve 
defined by that same string of parcels at some future instant. Timelines are most useful when defined in sets of 
multiple lines whose initial instants are separated by equal time intervals. In real flows, timelines can be realized 
approximately by passive-contaminant markers, usually emanating from a fine wire stretched across the flow. 
In air, the wire is coated with oil, and a pulsed electric current in the wire produces brief puffs of smoke, mark- 
ing cross-stream lines that convect downstream. In water, electric pulses can produce lines of tiny hydrogen or 
oxygen bubbles that mark the flow. In Figure 3.3.2c, we saw timelines in the flow past a circular cylinder. 

Figure 3.3.4 shows an example of timelines in a turbulent boundary layer, illustrating a key aspect of 
timelines in turbulent flows. In a fully turbulent boundary layer, the turbulent velocity fluctuations are not large 
fractions of the mean velocity, and as a result, the younger timelines near the left edge of the photo remain 
ordered and build up distortions slowly, looking as if they were in a smoother flow than that in the rest of the 
photo. As the flow progresses from left to right, the distortions accumulate until, in the right half of the photo, 
the timelines that are entirely inside the boundary layer appear as a chaotic jumble. In this fully turbulent flow, 
the timeline picture gives the misleading impression that the intensity of the turbulent motions is increasing 
from left to right. In steady flows, timeline patterns tend to be simpler and less prone to misinterpretation, as 
we'll see later in the case of a circular cylinder in ideal potential flow in Figure 5.1.3c. 


Figure 3.3.4 Timelines in a turbulent boundary layer in water, marked by hydrogen bubbles from a pulsed 
wire at the left edge of the photo. 
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3.3.4 The Divergence of the Velocity and Green's Theorem 


The fact that the velocity is a continuous and differentiable vector field also means that the usual theorems of 
vector analysis apply. Some constrain the physics in ways that can simplify our task considerably. 


Green's theorem relates the divergence of the velocity to a surface integral: 


[][ vevav= |] Venas. 
3.3.1 J. J. 


where the triple integral is over any volume occupied by the fluid, and the double integral is over the surface 
that encloses the volume. This is a key ingredient in derivations of the control-volume forms of the equations. 


3.3.5 Vorticity and Circulation 


Relationships involving the vorticity are extremely useful for purposes both of conceptualizing and doing quant- 
itative calculations. In this section, we'll concentrate on the kinematics of vorticity and its relationship to the 
velocity field. We'll encounter the Biot-Savart law, which leads naturally to the idea of vortex induction, and 
we'll take pains to come to a correct understanding that induction is not a dynamic phenomenon, as implied by 
the way people often talk about it, but is instead a strictly kinematic concept. The real dynamical aspects of 
vorticity are yet to come, in Sections 3.6 and 3.8 and in the discussion of lift in Chapters 7 and 8. 


The vorticity is just the curl of the velocity: 
33.2" = Vx V, 

fonwnieh it follows by a basic vector-calculus identity that the vorticity is divergence free: 
3.3.3 Vew = 0). 


If we know the vorticity at a point in a flowfield, we know something about how the velocity varies in the 
neighborhood of that point, but we don't know everything. In a constant-density flow, the deviations in velocity 
in the neighborhood of a point can be expressed as the sum of two parts: a deformation velocity field and a 
solid-body rotation with angular velocity w/2, a result known as Helmholtz's first theorem (see Milne-Thomson, 
1966, Section 3.22). Examples of how these two components of motion look in isolation and in combination in 
2D flow are illustrated in Figure 3.3.5. In each sketch, a square fluid parcel and perpendicular lines bisecting 
it at an initial instant are shown as solid lines, and the same lines anchored in the fluid are indicated at a later 
instant by dashed lines. Because we're interested in velocity deviations, we anchor our reference frame to the 
fluid at the center of the square and show the center as not moving. In Figure 3.3.5a we have a pure solid-body 
rotation in which the two bisecting lines have rotated in the same direction by the same amount. In Figure 3.3.5b 
we have a pure deformation in which the vorticity, and therefore the average angular velocity of the fluid, are 
zero. This is reflected in the fact that the bisecting lines have rotated by equal-and-opposite amounts. In Figure 
3.3.5c we have added components Figure 3.3.5a,b together. The result is a simple shearing motion in which the 
horizontal bisecting line hasn't rotated at all, and the vertical bisecting line has rotated twice as far as in either 
Figure 3.3.5a or b. The average angular velocity, and thus the vorticity, are the same as in Figure 3.3.5a. These 
examples are 2D, but they are indicative of what these effects would look like in 3D in planes perpendicular to 
the vorticity vector. 


Figure 3.3.5 Illustrations of the effects of solid-body rotation and deformation on initially square fluid parcels 
in 2D flow. The square and perpendicular lines bisecting it at an initial instant are shown as solid lines, and the 
same lines anchored in the fluid are indicated at a later instant by dashed lines. (a) A pure solid-body rotation 
in which the two bisecting lines have rotated in the same direction by the same amount. (b) A pure irrotational 
deformation in which the vorticity, and therefore the average angular velocity of the fluid, are zero. This is re- 
flected in the fact that the bisecting lines have rotated by equal-and-opposite amounts. (c) Components (a) and 
(b) added together. The result is a simple shearing motion in which the horizontal bisecting line hasn't rotated 
at all, and the vertical bisecting line has rotated twice as far as in either (a) or (b). The average angular velo- 


city, and thus the vorticity, are the same as in (a) 





(c) 


The deviation velocity components comprising the solid-body rotation part of the motion are in a plane per- 
pendicular to the vorticity vector, but there is no such constraint on the deformation part. Note that the vorticity 
does not determine the deformation part of the field, but that the vorticity and the solid-body-rotation part of 
the field are proportionally related. Thus at any point where the local velocity field has a solid-body-rotation 
component to it, the vorticity must be nonzero. Likewise, if the vorticity is zero, there is no solid-body rotation 
component, and because of this, flows with zero vorticity are often called irrotational. 


Much of our theorizing in subsequent sections will make use of the interplay between vorticity and the cir- 
culation, which 1s defined as the line integral of the velocity around a closed contour. The crucial relationship is 
given by Stokes's theorem: 


[= f Vetdl= [| wends 
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where the double integral is over any continuous surface that is bounded by the contour and is piecewise smooth 
over its entire area, as shown in Figure 3.3.6. For Equation 3.3.4 to hold, the integrands need merely to be in- 
tegrable; they needn't be continuous. Thus creases in the surface are allowed, because they occupy zero area, or 
a subset of zero measure, as the mathematicians would say. Likewise, kinks in the closed contour are allowed. 





Figure 3.3.6 Illustration of a closed contour and bounded surface to which Stokes's theorem applies 





What Stokes's theorem says, in words, is that the circulation around a closed contour is equal to the flux of 
vorticity through the contour. When we discuss dynamics in Section 3.8, we'll find this relation useful for draw- 


ing conclusions about the persistence of the circulation, or lack of circulation, and often we'll be able to do so 
without any actual calculation. 


3.3.6 The Velocity Potential in Irrotational Flow 


If the flow 1s irrotational (zero vorticity everywhere), we can derive a further useful result from Equation 3.3.4, 
which now states that the circulation around any closed contour must be zero. We start by noting that for any 
two points A and B in the field, we can define many different closed paths from A to B and back to A. Because 
the line integral of the velocity around any of these paths must be zero, the line integral along one path segment 
from A to B must be the negative of the line integral along the other path segment from B to A, and because this 
must be true for any choice of the two path segments, both line integrals must be independent of the path taken. 
Now if we fix point A, the line integral from A to any other point B is a single-valued scalar function of the 
location of point B, and it is easy to show that the velocity vector must be equal to the gradient of that function. 
Thus whenever the velocity field is irrotational, 1t can be expressed as the gradient of a scalar function we call 
a velocity potential 0: 


3.3.5 V = Vo. 


The existence of a velocity potential can greatly simplify the analysis of inviscid flows by way of potential- 
flow theory, which we'll discuss further in Section 3.10. In the above discussion, we assumed the simplest situ- 
ation, in which the vorticity is zero everywhere. However, many of the situations in which potential-flow theory 
is applied are not so simple. Many practical flows are effectively irrotational everywhere except for isolated 
concentrations of vorticity. Potential-flow theory can still be applied in the irrotational parts of these flows, but 
special treatments are required to account for the presence of the isolated vorticity. Examples include the jumps 
in velocity potential that must be allowed across vortex sheets, as discussed in Section 3.3.7, and the special 
treatment required in the potential-flow theory for 2D airfoils, for which the region of irrotational flow is not 
simply connected, as we'll discuss further in Section 7.1. 


3.3.7 Concepts that Arise in Describing the Vorticity Field 


We have a variety of concepts that are useful for thinking about how vorticity is distributed in the flowfield. The 
first ones we'll consider are applicable to the usual realistic situation in which vorticity 1s continuously distrib- 
uted. 


Anywhere that the vorticity is nonzero, we can define a vortex line as a curve in space that 1s parallel to the 
vorticity vector, just as a streamline is parallel to the velocity vector. So a vortex line in the vorticity field is 
analogous to a streamline in the velocity field, and just as we extended the concept of a streamline to define a 
streamtube, we can extend the concept of a vortex line to define a vortex tube. By definition, the flux of vorticity 
across the bounding surface of a vortex tube 1s zero. This, combined with Equation 3.3.3, means that the flux 
across any cross-section of the tube, anywhere along its length, 1s the same. 

The fact that the vorticity flux in a vortex tube is constant dictates the changes in vorticity magnitude that 
must accompany vortex stretching. If the cross-sectional area of a vortex tube decreases, either in time or along 
the length of the tube, the strength of the vorticity (the magnitude of the vorticity vector) must increase. For a 
section of vortex tube containing a given amount of fluid, a reduction in cross-sectional area usually requires an 
increase in length, or a stretching. (It definitely requires it if the fluid density is constant, for reasons we'll dis- 
cuss in Section 3.4.1 in connection with the conservation of mass.) Thus the stretching of a vortex tube usually 
increases the local vorticity magnitude. 


A vortex filament is a vortex tube whose cross section has a maximum dimension that is infinitesimally 
small. The cross-sectional area of a vortex filament is thus also infinitesimally small, but it is still assumed to 
vary along the length of the filament, so that the filament can still satisfy the definition of a vortex tube. For 
a vortex filament, the flux of vorticity across a cross-section reduces to the product of the vorticity magnitude 
and the cross-sectional area, which 1s called the intensity of the filament. Note that this definition of the intens- 
ity as the flux of vorticity through an infinitesimal area is different from other concepts of intensity you may 
be familiar with, for example, the intensity of a light beam, which is defined as the energy flux per unit area. 
The result that the intensity of a vortex filament is constant along its length is Helmholtz's second theorem (see 
Milne-Thomson, 1966, Section 3.22). This conservation of intensity means that a vortex filament cannot end 
anywhere inside the fluid domain and must either form a closed loop (vortex loop) or end on the boundary of 
the domain. 

Depending on the nature of the boundary, there will be constraints on how vortex filaments or vortex lines 
can end there. First, consider the special case of an isolated vortex filament that is surrounded by irrotational 
flow. If the flow is steady, and the boundary is an interface across which the fluid cannot flow, such a vortex 
filament can intersect the boundary only in the normal direction. This is so because there must be an essentially 
circular flow pattern in the neighborhood of the filament, in planes perpendicular to the filament, as we'll see in 
Section 3.3.8. This would violate the no-through-flow condition at the boundary if the filament were not normal 
to the boundary. Further, if the boundary is a stationary solid surface at which the no-slip condition applies, the 
velocity components in planes perpendicular to the filament must vanish at the wall, and the vorticity magnitude 
must go to zero. Thus an isolated vortex filament cannot end at all at a solid surface with a no-slip condition. 

In the more general case of distributed vorticity, vortex lines may intersect a no-through-flow boundary at 
which there is slip, and the intersection need not be in the normal direction. On the usual kind of stationary sur- 
face with no slip, the situation is much more constrained. Because the tangential velocity is zero on the surface, 
the component of vorticity normal to the surface must be zero everywhere on the surface. Then if the magnitude 
of the vorticity is nonzero, the vortex lines must be tangent to the surface. In the viscous flow about a station- 
ary body, this applies practically everywhere on the surface. The only exceptions are isolated singular points of 
separation or attachment, which we'll discuss further in Section 5.2.2 and which are places where the magnitude 
of the vorticity on the surface is zero. At such a point, a vortex line may intersect the surface in the normal 
direction, but it does so with a “whimper,” because the normal component of the vorticity must still go to zero 
where the line intersects. Thus vortex lines can intersect a no-slip surface only at isolated singular points. It 1s 
sometimes erroneously stated that vortex lines cannot intersect a no-slip surface at all, without acknowledgment 
of the above exception (as pointed out by Saffman, 1992; Section 1.4). 

So we see that when vortices approach a solid no-slip surface anywhere other than at an isolated singular 
point, the vortex lines must turn to avoid intersecting the surface, and in doing so they often become part of the 
vorticity in a viscous boundary layer on the surface. Figure 3.3.7 illustrates one such situation, the “inlet vortex” 
that often forms when an engine inlet is near the ground, in this case made visible by water droplets. Lines were 
added to the photo to illustrate how the vortex lines must spread out in all directions along the ground in this 
situation. (There is often a spiral component to the vortex lines' alignment, which is omitted here for clarity.) 
Though this vortex is not a single thin filament, it is fairly concentrated and is surrounded by mostly irrotational 
flow. Just outside the boundary layer on the ground, the vortex is close to perpendicular to the ground, roughly 
in keeping with our conclusion above that an isolated vortex filament must approach a no-throughflow bound- 
ary in the normal direction. 


Figure 3.3.7 An inlet vortex marked by water droplets. Because all but one of the vortex lines cannot end on a 
solid surface with a no-slip condition, the vortex lines must spread out in the boundary layer on the ground as 
illustrated. The possible spiral component to the vortex lines' alignment is omitted here for clarity. Modified 
from original photo by Alastair Bor. 





Now let's consider concepts that were developed for idealized models of flows with highly concentrated vor- 
ticity. Concentrations of vorticity in limited regions are important features in some flows we'll study later. For 
example, as we'll see in Chapter 8, the vorticity in the wake behind a lifting wing starts out concentrated in a 
relatively thin shear layer and ends up concentrated 1n two isolated, more or less axisymmetric vortices, all sur- 
rounded by practically irrotational flow. In conceptual models of such flows, these vortical structures are often 
idealized as mathematically thin concentrations, with the shear layers idealized as vortex sheets, and the vortices 
as line vortices. These idealized entities carry vorticity fluxes that are finite even though the vorticity is concen- 
trated in a region with zero cross-sectional area. The vorticity distribution must therefore be singular, or infinite, 
at the location of the sheet or line. For a vortex sheet, we must generally integrate over a cut through a finite 
width of sheet to find a finite flux of vorticity, though the area we have integrated over is still zero, because the 
sheet is infinitely thin. For a line vortex, we need only integrate over a single cut through the line (a point) to 
find a finite flux. There is a formal mathematical theory that makes all of this rigorous, but we won't go into it 
here because the concepts can be understood well enough without it. 


A line vortex looks superficially similar to the vortex filament we defined earlier, but there are crucial differ- 
ences. The cross-sectional area of a line vortex is zero, while that of a filament 1s infinitesimal, and the vorticity 
flux of a line vortex is finite, while that of a filament is infinitesimal. We must also take care not to confuse 
a line vortex, which is a singular distribution of vorticity, with a vortex line, which is simply parallel with the 
vorticity vector, usually in fields in which vorticity is continuously distributed. 

A line vortex in a 2D planar flow is often called a point vortex, because it must be a straight line that extends 
to infinity in both directions perpendicular to the 2D plane and therefore appears as a single point in the 2D 
plane. The line vortex is one of the elementary singularities that can be used as a building block to construct 
solutions in potential-flow theory, as we'll discuss further in Section 3.10. In more general flows, a line vor- 
tex would usually be curved, a situation that raises a special problem. At any point on a line vortex where the 
curvature of the vortex is nonzero, the fluid velocity at that point on the vortex, in a direction perpendicular to 
the vortex, must be infinite. This makes it impossible to determine a realistic velocity at which such a vortex 


line will be convected by the flow. (Convection of vorticity is discussed in Sections 3.6 and 3.8.) In real flows, 
the vorticity 1s spread out continuously and has finite magnitude, and such infinite velocities do not occur. 


3.3.8 Velocity Fields Associated with Concentrations of 
Vorticity 


We've just seen that highly concentrated vorticity is often idealized as a vortex sheet or a line vortex. With 
Stokes's theorem in hand, we are in a position to determine the nearfield velocity distributions that must accom- 
pany these idealized distributions of vorticity, as illustrated in Figure 3.3.8. 


Figure 3.3.8 Velocity distributions in the neighborhood of common concentrations of vorticity. (a) An ideal- 
ized vortex sheet in 2D. (b) A physical shear layer in 2D with a finite thickness. (c) Plan view of a vortex sheet 
in 3D in the common situation of no jump in velocity magnitude, only direction. The velocity jump AV 1s per- 
pendicular to the vorticity @, which 1s parallel to the mean velocity Vbar (d) An idealized line vortex. (e) A 
physical vortex with a core of finite radius. (f) The Rankine vortex: an idealization of a physical vortex in 
which the vorticity is constant in a circular core and zero outside the core 
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(a) (b) 
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A vortex sheet in 2D flow is shown in Figure 3.3.8a. By applying Stokes's theorem to a small closed contour 
inclosing a short section of the sheet, we see that there must be a jump in velocity magnitude across the sheet 
equal to the local vorticity strength, or vorticity per unit distance along the sheet in the direction perpendicular 
to the vorticity vector. In this 2D case, the vorticity vector is perpendicular to the plane of the paper, and the 


distance along the sheet is measured in the flow direction. The physical flow corresponding to this idealized 
vortex sheet is a shear layer with the velocity jump spread across a finite thickness, as shown in Figure 3.3.8b. 

In a 3D flow, the velocity jump across a vortex sheet, in a vector sense, must still be perpendicular to the vor- 
ticity vector. A common situation in aerodynamics, as we'll see in the modeling of wing flows in 3D in Chapter 
8, 1s to have a sheet with no jump in velocity magnitude, only in direction. In this case, the jump 1n the velocity 
vector is perpendicular to the vorticity vector, which 1s parallel to the direction of the mean of the velocity vec- 
tors on the two sides of the sheet, as sketched in Figure 3.3.8c. It is easy to show that if the vorticity vector were 
not parallel to the mean of the two velocity vectors, there would have to be a jump 1n velocity magnitude. 

Vortex sheets of the kind sketched in Figure 3.3.8c are often modeled in 3D potential-flow theory. It is clear 
from the definition of the velocity potential in Section 3.3.6 that the jump in the velocity vector requires a jump 
in the velocity potential as well. 

If a physical shear layer is effectively thin, that 1s, 1f the flow changes across the layer are much faster than 
changes in other directions, the velocity jump will be approximately equal in magnitude and perpendicular to 
the integral of the vorticity across the layer. This is an observation we'll find helpful in some of our thinking 
about boundary layers and wakes in later chapters. 

Our next example 1s the idealized line vortex shown in Figure 3.3.8d. For purposes of this discussion, we'll 
assume the line is locally straight, so that we don't have to deal with the problem of the infinite perpendicular 
velocity of a curved line vortex that we mentioned in Section 3.3.7. The circulation on any circular contour of 
radius r centered on the vortex line must be the same, provided the contour encloses no other vorticity. We con- 
clude that the circumferential velocity must go as 1/r as shown and that the circumferential velocity is singular 
on the vortex line itself. In the corresponding physical vortex shown in Figure 3.3.8e, the vorticity is spread over 
a vortex core with a radius re and a circumferential velocity distribution that depend on the flow process that 
produced the vortex. Outside of the core, the vorticity is zero, the circulation is constant, and the velocity goes 
as 1/r, just as it did for the ideal line vortex. An idealization that lies between the line vortex of Figure 3.3.8d 
and the physical vortex of Figure 3.3.8e is the Rankine vortex, shown in Figure 3.3.8f. In the Rankine vortex, 
the vorticity is constant throughout a circular core of radius re and zero outside the core. The motion within the 
core 1s thus a solid-body rotation with velocity proportional to r, as shown. The Rankine vortex is an ingredient 
in one of the theories of induced drag we'll consider in Section 8.3. 

The point vortex and Rankine vortex are idealized flow structures that can be sustained only in inviscid flow. 
In the real world, viscosity would diffuse them, forming a physical vortex core as shown in Figure 3.3.8e. 


3.3.9 The Biot-Savart Law and the “Induction” Fallacy 


Now we come to the more general question of what we can say globally about the velocity when the distribution 
of vorticity 1s known. Let's start with the general problem of determining velocity from the vorticity outright, 
in a mathematical sense. It turns out that inverting the definition of vorticity (Equation 3.3.2) does not entirely 
determine the velocity field, but determines it only to within an unknown additive part that must be irrotational. 
The Biot-Savart law expresses the solution for the part of the velocity field that is determined. It can be ex- 
pressed in three forms: applicable to vorticity distributed continuously through a volume, vorticity concentrated 
in thin sheets, or vorticity concentrated in line vortices. 
For the Biot-Savart law to hold, the following assumptions must be met (see Milne-Thomson, 1966): 
1. The fluid fills all of space. 


2. The fluid is at rest at infinity, with the velocity magnitude at large distances dying off at least as LA. 


For the ways we typically want to use the Biot-Savart law, these assumptions are not restrictive. If there is a 
solid body in the flowfield, we can assume that it is filled with fluid at rest that 1s separated from the external 
field by a vortex sheet that represents the surface of the body and provides the velocity jump from the interior 
to the exterior. A constant farfield velocity, as we often have in steady-flow aerodynamics, is not a problem be- 
cause the constant velocity can be removed via a Galilean transformation. 

The simplified theoretical models for flows around airfoils and wings that we'll discuss in Chapters 7 and 8 
are generally idealized inviscid models in which the vorticity is assumed to be concentrated in thin sheets, and 
we often discretize the sheets into line vortices. Thus the form of Biot-Savart that we use most often is the form 
for vorticity distributed as a line vortex: 
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where 1° and t are the strength and the tangent vector of the line vortex, the radius vector r is defined as illus- 
trated in Figure 3.3.9, and the integration is over all line vortices in the field. 
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Figure 3.3.9 Definitions of geometric quantities appearing in the Biot-Savart law 





ds 


line vortex 


The Biot-Savart law is of course useful for quantitative calculations, but just the qualitative idea that knowing 
the vorticity at one point allows us to infer something about the velocity at another point is valuable in itself. 
It is in fact one of our most powerful conceptual tools for reasoning about flowfields. However, as powerful as 
this idea is, it can be a mixed blessing because it frequently leads to confusion regarding cause and effect. 

The problem arises because the vorticity is the “input” and the velocity is the “output” in Equation 3.3.6, and 
it is common practice to refer to the velocity inferred from the vorticity as the induced velocity. Because of this, 
it's just too easy to think of the vorticity as somehow “causing” the part of the velocity that it “determines.” But, 
of course, this kind of thinking is wrong. In the absence of significant gravitational or electromagnetic body 
forces, there is no action at a distance in ordinary fluid flows. Significant forces are transmitted only by direct 
contact between adjacent fluid parcels. So there is no way a vortex at point A can directly “cause” a velocity at 
some remote point B, and terms such as “caused by” and “induced” and even “due to” misrepresent the physics. 
We must be careful to remember that Biot-Savart is just a calculus relation between a vector field and its curl, 
and that in fluid mechanics it doesn't reflect a direct physical cause-and-effect relationship. 

This is a crucial point that cannot be overemphasized, and yet it has received surprisingly little emphasis in 
the literature. It is interesting to look at what other authors have had to say about it. Sears (1960) on p. 1.12 
has a short paragraph that mentions the lack of a “mechanism” by which a vortex “induces velocities” remotely, 


but he doesn't provide further discussion other than to point out that it's just kinematics resulting from the as- 
sumption of irrotational flow. Batchelor (1967) on p. 87 states that the vorticity can be said to “produce” or 
“induce” the “velocity distribution in the surrounding fluid,” but then goes on to say that this does not imply 
a “mechanical cause and effect,” but only that the velocity distribution is the “solenoidal velocity whose curl 
has the specified value everywhere and which is therefore associated with the given distribution of vorticity.” 
Milne-Thomson (1966) on p. 167 defines the “induced velocity” as “the velocity field that coexists with a given 
distribution of vorticity and vanishes with it.” But he also makes the unequivocal statement that the vorticity 
and the induced velocity “occur together but neither can properly be said to cause the other” (emphasis his). 
Durand (1967a) introduces the term “induced velocity” on p. 135 and warns the reader that he will continue to 
use it even though the implied causation isn't real, saying “it would presumably be more correct to say ... that 
the field velocity is consistent with, or correlative to, the existence of the vortex.” 


Just as Equation 3.3.6 seems to imply that the vorticity “causes” the velocity, it also gives the misleading 
impression that individual elements of a vortex filament somehow make their own separate “contributions” to 
the velocity. Milne-Thomson (1966) on p. 171 is unequivocal on this point as well, saying that “This impression 
must be guarded against; otherwise an improper physical picture may be imagined” and pointing out that only 
the velocity given by the complete result of the integration can be “asserted to have physical reality.” This is 
correct, of course. Still, in our discussion of the flow around a 3D wing in Chapter 8, we'll find it instructive to 
think about separate “contributions” to the velocity “associated with” partial portions of the vorticity configur- 
ation. In such discussions, we must always be on our guard against attributing any direct physical causation to 
these associations. 

We aerodynamicists have contributed to our own confusion by using the terms “induced velocity” and “in- 
duction” much too freely. This terminology comes from another field where the Biot-Savart law applies, that 
is, classical electromagnetics, in which it is said that the magnetic field is “induced” by the electric current. In 
electromagnetics, the terminology is appropriate because there is supposed to be real action at a distance taking 
place, for which the term “induction” is physically appropriate. In fluid mechanics, however, there is no direct 
causal link. We know that vorticity is produced, convected, and diffused in ways we'll discuss in Section 3.6, so 
we know why the vorticity in our flowfields has to be there: It 1s there more as a manifestation of the overall 
flow pattern than as a cause of it. 


So, when we want to infer something about a flow pattern without solving the equations of motion, and we 
already know the general arrangement of the associated vortices, 1t can be very helpful to appeal to the vorticity 
and Biot-Savart. The simplified wing and induced-drag theories we'll discuss in Sections 8.2 and 8.3 make use 
of this approach for quantitative calculations as well. We should make full use of the insights and computational 
shortcuts that Biot-Savart provides, but we should avoid terms such as “caused,” “induced,” “induction,” and 
“due to.” Then when we want to explain why a flow pattern exists, we must appeal to the real physics, that is, to 
the local force balance among fluid parcels. 

This completes our discussion of the aspects of vorticity that can be deduced just from kinematic considera- 
tions. In Section 3.8, we'll consider vorticity further, looking at aspects that arise from dynamics. 


3.4 The Equations of Motion and their 
Physical Meaning 


My approach in this section is not to derive the equations but to try to give brief, intuitive explanations of what 
the various terms in the equations mean and to look at some of the general things we can infer from the equa- 
tions themselves about the behavior of flows. 


Our basic equations are expressions of conservation laws for mass, momentum, and energy. These laws can 
be described most directly and understood most easily in the Lagrangian reference frame, in which we describe 
the flow in terms of the trajectories of “fixed” parcels of fluid as they evolve in time. However, as I argued in 
Section 3.2, the Eulerian reference frame, in which we describe the flow as it streams past points in a spatial 
reference frame not tied to the fluid, is ultimately the preferred choice for conceptual and quantitative purposes. 
The approach I'll follow here is to briefly discuss what the conservation laws mean in the Lagrangian frame and 
then proceed to a discussion of how they are expressed in the Eulerian frame. 

In both the Lagrangian and Eulerian frames, we will be considering what happens to elemental volumes of 
fluid, just differently defined in the two cases. Deriving our conservation laws in the form of PDEs involves 
a formal procedure of taking the limit as the dimensions of our fluid parcels go to zero. We won't go through 
the details of that procedure in this discussion, but the reader should keep in mind that fluid parcels in either 
reference frame should be thought of as arbitrarily small. 

Lamb (1932) defines a fixed Lagrangian parcel of fluid as containing the same fluid particles, and only the 
same fluid particles, for all time. The bounding surface of our parcel must therefore move with the fluid in such 
a way that no fluid particles pass through it. This is, of course, an idealization that makes sense only in our 
imaginary continuum world. In the real world, molecules will always be diffusing across such a boundary in 
both directions, and the best we can do is to make the boundary follow the average motion of the fluid in such 
a way that it has no net flux of material across it. In either way of looking at it, our parcel will always have 
the same amount of material in it and have no net flux of material across its bounding surface. Sweeping mass 
diffusion under the rug in this way works fine for single-species fluids or multispecies fluids in situations where 
relative species concentrations remain constant. If relative species concentrations vary significantly, defining 
a Lagrangian fluid parcel becomes problematic. For now, we'll ignore this minor limitation on the Lagrangian 
description and continue our discussion. 

As we noted earlier, we have conservation laws for mass, momentum, and energy. Why do we have them for 
these quantities and not for others, such as the pressure or the viscous stresses? It is because mass, momentum, 
and energy are quantities whose conservation 1s required by elementary physics and thermodynamics, and the 
other quantities are not. Our conserved quantities are also physically tied to the fluid material in such a way 
that they are carried along, or convected, with it. Thus by definition these convected quantities are carried along 
with our Lagrangian fluid parcels. The amount of such a quantity in a parcel can change only if some physic- 
al process acting inside the parcel or at its boundaries accounts for the change. Our conservation laws simply 
quantify this accounting. Here is a brief description of what they mean within the Lagrangian framework. 


3.4.1 Continuity of the Flow and Conservation of Mass 


By our very definition of a fluid parcel in the Lagrangian description, we have implicitly enforced conservation 
of mass within a parcel. However, the equation that explicitly enforces conservation of mass must do more than 


that. The continuity equation relates the fluid density at all points to the volume occupied, so as to satisfy two 
requirements: 


1. Mass is conserved within each Lagrangian parcel, as required by the definition of the parcel and 


2. There are no voids between Lagrangian parcels, nor do adjacent parcels overlap. The entire volume 
occupied by fluid must be considered to be filled with Lagrangian parcels that conserve mass. 


The physical interpretation of the continuity equation in the Lagrangian description is very simple: As the 
volume of a parcel of fluid changes, the fluid density must change so as to keep the mass of the parcel constant. 

Although the basis for the continuity equation is physical (requirements 1 and 2 above), the requirements 
it imposes on the flow are not as direct in a cause-and-effect sense as those imposed by other equations. For 
example, in the conservation of momentum (Section 3.4.2), forces directly cause accelerations; and 1n the con- 
servation of energy (Section 3.4.3), work done in compressing the fluid directly causes a rise in temperature. 
The continuity equation is different in this regard. It can be tempting to think that a change in streamtube area 
“causes” a change in velocity, as a result of continuity. But of course the more direct cause of a change in velo- 
city is unbalanced forces applied to fluid parcels. So compared with the momentum and energy equations, the 
continuity equation is less directly tied to the dynamics and is more like a kinematic constraint imposed on the 
flowfield. 


3.4.2 Forces on Fluid Parcels and Conservation of 
Momentum 


In the Lagrangian reference frame, conservation of momentum is imposed explicitly in the form of Newton's 
second law, F = ma. Our Lagrangian fluid parcel has fixed mass, and its acceleration 1s the result of the sum 
of the forces acting on it. As we saw in Section 3.2, this is a vector relation that in the general case requires a 
vector equation, or equivalently, three scalar equations, for its expression. 

There can be external body forces (gravitational and electromagnetic) acting on the parcel, but in aerody- 
namics these are usually negligible, and we are concerned only with the forces exerted on the surface of the 
parcel by adjacent parcels. These surface forces that adjacent parcels exert on each other must be equal-and-op- 
posite across the shared boundary, according to Newton's third law. They are the same apparent internal fluid 
“stresses” we discussed in Section 3.1 in our consideration of the issues of modeling the flow as a continuum. 
There we saw that it is valid to view them as distributed stresses only in the idealized continuum world, and 
that in the real world they are just apparent stresses that are the result of momentum transferred relative to the 
average flow by molecular motion. In any case, from now on we'll think of them as if they were actual stresses. 


In Section 3.2, we discussed representing these stresses as a tensor. This is convenient for mathematical ma- 
nipulation, but for purposes of physical understanding, thinking in terms of force vectors is more intuitive. If 
we contract the stress tensor with the unit vector normal to the imaginary boundary between parcels, we get a 
vector representing the force per unit area acting across the boundary. Further, we can resolve this vector into 
a component perpendicular to the boundary and a component parallel. In the NS equations, the perpendicular 
component is assumed to be the local hydrostatic pressure (static pressure, for short). The parallel component 
is called the shear stress and is due entirely to the effects of viscosity. 

The pressure is one of the most fundamental quantities in continuum fluid mechanics, but understanding it 
intuitively isn't trivial. At a single point in space, the normal stress on any imaginary boundary containing the 
point is the same regardless of the orientation of the boundary. Thus we have the idea that the pressure at a 
point, a scalar quantity, “acts equally in all directions,” not an easy concept to grasp. The difficulty of expressing 
the concept in easily understood terms has led some commentators to errors, such as Anderson and Eberhardt's 


(2001) description of the static pressure as “the pressure measured parallel to the flow.” This contradicts two as- 
pects of pressure as correctly understood: The definition of the pressure is independent of the flow, and pressure 
acts equally in all directions. It is a bit easier to grasp pressure intuitively in terms of its effect on a small but 
finite fluid parcel, which in a field of constant pressure is pushed inwardly by the surrounding fluid equally in 
all directions. Understanding the shear stress intuitively entails similar difficulties, which we'll defer to Section 
3.6, where we'll discuss in some detail how the shear stress arises and now it is represented in the NS equations. 

For the surface stresses to contribute any acceleration to the parcel, the vector sum of the stresses acting on 
all the parcel's faces must be nonzero, that is, there must be an unbalanced force left over. Stresses on opposite 
sides of a parcel by definition act in opposite directions, and if their magnitudes are the same, they cancel. The 
normal stresses in a field of constant pressure, for example, would cancel each other out, and there would be 
no unbalanced force. For there to be an unbalanced force, the magnitudes of the stresses on opposite sides of a 
parcel must be different, and for this to happen the pressure or the viscous stress must be nonuniform. Thus the 
unbalanced force depends not on the stress itself but on a gradient of the stress, which in the case of the pressure 
is simply Vp. This generally requires nonuniform motion of the fluid. We'll look at some examples of how this 
works in the case of viscous stresses in Section 3.6. In any case, because the forces depend on the motion of the 
parcel and the motions of the parcel's neighbors, the cause-and-effect relationship between the stresses and the 
velocities is circular, which complicates our task and is a topic we'll consider in more detail 1n Section 3.5. 

Because the momentum equation governs a parcel's acceleration, determining the parcel's velocity requires 
integration of the equation. We'll see in Section 3.8.4 how an integration of the momentum equation for the 
steady flow of an inviscid fluid leads to Bernoulli's equation, one of our most useful special-purpose flow rela- 
tions. 


3.4.3 Conservation of Energy 


The principle of conservation of energy is just the first law of thermodynamics, which states that the rate of 
change of the energy stored by the material in our Lagrangian fluid parcel 1s equal to the rate at which energy 1s 
added to it from outside, in the form of heat added and/or mechanical work done. Only two parts of this would 
be new to a student of elementary thermodynamics. One is that the motion of the parcel is an important part of 
the picture, and thus the bulk kinetic energy of the parcel must be included as one of the forms of stored energy 
to be accounted for. The other is that viscous forces, not just the pressure, provide an avenue by which mechan- 
ical work can add to the energy. 

Heat can be added to or subtracted from the parcel both by electromagnetic radiation absorbed or emitted 
within the parcel or by molecular conduction across the parcel's boundary. Note that radiation to or from the 
interior of a parcel is a volume-proportional or “body” effect, while conduction across the boundary of a par- 
cel is a “surface” effect, and that in aerodynamics it happens that only the surface effect is usually significant. 
The mechanical work done on the parcel is done by the same forces we considered in momentum conservation. 
Again, in aerodynamics the external body forces are usually negligible, and we are concerned only with forces 
exerted on the parcel by adjacent parcels. But the effects of these internal fluid stresses on energy conservation 
are more complicated than their effects on momentum. In momentum conservation, we had to consider only 
the net force on the parcel. In energy conservation, the net force is important as well: Acting over the distance 
moved by the center of mass of the parcel, the net force contributes to changes in the bulk kinetic energy of the 
parcel. But there is more. If the parcel deforms, either volumetrically or in shear, parts of the parcel's boundary 
move relative to the parcel's center of mass, and significant work can be done on the parcel that way as well. 
The pressure, acting through compression or expansion, heats, or cools the parcel, and the viscous stresses heat 
the parcel, a process called viscous dissipation, which we'll consider further in Sections 3.6 and 6.1.1. 


Turbulence raises interesting issues with regard to conservation of energy. We often think of turbulent flows 
and model them theoretically in terms of time averages, in which the unsteady turbulence motions have been 
averaged out, an approach we'll discuss in detail in Section 3.7. In the time-averaged flowfield, the kinetic en- 
ergy of the turbulence is a form of energy that must, in principle, be accounted for. However, in many flow 
situations the production and dissipation of turbulence kinetic energy (TKE) are roughly in local equilibrium, 
and TKE can be neglected. We'll discuss this further in Section 3.7. 


3.4.4 Constitutive Relations and Boundary Conditions 


We've just taken a brief look at what the three basic conservation laws mean in the Lagrangian reference frame. 
Whether we implement these laws in the Lagrangian frame or the Eulerian, they provided us with five equations, 
and we have eight unknowns. Our unknowns are three space coordinates (Lagrangian) or velocity components 
(Eulerian) and five local material and thermodynamic properties: pressure, density, temperature, and the coeffi- 
cients of molecular viscosity and thermal conductivity. We therefore need three additional constitutive relations 
to complete the system. For aerodynamics applications, these relations are usually taken to be the ideal-gas 
equation of state relating the pressure, density, and temperature; the Sutherland law defining the viscosity as a 
function of temperature only; and Prandtl's relation for the thermal conductivity. 

The complete NS system provides all the internal-to-the-fluid physics we need. At the boundaries of our 
flow domain, the BCs we need to apply depend on the type of boundary. At flow boundaries, we need to invoke 
no additional physics, and the NS equations themselves determine what BCs are permissible or required, de- 
pending on the flow situation. At any boundary that is an interface with another material (often referred to as a 
“wall’), additional physical considerations are needed to define the BC. We saw in Section 3.1 that under most 
conditions where the continuum equations apply, the no-slip and no-temperature-jump BCs are appropriate. 


3.4.5 Mathematical Nature of the Equations 


As we've just seen, our system of equations consists of five field PDEs and three algebraic constitutive relations, 
with eight unknowns in all. The equations are of mixed hyperbolic/ elliptic type in space, so that the solution 
depends on conditions on the entire boundary of the domain. Numerical solutions can be “marched” forward in 
time, but not in space. The equations are nonlinear, so that solutions cannot generally be obtained by superpos- 
ition of other solutions. Even a steady-flow solution cannot be obtained by a single matrix-inversion operation, 
but must be approached by time-marching or some process of iteration. These are issues we'll discuss in greater 
detail in connection with CFD methods in Chapter 10. 

Solutions to the NS equations are sometimes nonunique, for example, when more than one steady-flow solu- 
tion exists for the same body geometry, as we'll see in the case of some airfoils at high angles of attack in Sec- 
tion 7.4.3. Generally, solutions without turbulence exist mathematically, but in most situations at high Reynolds 
numbers they are dynamically unstable and are not to be found in nature. The instabilities and other mechan- 
isms that can lead to the appearance of turbulence in solutions to the equations are discussed in Section 4.4. 

Because of the above general difficulties, analytic solutions to the NS equations are known for only a few 
simple cases with reduced dimensions and constant fluid properties, and even then only in limiting situations 
in which the inertia terms can be neglected. For example, there are effectively 1D solutions for steady, fully 
developed flow in planar 2D or circular-cross-section ducts or pipes and 2D solutions for flow around a cir- 
cular cylinder or sphere in the limit of low Reynolds number. For some idealized situations at high Reynolds 
numbers, boundary-layer theory provides approximate solutions to the 2D NS equations that require only the 


solution of an ordinary differential equation (ODE) in 1D, as we'll see in Section 4.1. For more general flows, 
numerical solutions are our only option, unless we can make simplifying assumptions. 


3.4.6 The Physics as Viewed in the Eulerian Frame 


In the Eulerian description, we track what happens as fluid flows past points in a given spatial reference frame. 
So now, instead of tracking what happens to fixed parcels of fluid, as we did in the Lagrangian description, 
we track what happens in infinitesimal elements of volume imbedded in our spatial coordinate system. These 
Eulerian volume elements have fluid continuously streaming through them and across their bounding surfaces. 
This is, of course, the same streaming motion that was part of the flow when we described it in the Lagrangian 
frame. We are just seeing it now in a different reference frame, and the difference in vantage point requires us 
to treat the convection process differently when we implement our conservation laws. In the Lagrangian for- 
mulation, convection is accounted for implicitly by our definition of a fixed fluid parcel, and our conservation 
equations have no terms representing convection across the boundaries of a fluid parcel, because there is none 
by definition. In the Eulerian formulation, where there is generally a flux of fluid across the boundaries of our 
volume elements, the convection process must appear explicitly in the form of additional terms 1n the equations. 

Mathematically, the additional terms arise when we replace the time derivatives in the Lagrangian equations 
with their Eulerian equivalents, using Equation 3.2.1. In the Eulerian equations that result, convection effects 


are represented by terms that arise from the V > ¥V term on the right-hand side. To see how this works, con- 
sider, for example, the x component of the momentum of a Lagrangian parcel of volume dV, which is given by 
op udV. Applying Equation 3.2.1 to this quantity gives 


dy : © (oud) +VeV(pudVv). 
3.4.1 LT ot 

The second term on the right-hand side represents the convection of momentum in the Eulerian x-momentum 
equation in its rawest form. There is another form often seen in the literature, in which the density is taken 
outside the derivative, and the relationship to the Lagrangian acceleration Du/Dt is clearer. To derive this oth- 
er form, we must invoke conservation of mass, which in its Lagrangian form simply states that the mass of a 
Lagrangian parcel doesn't change with time. Applying Equation 3.2.1 to that gives 


dD Dp D 
—(adV) = — dv ) p>, (dV ). 

3.4.2 Dt Dt Dt 

Using the product rule to expand the derivatives on the right-hand side of Equation 3.4.1 and invoking Equa- 
tion 3.4.2, we get the Lagrangian rate of change of x momentum, per unit volume: 

1 D Ou 
udV) = p— + pVe Vu. 

3.4.3 dV Dt Uy ae 

The last term on the right-hand side of Equation 3.4.3 is just the product of density and the convective ac- 
celeration that we introduced in Section 3.2 and that we looked at for the special case of 1D flow in Equation 
Duta. 

Now let's look at the convection process in more detail. One thing that almost goes without saying is that 
across a shared boundary between two parcels, convection is reciprocal. It is a kind of analog to Newton's third 
law in mechanics, which states that the mutual forces exerted by two bodies in contact with each other must be 
equal and opposite, because there is nothing at the interface that can support an unbalanced force. At a shared 
boundary between two fluid parcels, there 1s nothing that can add to or subtract from the flux of a conserved 





quantity, so the flux leaving one parcel must be equal to the flux entering the other. In our general NS formu- 
lation, we end up not having to enforce this reciprocity explicitly, because it is guaranteed by the continuity of 
all of our flow variables. There are specialized theories in which we allow surfaces of discontinuity, such as 
when we model shocks in solutions to the inviscid versions of the equations. In these cases, we must introduce 
additional equations to explicitly enforce the applicable conservation relationships across the discontinuity. 

The physical interpretation of the convection terms in our conservation equations is straightforward. When 
the rate of convection into a volume element is not balanced by the convection out, convection becomes a source 
of the conserved quantity and must be taken into account in the conservation law. The convection terms thus 
represent the net rate at which a conserved quantity is being convected into or out of a volume element. In the 
conservation of mass, this net convection is the only contribution to the time rate of change of the total mass 
inside the element. When the flow is steady, this means that the flux of mass into the element must equal the 
flux out. The same thing applies to larger volumes than just a local parcel. For a steady-flow streamtube, as we 
defined it in the previous section, this means that the mass flux through any surface that cuts across the tube 
must be the same. Net convection is an important part of the balance in the conservation of momentum and 
energy too, but we must also account for the contributions from forces applied to the fluid (momentum and en- 
ergy) and from heat conduction (energy only). 

Contributions to the momentum and energy balances from external sources such as gravitational or electro- 
magnetic forces exerted on the fluid, or heat transmitted to and from the fluid by absorption and emission of 
radiation, can be accounted for in a straightforward manner in our formulation. Likewise for exchanges of force 
or energy between parcels within the fluid that are not in direct contact with each other, which would be internal 
to the flow but nonlocal effects. In aerodynamics, however, such external effects and internal nonlocal effects 
are usually negligible, so that the only effects that remain to be represented in our equations are those that are 
transmitted by direct parcel-to-parcel contact. This leaves us with just the interparcel forces represented by the 
apparent internal stresses, and the heat fluxes due to conduction, that are exchanged between adjacent parcels 
of fluid. These quantities, as we have already seen, are not physically tied to the fluid material and are not con- 
vected with it. They are not affected by changes in the velocity of our reference frame, and they look the same 
in the Eulerian frame as they did in the Lagrangian. 

So in the usual situation that prevails in aerodynamics, the only significant transmission of forces within the 
fluid is between adjacent parcels of fluid. Convection effects in an Eulerian frame are similar in the sense that 
they too act only between adjacent Eulerian parcels. Thus there is no mechanism in our usual aerodynamic flows 
for any kind of exchange of “force at a distance,” and therefore no remote “induction” or other such effects. Al- 
though the Biot-Savart law seems to imply a kind of remote induction effect, as we saw in Section 3.3.9, it is a 
fallacy to think of the velocity at one point as being “induced” or “caused” by the vorticity at another point. This 
is just one example of the difficulties associated with assigning cause and effect in fluid mechanics. In spite of 
the difficulties, we will try to make some sense of cause and effect in Section 3.5. 


3.4.7 The Pseudo-Lagrangian Viewpoint 


So far in this section, we've discussed the equations of motion and the physical conservation relationships they 
represent, as viewed in both the Lagrangian and Eulerian reference frames. The two reference frames are gener- 
ally kept distinct, and theoretical models or quantitative calculations generally use either one frame or the other, 
usually the Eulerian. However, in qualitative discussions the distinction is not always so clear cut. This is es- 
pecially so in discussions of steady flows, for which time should enter into the description of the flow only in 
the Lagrangian frame, not the Eulerian. In spite of this, in many discussions in the literature and some in this 
book, you will notice that time-like terminology is used, even though the reference frame used is not explicitly 


Lagrangian in the sense of following individual Lagrangian fluid parcels. For example, a statement such as “As 
the flow approaches the leading edge of the airfoil, it is deflected upward” is distinctly time-like in the image 
it evokes, but it is not strictly Lagrangian, because it refers to the progress of some imprecisely defined mac- 
roscopic body of fluid instead of a small fluid parcel. In a sense, this is sloppy terminology, but the meaning is 
usually clear enough that no harm is done, and more rigorous alternatives are often awkward. 


3.5 Cause and Effect, and the Problem of 
Prediction 


A basic characteristic of mathematical theories, both in fluid mechanics and other branches of physics, is that 
governing equations by themselves are not predictive. Only solutions to the equations can make predictions 
with any “reach” in space and time. Just knowing this, however, doesn't tell us how difficult it will be to predict 
things in fluid mechanics, either computationally or mentally, or how difficult it will be to understand the flow 
phenomena that can be predicted. To try to gain insight into these complicated issues, I'll start by considering 
the nature of cause and effect, which will bring us naturally back to the problem of prediction. 

To understand a fluid flow, or to explain it, we would like to establish cause-and-effect relationships between 
what happens at different points in space, between the different flow quantities (for example, velocity and pres- 
sure), and between the flowfield and the forces and heat fluxes that it exchanges with its environment. One of 
the main reasons fluid mechanics is such a difficult discipline is that these cause-and-effect relationships are 
complicated, which can make it difficult to explain things in a satisfying way, as we'll see below. And in Section 
3.3.9, we saw an example of another kind of difficulty, where, in connection with the Biot-Savart law and the 
idea of induced velocity, it was easy to assign cause and effect incorrectly. 

Let's start our discussion of this issue by looking at what determines the motion of an individual parcel of 
fluid. According to Newton's second law, the acceleration that a Lagrangian fluid parcel experiences as 1t moves 
through the field is proportional to the net force exerted on the parcel. As we noted in Section 3.4.7, in aerody- 
namic flows there are usually no significant “body forces” acting over long distances, and the only forces we 
need to consider are those exerted by parcel-to-parcel contact, that is, by the pressure and the viscous stresses. 
So the motion of our Lagrangian parcel is influenced directly only by the resultant of the forces exerted by all 
the adjacent parcels. And the motions of the adjacent parcels are influenced in turn by the forces exerted by the 
parcels adjacent to them, and so on throughout flowfield. If all of these forces were known a priori, predicting 
all of the motions would be trivial, but of course we can't know the interparcel forces a priori because they de- 
pend on the motions of the parcels. And the motions depend on the forces, and around and around it goes. We're 
dealing with the motions of many parcels, all interacting with their neighbors, and as a result, we effectively 
have circular cause-and-effect between the forces and the motions. Regarding Newton's second law, we tend to 
think of the force as the “input” and the motion as the “output,” but that way of thinking is not consistent with 
the situation we face 1n continuum fluid mechanics, where circular cause and effect is a fact of life. 

Because the cause-and-effect relationship between interparcel forces and parcel motions is circular, the rela- 
tionship between the global flowfield and the integrated forces it exchanges with its environment is circular as 
well. We might be tempted to ask whether the lift on a wing 1s there as a result of the flowfield, or vice versa, 
that the flowfield is there as a result of the force, but the answer is “both.” The interaction between the integ- 
rated force and the flowfield is mutual, and the cause-and-effect relation is necessarily circular. 


All of this circularity of cause and effect is reflected in the governing equations, which define implicit rela- 
tionships between the independent variables, not explicit one-way causation relationships. Tracking even these 
implicit relationships intuitively is made more difficult by the fact that they are mediated simultaneously by 
more than one physical principle, which is reflected in the fact that there is more than one equation to deal 
with. The momentum equations relate the velocity components to the interparcel forces, but are not sufficient 
by themselves to determine either of them. To have a fully determined system, we must also include the energy 
and continuity equations and the auxiliary relations. To predict what happens in a flowfield, we must solve the 
whole system of equations. 

We've just seen that to predict the interactions between the interparcel forces and parcel motions, we can't 
just look at the equations themselves, we must solve the equations. The same thing applies when we consider 
the interactions between what happens at one point in space and what happens at other points throughout the 
flowfield. Parcel motions are the direct result of interparcel forces, but they are at the same time constrained by 
the principle of continuity, that 1s, the requirement that Lagrangian fluid parcels do not overlap each other or 
open gaps between them. Lagrangian parcels must therefore move in concert, not with their positions relative 
to each other rigidly fixed, but varying in a coordinated way as the fluid deforms. Thus we can't determine the 
individual motions separately, but must determine them collectively by solving the equations. 

Now we've looked at two major aspects of cause and effect: relationships between forces and motions, and 
relationships between what happens at different points in space. Both have reinforced what we already knew, 
that is, that the governing equations by themselves don't predict what will happen in a flowfield and that pre- 
diction requires solving the equations. What has also become clear is that there is a wide gap between our basic 
physical laws as embodied in the equations of motion, and the flow phenomena that the laws govern. The phys- 
ical balances represented by the equations are relatively simple to understand, and the direct interactions repres- 
ented are only local. A flowfield, on the other hand, is a global, collective phenomenon that can be exceedingly 
complex. The problem we face 1s that of bridging two disparate regimes: equations governing local interactions, 
on the one hand, and flowfields embodying complicated collective behavior on the other, with a huge gap in 
potential complexity in between. 

This gulf between local physics and global behavior is at the root of a fundamental difficulty of prediction 
in fluid mechanics. To predict from first principles what will happen in a flowfield, we must determine a flow 
pattern that satisfies the equations everywhere simultaneously, that is, we must solve the equations. The wide 
gulf we must bridge to do this is reflected in the difficulty of the equations we must solve: a set of PDEs 1n mul- 
tiple spatial dimensions and time, with multiple dependent variables (flow quantities) to be determined. Even in 
situations where a qualitative description of the flow would suffice, we humans are not well equipped mentally 
to do the required “solving” in our heads, and “mental” predictions based solely on the basic physics are usually 
not reliable. In all but the simplest flow situations, there are too many possible flow patterns, and determin- 
ing which one will prevail requires a degree of quantitative precision not attainable by mental means. In most 
situations that are complex enough to be of practical interest, a “first principles” prediction requires a detailed 
numerical solution to the equations. 

In principle, as I argued in Section 3.1, a numerical solution of the full NS equations should be able to predict 
any flow of interest in aerodynamics. Strictly speaking, this sweeping claim can be true only if we assume our 
idealized solution process can find multiple solutions in cases where they exist, as, for example, in the lift-curve 
hysteresis of some airfoils and wings, which we'll discuss in Section 7.4.3. In practice, the possible existence of 
multiple solutions is only one of our worries, however. In practical calculations, we cannot resolve the effects of 
turbulence directly and must model them, and our inability to model them accurately compromises the accuracy 
of our predictions to varying degrees depending on the situation, an issue we'll consider further in Section 3.7 
and encounter several more times in Chapters 7—10. 


Even with the shortcut of turbulence modeling, solutions to the NS equations are not always readily done, 
and even when a solution is available, the information it provides has limits. A solution can tell us what, for 
example, by determining a flow-separation pattern, or how much, as in determining lift or drag, but it provides 
very little information as to why. Thus we will still often seek qualitative predictions and explanations of flow 
phenomena. For these purposes, we must generally rely to some extent on experience, that is, on our knowledge 
of the phenomenology of general flow patterns in a variety of situations. Where starting from scratch with noth- 
ing but first principles to rely on usually gets us nowhere unless we resort to computation, simply predicting the 
likely flow pattern based on phenomenological knowledge of flows in similar situations at least gives us a start- 
ing point from which to build a more detailed explanation. This is a mental approach that comes so naturally that 
we tend to take it for granted and to forget that this is what we are actually doing. Of course, any explanation we 
construct from there must be true to the physics. It 1s especially important not to force one-way causation where 
it doesn't fit. Explanations should correctly acknowledge circular cause-and-effect relationships and avoid the 
temptation to oversimplify. 

A qualitative “analysis” constructed along these lines would start with a conjecture as to the basic flow pat- 
tern, drawn from our phenomenological experience base, followed by a sketch of the velocity field (qualitative 
velocity magnitudes and directions). If large regions of the flow are expected satisfy the conditions for a steady 
Bernoulli equation to hold (see Section 3.8.4), it should then be possible to sketch out a qualitative pressure 
field based on the velocity magnitudes. Then the qualitative flow curvatures can be assessed for consistency 
with the pressure gradients. Some mental “iteration” may be required to find velocity and pressure patterns that 
are consistent with the physics. Of course, the quantitative resolution of this kind of “analysis” 1s very low and 
not generally sufficient to distinguish whether the conjectured flow pattern was correct or not. The approach is 
really suited only to generating explanations of things we already know actually happen. We'll explore some 
arguments of this type in Chapters 7 and 8, as applied to airfoils, wings, and the generation of lift. 

In this section, we've identified two major characteristics of fluid mechanics that make prediction difficult: 
the circular nature of cause and effect and the great gulf between local physical laws and global flow behavior. 
Our task in much of what follows will be to make some headway in the face of the daunting challenge that these 
difficulties pose. 


3.6 The Effects of Viscosity 


Air and water are “simple fluids” in the sense that they cannot resist deformation in the same way a solid can. 
When a solid body is subjected to a steady force that tries to deform it in a way that does not change the volume, 
the solid can resist by assuming a steady deformation, usually small. A simple fluid cannot put up such steady- 
state resistance. Another way of saying this is that the resistance of a simple fluid to such deformation vanishes 
as the rate of deformation vanishes. The upshot is that when a fluid is at rest, all of the off-diagonal terms in 
the apparent-stress tensor, that is the tangential or shear stresses, must be zero. At rest, only the normal stresses 
represented by the terms on the diagonal can be nonzero, and the three of them must be equal. 


The viscous behavior of a fluid is defined by how the apparent internal stresses respond to a deformation 
that changes with time. In general, a nonzero rate of deformation will result in nonzero shear stresses, and the 
normal stresses will no longer be equal. For gases, the description of this response that we use in the NS equa- 
tions can be arrived at through statistical analysis of the motion of the molecules (kinetic theory of gases) and 
an assumption of small deviations from thermodynamic equilibrium. The NS formulation can also be arrived at 
through a strictly continuum approach. We'll not go into the mathematical details of this continuum derivation 
here, but the assumptions and their consequences are worth enumerating. The assumptions are: 


1. The deviatoric stress tensor (the stress tensor with the pressure subtracted out) is a linear function of 
the rate-of-deformation tensor. At this level of generality, the factor of proportionality (the “viscosity’’) is 
a fourth-order tensor. The direct response of the pressure to the rate of deformation (the “bulk viscosity” 
effect) is left as a separate issue and 1s usually assumed negligible. 


2. The fluid behaves isotropically, meaning that the response of the stress in a parcel to a given deforma- 
tion field is independent of the orientation of the parcel. 


The consequences are: 


1. Because of the exclusion of the pressure from consideration and the assumption of isotropic behavior, 
the viscosity of the fluid can be represented by a single material property u. 


2. A simple shearing deformation Ou/Cy results in a resisting shear stress wOu/Oy. We'll look in greater 
detail at what this means a little later. 


3. In the special case of a uniform fluid (constant density and viscosity), the net viscous force on a fluid 
parcel due to the shear stresses on all of its faces is proportional to the curl of the vorticity. Thus the vis- 
cous shear stresses can affect the motion only if the vorticity 1s nonuniform (and therefore not everywhere 
zero). Even in more general flows in which density and viscosity vary, there must usually be significant 
vorticity for viscosity to exert a net force on a fluid parcel. 


4. A corollary of item 3, again for uniform fluids, is that irrotational flows (flows with zero vorticity) that 
satisfy the inviscid equations also satisfy the NS equations. This is not because the viscous stresses are 
zero, but because the stresses are distributed so as not to exert unbalanced forces on fluid parcels. 


The derivation of these consequences is a nontrivial exercise in tensor analysis, of which a clear account is 
given in Chapters | and 3 of Batchelor (1967). 

The range of applicability of the resulting NS formulation is wider than we might expect. The linear rela- 
tionship between the stresses and the rate of deformation, which we would expect to be a good approximation 
for small rates, seems to hold quite accurately for rates as large as we commonly encounter in applications. The 
dependence of the stresses on the instantaneous gradient of the velocity requires slow change on the scale of the 
mean-free time, but this restriction is practically never an issue because the mean free time is extremely short. 
And with the exception of high-frequency acoustic waves and the internal structure of shock waves, the bulk- 
viscosity effects are negligible, and only the shear stresses are significant. We can usually ignore any deviation 
of the pressure from what is given by the equilibrium equation of state. 

Recall from Section 3.1 that the continuum viscous stresses are really just apparent stresses representing 
transport of molecular momentum by random molecular motion relative the mean flow. In the simple linear re- 
lationship described in item 2 above, this transport of molecular momentum is represented as diffusion that is 
proportional to a gradient. Likewise, the transport of heat by conduction is represented by a simple linear diffu- 
sion relationship. Both of these relationships work in the direction such that they try to reduce the nonuniformity 
that drives the transport, which 1s the direction we would expect based on the second law of thermodynamics. 


First, consider how this works for the viscous stresses. The viscous shear stresses act like ordinary friction 
in that the work done against them is dissipated irreversibly into heat. We can see this by considering a small 
parcel of fluid undergoing a shear deformation. 

In a reference frame moving with the center of mass of the parcel, we would see a local distribution of relat- 
ive velocity as sketched in Figure 3.6.1, which would result in the stresses shown, as imposed on the parcel by 
its surroundings. The forces on the parcel faces are in the same direction as the relative velocities, so that the 
work done on the parcel by the surroundings is positive. Because the parcel has no average translational kinetic 
energy in this frame, and rotational kinetic energy must be negligible if the parcel is small enough, the work 


done can add only to the internal energy of the parcel. In other words, the work is dissipated irreversibly into 
heat. 


Figure 3.6.1 Typical velocities relative to the center of mass of a parcel (short arrows), and resulting viscous 
stresses exerted by the surrounding fluid on the top and bottom of the parcel (long arrows). The stresses and 
the relative velocities are in the same direction, so that the work done on the parcel by its surroundings is pos- 
itive and is dissipated into heat 








The linear relationship for the diffusion of heat by conduction guarantees that heat is always conducted in the 
direction from higher temperature to lower temperature. Thus our linear relationships for viscous shear stress 
and thermal conduction are both consistent with the second law of thermodynamics in that they never contribute 
to decreases in the total entropy of the system. 

Now let's take a very elementary look at how the shear stresses typically affect fluid motion. In the simple 
steady-flow situations illustrated in Figure 3.6.2, the effects of viscosity are sufficiently isolated from other 
complicating effects that they can be easily understood. For simplicity, we'll assume the viscosity u is constant. 
Figure 3.6.2a,b show 2D flows confined between parallel walls with velocity distributions as shown, assumed to 
be unchanging in the x direction, so-called Couette flows. Figure 3.6.2c shows an “external” flow, bounded on 
only one side by a wall. In all cases, a no-slip condition is enforced at walls. The small square in each diagram 
represents a typical fluid parcel, and the arrows illustrate the balance of horizontal forces: pressure forces on 
vertical faces, and shear forces on horizontal faces. Note that the forces on opposing faces are in opposite dir- 
ections and mostly offset each other. A net force on the parcel requires a change in the magnitude of the force 
between one opposing face and the other. For a sufficiently small parcel, net forces are proportional to gradients 
of the stresses, that is, the pressure gradient Op/Ox and the shear-stress gradient Ot/Oy, where tT = wOu/dy is the 
shear stress. 


Figure 3.6.2 Idealized flow situations illustrating how viscous stresses affect fluid motion. (a) Couette flow 
with one moving wall and no pressure gradient. The shear stresses balance and the pressures balance. (b) Cou- 
ette flow with stationary walls and a pressure gradient. The difference in shear stresses 1s balanced by the dif- 
ference in pressures. (c) External flow along one wall (boundary-layer flow). The unbalanced shear stresses 
result in slowing of the fluid parcel and thickening of the boundary layer 





In Figure 3.6.2a, the lower wall is stationary, and the upper wall moves to the right at a constant velocity. The 
motion is a simple shear with a constant velocity gradient Ou/Oy from bottom to top. The shear stress tT = pOu/ 
Oy 1s also constant, so that the typical parcel shown has equal and opposite stresses on its top and bottom faces 
that contribute no net force in the x direction. If the pressure is uniform in the x direction, it contributes no net 
force either, and the parcel therefore has no acceleration, which is consistent with our assumption that velocity 
is constant in x. 

In Figure 3.6.2b, both walls are stationary, but we assume there is flow from left to right. The simplest ve- 
locity profile we can assume that satisfies the no-slip condition 1s a parabola with the maximum at midchannel. 
Now the stresses imposed on the top and bottom faces of our typical parcel are different in magnitude, and 
the parcel experiences an unbalanced force proportional to the difference, which is in turn proportional to the 
second derivative of the velocity. For the parabolic profile we've assumed, Ot/Oy is thus constant and negative 
across the channel. A constant negative pressure gradient Op/Ox in the x direction can balance these unbalanced 
shear forces and allow an unaccelerated flow consistent with our assumption that velocity 1s constant in x. This 
flow requires pressure decreasing in the x direction to sustain it and is thus an idealized prototype for flows 
driven by pressure drops in long ducts or pipes. 

Figure 3.6.2c is an idealization of external flow past the surface of a solid body. Here we have only one wall 
with flow streaming past it, and the flow is assumed to have uniform velocity except close to the wall. We as- 
sume the pressure is constant in x, but that the velocity close to the wall can vary. Qualitatively the velocity 
profile will be as shown, typical of flow in “wall boundary layers,” which are quite important in aerodynamics 
and which we'll take up in much greater detail in Chapters 4 and 5. 0t/Oy is negative, as it was in Figure 3.6.2c, 
though it is not constant in this case because the velocity profile is not a simple parabola. As in Figure 3.6.2b, 
the unequal shear stresses on the top and bottom of a typical parcel exert an unbalanced force on the parcel, but 
in this case there is no pressure gradient to balance it, and the parcel slows down as it moves from left to right. 
As this happens to all parcels in the layer as they move downstream, the velocity profile is “stretched” away 
from the wall, and the boundary layer becomes thicker. 


The view we have just taken of these simple viscous-flow examples is based on conservation of momentum 
and on thinking of the viscous stresses as real continuum stresses. It 1s important to remember, however, that 
the continuum stresses are only apparent and that they really reflect transport of momentum by the motion of 
molecules relative to the average (continuum) motion of the fluid. So the view we have just taken is really the 
momentum transport view. 


It is also instructive to view viscous effects in terms of the transport of vorticity. If we take the curl of the mo- 
mentum equation, we get the vorticity equation governing the creation, convection, stretching, and diffusion of 
vorticity. The vorticity equation for constant-density flow can be found in many books (Batchelor, 1967; White, 
1991, for example). Other specialized forms are discussed in the meteorology literature. The general form is 
harder to find, but at the time of this writing, Wikipedia had a good description under “Vorticity equation.” 

An examination of the vorticity equation for flows with variable properties shows that there are only two 
ways that vorticity can be created or destroyed in the interior of a flow: by the pressure gradient acting in the 
presence of nonuniform density or by an external body force that is nonconservative. As a result, under most 
conditions of importance in aerodynamics, the only creation of vorticity that can take place in the interior of a 
flow is through shocks of nonuniform strength. However, unless a shock is quite strong, the vorticity it gener- 
ates is very weak. Thus for purposes of this discussion, we can assume that only boundaries can serve as sources 
or sinks of vorticity and that transport in the interior involves only convection, stretching, and diffusion. In the 
special case of zero viscosity, there is no diffusion, only convection and stretching, and if there is any vorticity, 
it remains attached to the same fluid parcels and 1s carried with them as they move through the field. In Section 
3.8, we'll discuss two of Helmholtz's theorems that deal with this situation. 

A simple boundary-layer flow like that in Figure 3.6.2c raises interesting vorticity- transport issues. In 2D 
flow, the vorticity cannot be changed through vortex stretching, and in constant-property flow, there can be no 
production of vorticity within the volume, only convection and diffusion. We'll look at the resulting vorticity 
“budget” of a 2D constant-property boundary layer in some detail in Section 4.2.4. There we'll see that the vor- 
ticity gradient can diffuse vorticity either away from the wall or toward the wall, and that the wall can thus act 
as either a source or sink for vorticity. The vorticity that 1s convected along within the boundary layer can be 
thought of as having been “created” at the wall upstream, through the action of the no-slip condition and the 
shear stress at the wall, at locations where the wall was acting as a source. Farther from the wall, the vorticity 
gradient weakens, the rate of diffusion decreases, and eventually the flow becomes effectively irrotational far 
from the wall. 


This is a general pattern that characterizes flows past bodies when the viscosity is small. Near the solid sur- 
face, there is formed a viscous, vortical boundary layer outside of which the flow remains effectively irrota- 
tional, as if 1t were inviscid. Note that no matter how small the viscosity is, a viscous boundary layer will form 
along the surface as long as the no-slip condition prevails, and there will always be at least this thin layer where 
the effects of viscosity are important. We'll consider this general flow structure further, including more details 
of the behavior of boundary layers, in Chapters 4 and 5. Also note that “small viscosity” is usually referred to 
as “high Reynolds number,” where the inverse of the Reynolds number (1/R) is a dimensionless parameter that 
multiplies the viscous terms in the nondimensional form of the momentum equations. We discuss the Reynolds 
number and what it means in some detail in Section 3.9.2. 


3.7 Turbulence, Reynolds Averaging, and 
Turbulence Modeling 


At the high Reynolds numbers typical of engineering applications, boundary layers usually transition from lam- 
inar to turbulent before reaching the back of the body, and the viscous wakes behind bodies are always turbulent. 
Turbulence is a complicated beast that is highly random but at the same time displays a surprising degree of 
organized structure over ranges of length and time scales that depend on the situation. In Chapters 4 and 5, we'll 
look at some of the details of turbulent structures and their consequences in boundary layers and wakes. In this 


section, our focus will be on the effects that turbulence has on the flow that are important 1n applications and 
how these effects can be practically handled in the framework of the NS equations. For computational purposes, 
our main concern will be to avoid having to solve for the details of the turbulent motions, which is generally 
impractical. This will lead us to the problem of turbulence modeling. For the kinds of effects we generally wish 
to model, we'll find that we're more interested in the local statistical properties of the turbulence than in the or- 
ganized structure, and in fact, that our increasing knowledge of organized structures has not contributed much 
to our quantitative prediction capabilities. 

Turbulence in boundary layers and wakes is of course unsteady, but in most applications the unsteadiness is 
not of direct engineering interest because the length scales associated with the turbulent motions are typically 
small compared with the dimensions of the body. As a result, the fluctuations in the integrated forces on the 
body are also very small, and in most cases we will be interested only in the time-averaged properties of the 
turbulent flowfield. (Buffeting due to large-scale unsteadiness, usually associated with flow separation, airplane 
cabin noise coming from the surface-pressure fluctuations of the external boundary layer, and “airframe noise,” 
mostly from deployed landing gear and flaps, are some exceptions.) 

Defining the time-averaged properties of a flow by averaging out the turbulent motions is very similar to de- 
fining continuum properties by averaging out molecular motions. In both types of averaging, we can still treat 
globally unsteady flows, provided there is sufficient separation between the global time scales we wish to re- 
solve and the random-motion time scales we wish to average out. Because turbulent time scales are typically 
many orders of magnitude longer than molecular-collision time scales, the requirement to keep the time scales 
separate is potentially more limiting for turbulence averaging than for molecular. In most applications, however, 
we'll be looking at time-averaged flows that are nominally steady, so that this is not a problem. In any case, 
the result of the averaging process is an imaginary continuum flow, or mean flow, in which we can identify the 
same kinematic features as in a real flow, such as streamlines, streamtubes, and timelines, and all of the same 
kinematic rules apply. And Lagrangian and Eulerian fluid parcels have the same meaning in the mean flow as 
they would in any other continuum flow. 

The analogy between averaging turbulent motions and averaging molecular motions is a close one, not just 
in terms of the mathematics, but also in terms of the effects of the physical processes that are averaged out. In 
Section 3.1, we saw how molecular motions relative to the average (continuum) motion of the fluid transport 
momentum and heat and that averaging over these motions defines the static pressure as well as the apparent 
continuum stresses due to viscosity and the heat flux due to conduction. Turbulent motions relative to the mean 
flow also transport momentum and heat and can do so even more effectively than molecular transport does. 
Time averaging the turbulent motions gives rise to apparent steady stresses that we call Reynolds stresses and to 
an average heat flux due to turbulent transport. These apparent stresses and heat fluxes are directly analogous to 
the viscous stresses and molecular heat conduction that we studied in Section 3.1. It is important to remember 
that both the turbulent and molecular versions of these stresses and fluxes are apparent and that in both cases 
they are actually due to transport by random unsteady processes that have been averaged out. 

Reynolds stresses affect the motion in the same ways as the apparent viscous stresses do, that is, in the ways 
we discussed in Section 3.6: 

1. Gradients in the stresses produce unbalanced forces on mean-flow fluid parcels, contributing to the 
parcels' accelerations. 

2. The unbalanced forces on fluid parcels result in the diffusion of mean-flow vorticity, usually in the dir- 
ection away from the body surface into the field. 


3. The apparent turbulent stresses result in dissipation into heat just as molecular viscous stresses do. This 
dissipation is not as direct as that due to viscous stresses, but the end result is much the same. 


It is helpful to think of the turbulent dissipation process as consisting of two separate processes taking place 
simultaneously: production and dissipation of TKE. First, the work done by the mean deformation against the 
turbulent stresses feeds directly into the kinetic energy of the turbulent motions. This turbulence production is 
nearly always positive, like molecular viscous dissipation. Second, the turbulent motions themselves contain 
local, unsteady velocity gradients that produce molecular viscous stresses that dissipate the TKE directly into 
heat (turbulence dissipation). It happens that practically everywhere in most flows, production and dissipation 
are roughly in local equilibrium, and the end result is as if the work done against the turbulent stresses were dis- 
sipated directly into heat. The fact that the energy dissipated by turbulent stresses goes through the intermediate 
stage of the kinetic energy of turbulence is therefore irrelevant for most practical purposes. For example, for 
purposes of computing the flowfield, it has been shown that TKE does not contribute significantly to the energy 
balance that determines the temperature and density distributions and so can be ignored in CFD calculations 
(see Cebeci and Bradshaw, 1984). 


Given that the basic physical effects of turbulence and molecular viscosity are so similar, in all of the pre- 
ceding discussion of viscous effects, and in the discussion in the remainder of the book, terms such as “shear 
stress” and “dissipation” can be taken to refer to either the molecular or turbulent variety. 

Of course, there are also important differences between turbulent and laminar flows. Outside the sublayer 
in a turbulent boundary layer, or in a wake, the apparent turbulent stresses are much larger than the viscous 
stresses, and the direct contribution of turbulence to viscous drag is therefore typically large. Thus keeping the 
flow laminar over at least part of the body surface can be a powerful means of drag reduction (discussed further 
in Section 6.3.1). In most practical applications, however, turbulent flow over at least part of the surface is ac- 
tually beneficial because of its greater resistance to flow separation, as we'll see in Section 4.1.4. 


These are the qualitative physical effects of turbulence on the mean flowfield. How are these effects repres- 
ented in the time-averaged equations? Think back for a moment to how this worked in the case of molecular 
transport. In Section 3.6, we saw that the continuum viscous stresses in the NS equations are represented in 
terms of simple velocity-derivative expressions such as wou/dy. This simple linear dependence of the stress on 
an instantaneous velocity derivative very accurately represents the effects of the molecular-motion physics that 
was “thrown out” in the averaging process that leads to the continuum equations. And not only is the NS model 
for momentum transport very simple, it introduces only one new fluid property, the viscosity coefficient , and 
it can usually be determined with sufficient accuracy as a function of the local static temperature alone; likewise 
for molecular heat conduction. Unfortunately, the physics thrown out in averaging over the turbulent motions 
cannot be so simply or accurately modeled. However, the practical importance of the effects of turbulence de- 
mands that we make the attempt, which leads us to the difficult problem of turbulence modeling. 

It is worth taking a brief look at how the turbulence-modeling problem arises formally. First, we'll look at 
how the raw turbulent stresses arise and are expressed in the equations, and then we'll consider some of the ways 
they are modeled. We start with the full, unsteady NS equations and apply the process of Reynolds averaging. 
Each dependent variable is decomposed into a time-averaged part and a left-over fluctuating part that contains 
all of the turbulent time dependence. For the velocity component u, for example, we have 


37, u=T4+u. 
We substitute expressions like this for all the dependent variables in the equations, and then time-average the 
equations. To simplify the result, we make use of identities that apply to the averaging integration, such as 


u—wU 


3.72 u' = 0, 
and appeal to the continuity equation, which applies separately to the mean and fluctuating parts of the flow- 
field. For constant-property flow, the process is relatively simple, and the details can be found in Schlichting 


(1979) or White (1991). For compressible flow, things are more complicated, and further assumptions have to 
be made, the details of which are given in Cebeci and Bradshaw (1984). For purposes of this discussion, we'll 
look at the forms that arise in incompressible flow. In the momentum equation, for example, the convection 
terms involve products of velocity components, which, when decomposed as in Equation 3.7.1 and simplified 
yield components of the Reynolds shear stress that look like 
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and components of the Reynolds normal stress that look like 
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The resulting time-averaged equations with these terms included are called the Reynolds-averaged Navier- 
Stokes (RANS) equations, and the time-averaged flow is what we have been calling the mean flow. The apparent 
turbulent stress terms like those given above are unknowns in these equations, and they represent the turbulent- 
motion physics that was thrown out in the averaging process. The addition of the unknown Reynolds stresses 
to our problem requires us to include additional equations embodying what we call a turbulence model. The 
Reynolds stresses depend on what's happening in the flowfield in complicated ways, and as we observed earlier, 
they cannot be “modeled” nearly as simply or as accurately as the viscous-stress terms are “modeled” in the NS 
equations. 

Remember that our objective is to avoid having to solve the full, unsteady NS equations for the details of 
the turbulent motions in what is called direct numerical simulation (DNS), which generally requires too much 
computational work to be practical. By introducing the RANS equations plus a turbulence model, we accom- 
plish this objective, but not by “simplifying” the equations. As we'll see below, the addition of a turbulence 
model actually makes the equation set more complicated than the unsteady NS equations. Instead, the way the 
turbulence-modeling approach saves computation is by enabling a lower resolution numerical solution to suf- 
fice. Resolving the turbulent motions in a DNS solution would require very fine spatial grids 1n all directions in 
the turbulent regions and very small time steps. And a 3D spatial grid would be required even for a nominally 
2D flow, because turbulent motions always require 3D resolution. Spalart (2000) has ey es that a DNS oe 
culation for a full large-airplane configuration at flight Reynolds number will require 10! ° orid points and 10 vs 
time steps to reach a statistically steady solution. The computing power that would make such a calculation 
practical is not likely to be available until around 2080. On the other hand, for the resolution of just the mean 
flow using the RANS equations with a turbulence model, a much coarser grid suffices, pee) in directions 
ea to the turbulent shear layers, and a full-airplane simulation requires only about 6 x 107 grid points and 
10° iterations or pseudo-time steps to converge to a steady solution, which became practical around 1990. As of 
2008, such a 3D solution could be computed in 40 000 seconds of clock time on a PC cluster using 64 CPUs. A 
RANS solution for a nominally 2D flow requires only a 2D grid, and as of 2008 a solution for the flow around 
an airfoil on a grid with 30 000 points could be obtained in 1400 seconds on a PC using a single CPU. 

Before we consider the various strategies for modeling the Reynolds stresses, let's look a little further at the 
nature of the physical problem we're dealing with. We've already pointed out the general similarities between 
molecular transport and turbulent transport of momentum. At a more detailed level, however, the two processes 
are very different, and it is in the differences that we will find the reasons why turbulence modeling is such a 
difficult problem. 

To explore the differences, let's start by reviewing the nature of molecular momentum transport. Consider 
one of the horizontal faces of a tiny Eulerian fluid parcel immersed in the fluid at a location in the flow where 
the mean velocity is horizontal with a value of if, as illustrated in Figure 3.7.1a. There is no net flux of mass 
across the face, because the face 1s parallel to the continuum flow velocity, but there is random motion of mo- 
lecules up and down relative to the mean flow, and therefore through the face from both above and below. These 


molecules carry with them, on average, horizontal velocities slightly different from I, reflecting, on average, 
the conditions where they experienced their last collisions, typically very small distances (on the order of the 
mean-free-path) above and below the face. The molecular shear stress txy in response to a positive velocity 
gradient Ou/Oy 1s a result of the transport, on average, of a slight excess of momentum from above the face and 
a slight deficit from below the face. The key feature of molecular momentum transport is that it is a result of 
molecular interactions over very short ranges and can therefore be represented by a simple, local model, as in 
the NS equations. 


Figure 3.7.1 Transport of momentum across the top face of a small fluid parcel. (a) By molecular motions. (b) 
and (c) By turbulent motions, seen at different instants in time 





(b) (c) 


Now consider the same mean-flow situation as we just did for the molecular shear stress, but assume that 





aayfaat 
there is now also a turbulent shear stress txy given by pet Again there is no mean velocity perpendicular to 


the face of our fluid parcel, but at any instant the perpendicular continuum velocity component v will usually 
be nonzero, as illustrated in Figure 3.7.1b. In these illustrations, v is shown as uniform over the parcel face be- 
cause the parcel is assumed small compared to distances over which v varies. In the case of molecular transport, 
we had molecules crossing the face in both directions at once, arriving from very short distances since their last 
collisions. In the case of turbulent transport, we have macroscopic masses of fluid sloshing across the parcel 
face from one direction and then the other, carrying with them some “memory” of conditions where they came 
from, conditions that depend on the complex turbulent structures (e.g., eddies, vortices, bursts, and sweeps) of 
which the sloshing motion is a part. The nature of the memory that the sloshing fluid masses carry is complic- 
ated because each part of the mass has its motion modified continuously by interaction with the surrounding 
fluid up until the instant that that part crosses the parcel face. The distances over which the sloshing carries 
some influence can be as large as the thickness of the turbulent shear layer or as small as the dimension of the 
smallest turbulent structure, but in any case many orders of magnitude longer than a molecular mean-free-path. 
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Thus although the Reynolds stress is a local statistical property of the turbulent motion, it depends dir- 


ectly on flow structure that is not only complex, but is much less local than the simple velocity gradient Ou/Oy 
that determines the viscous stress. 

So in the Reynolds-averaging process, we've thrown out some very nonlocal turbulence effects, and our task 
in turbulence modeling is to represent these effects in the context of the RANS equations, a set of local PDEs. 
The additional equations that are introduced in implementing various turbulence models tend to be mostly local 
algebraic or differential equations. Some models also include explicit nonlocal information, but generally only 
of a very low-level variety, such as the distance to the nearest solid surface, or some measure of the thickness 
of the turbulent shear layer as reflected in the mean-velocity profile. Otherwise, nonlocal effects are felt only 
through the behavior of turbulence-modeling variables such as “TKE,” “eddy viscosity,” “dissipation rate,” and 
so on, that depend on the flow solution. The local PDEs that govern the nonlocal behavior of such variables can 
represent only a limited repertoire of effects such as “generation,” “convection,” “diffusion,” “dissipation,” and 
“history” that at best are pale reflections of the complicated physics that was thrown out. This might seem like a 
recipe for failure, but so far, for obtaining practical predictions of turbulent flows, there seems to be no realistic 
alternative to this general approach to turbulence modeling. We just have to be realistic in our expectations and 
understand that our predictions will often be far from perfect. 

Ideally, we would like a turbulence model to possess the following: 

1. Universality, or the ability to reasonably represent any turbulent flow without requiring different em- 
pirical content for each category of flow; 


2. Invariance, by which the model gives equivalent results regardless of the reference frame in which a 
problem is posed; and 
3. Accuracy, sufficient for all reasonable engineering purposes. 

With regard to universality, currently available models tend to do well only on relatively narrow classes of 
flows. Models developed to do well on boundary layers and wakes do less well on highly 3D or highly curved 
flows, and so forth. Some models, but not all, meet the invariance requirement. With regard to accuracy, many 
models do very well on simple flows such as free shear layers and flat-plate boundary layers, for which much 
of the models' empirical content was derived, but do much less well on flows with strong pressure gradients or 
flow curvature. The effects of flow separation and of three-dimensionality in boundary layers are particularly 
difficult to model accurately, as we'll see in Sections 4.4.2 and 8.6.2. In RANS solutions for practical flow situ- 
ations, the turbulence modeling nearly always constitutes the most serious limitation on the physical fidelity of 
the simulation. 

There are two main distinctions to be made between the various modeling strategies: 

1. Whether the Reynolds stress is defined explicitly by the model (“Reynolds-stress models’’), or indirec- 
tly in the form of an eddy-viscosity or mixing-length relationship (“eddy-viscosity models” and “mixing- 
length models”) and 

2. The number and type of additional equations that constitute the model, over and above the mean-flow 
equations. In “algebraic” models, the additional equations are not differential (ODE or PDE). “One-equa- 
tion” or “n-equation” generally refers to the number of additional PDEs in a model. 

Eddy-viscosity modeling is based on an assumption proposed by Boussinesq in 1877 (see White, 1991), that 
the turbulent shear stress can be expressed in a manner analogous to the molecular shear stress in the NS equa- 
tions, that 1s, 


ou 
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where ut 1s the “turbulent” or “eddy” viscosity. When applied to an actual flow, Equation 3.7.5 simply defines 


ut as a function of the local statistics of the turbulent motions, that is, the ut!” correlation. When used for 
making flow predictions, as part of a turbulence model, Equation 3.7.5 shifts the problem from that of modeling 
tt to that of modeling py. The advantage of this is that u¢ can be simpler to model than tt, at least in the simple 
flows on which many models are based. A disadvantage is that there are limited regions in some flows in which 
the real prt is not well-behaved, and eddy-viscosity models will miss some of the details of such flows. Practical 
eddy-viscosity models are usually kept simple. For example, in 3D boundary-layer flows, where shear stresses 
in two directions must be calculated, the eddy viscosity is usually assumed to be isotropic, that 1s, the same in 
both directions, an assumption that is seriously violated in many flows, as we'll discuss further in Section 4.4.2. 
The mixing-length assumption proposed by Prandtl (1925) is closely related and is usually expressed in terms 
of the eddy viscosity: 
5 | OU 
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where / is the mixing length. When applied to a real flow, this equation is simply a definition of the mixing 
length /, just as Equation 3.7.5 was just a definition of pt. As part of a turbulence model, Equation 3.7.6 shifts 
the modeling problem again, from that of modeling pt to that of modeling !. Some of the earliest modeling ideas 
for turbulent boundary layers were more naturally expressed in terms of / than ut, an issue discussed in White 
(1991, Section 6.7). 

Early mixing-length and eddy-viscosity models were cast in terms of algebraic equations and made use of 
rudimentary nonlocal information such as the distance to the wall and a measure of the thickness of the shear 
layer. In general CFD grids, such nonlocal information is not always easy to define unambiguously and can be 
expensive to compute. Reducing or eliminating the need for nonlocal information in more recent models has 
generally dictated moving from algebraic equations to PDEs. 





A common strategy for generating additional equations for PDE-based models is to take “moments” of the 
momentum equation, that is, to take one or more of the components of the full unsteady momentum equation 
and to multiply each component of the equation by one of the velocity components, and to Reynolds-average 
the result. Depending on what combination of equations and velocity components is used, this can yield a con- 
servation equation for TKE or a transport equation for Reynolds shear stress. The new equation 1s effectively 
an equation for the unknown second-order turbulence correlation, that is, the TKE or the Reynolds stress, but it 
contains new unknowns in the form of third-order correlations such as pressure-velocity correlations and triple- 
velocity correlations. Such new equations shift the modeling problem from that of modeling the Reynolds stress 
to that of modeling the third-order correlations. The process can be repeated to shift the modeling to higher or- 
ders, but there seems to be little to gain. The appearance of adding more “physics” to a model simply by adding 
more equations using this type of operation is an illusion. There 1s only one physical (vector) equation involved, 
and generating higher moments of it does not add physics; it is just algebraic manipulation. The only way this 
approach could lead to models with higher accuracy would be if the behavior of the higher-order correlations 
turned out to be less dependent on the particular flow situation, thus giving the model greater “universality,” but 
there is little evidence so far that this is the case. 

Another approach that has been surprisingly successful is to invent a totally ad hoc “transport” equation for 
a quantity such as eddy viscosity or mixing length that is not actually governed by a physical conservation law. 
The model by Spalart and Allmaras (1994, the “SA model’) that is used in many RANS codes is an example 
of such a model, and for many types of flow, its performance is comparable to that of many more complicated 
models. 


Viscous flows in most practical applications involve turbulence, and quantitative prediction thus requires tur- 
bulence modeling. Turbulence models and flow predictions using turbulence models will be discussed several 
more times in the course of the book. In Section 6.2.2, we'll look at incorporating surface-roughness effects into 
turbulence models; in Section 7.4, we'll use calculations made using turbulence models to illuminate several as- 
pects of airfoil flows; and in Section 8.6.2, we'll discuss the turbulence-modeling issues related to swept wings. 


3.8 Important Dynamical Relationships 


In Section 3.3, we discussed the many things that can be deduced about the nature of flowfields based just on 
kinematics and on the fact that the velocity is a continuous vector field with only isolated discontinuities and 
singularities allowed in idealized models of the flow. In this section, we delve into the general relationships that 
can be deduced based on dynamics, with and without viscosity. 


3.8.1 Galilean Invariance, or Independence of Reference 
Frame 


Something we all take for granted as part of our Newtonian worldview is that the phenomena of fluid mechanics 
are physically the same, regardless of what reference frame we choose to view them in. Any fluid flow may be 
described with equal validity whether the reference frame is inertial (not accelerating or rotating) or 1s acceler- 
ating and/or rotating. Of course, this idea applies to classical mechanics in general, and a formal justification 
of it requires knowledge of Newton's laws. But there was some intuitive realization of it before Newton, as for 
example in the writings of Galileo and Leonardo da Vinci (see Anderson, 1997, p. 24). 

For most work in fluid mechanics we use an inertial reference frame. How the idea of independence of ref- 
erence frame is reflected in the mathematics in this case is worth at least a cursory look. First, the NS equations 
and the general statements of the BCs used with them are invariant under Galilean transformation, which means 
that the forms of the equations and BCs are unchanged by a transformation from one inertial frame to another. 
However, the phenomena the equations represent are not invariant in the same sense. They don't /ook the same 
in different reference frames; they are just physically equivalent when the effects of the different viewpoints 
have been accounted for. What we mean by “accounted for” is that things have been appropriately transformed 
in going from one frame to another. The descriptions of the same flow in two different reference frames are 
transformed versions of each other. So while the equations are invariant, the flowfields they govern, which are 
solutions to the equations, are merely transformable. 


In some applications such as propellers or other turbomachines, the flow may be steady and simpler to de- 
scribe in the reference frame of the rotating object. Then the effects of the rotating reference frame must be 
accounted for in the equations by inclusion of appropriate rotation terms. The equations are not invariant with 
respect to the rotational motion of the reference frame, but they can be transformed to account to the rotational 
motion. 

The fact that we can look at aerodynamic phenomena in different reference frames and know that they are ef- 
fectively the same comes in handy in many ways. For example, it is basic to the similarity between wind-tunnel 
testing and flight. We'll look at other aspects of dynamic similarity in Section 3.9. And in developing physical 
understanding, it is often helpful to look at different aspects of a phenomenon in different reference frames. 


3.8.2 Circulation Preservation and the Persistence of 
Trrotationality 


In Section 3.3.5, we defined the circulation as the line integral of the velocity around a closed contour and saw 
that it 1s equal to the total vorticity piercing any capping surface bounded by the contour, by virtue of Stokes's 
theorem. The circulation can reflect the dynamics in ways that make it a useful quantity in numerous situations; 
for example, as we'll see in Section 7.2, it is related to lift through the Kutta-Joukowski theorem. 

It is often useful to be able to make general statements about whether circulation is preserved or not. Consid- 
ering the circulation around a closed material contour that moves with the flow, under what conditions can we 
expect that it is preserved, that is, that it remains constant in time? In other words, under what conditions is the 
Lagrangian rate of change of the circulation equal to zero? This can be written as 

a = — pVetd=0 
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Note that we can reverse the order of differentiation and integration in Equation 3.8.1, so that it becomes 
the statement that the line integral of DV/Dt around a closed contour is zero, which is true only if DV/Dt is 
the gradient of a scalar function, called an acceleration potential. (See the analogous argument in Section 3.3.6 
regarding the velocity potential in irrotational flow.) By invoking the momentum equation, we can show that 
this requires: 

1. No net viscous force on any fluid parcel on the contour. This requires either that the viscosity is zero, 
or that the fluid is uniform and the flow is irrotational. But the second of these options is of little interest, 
because requiring irrotationality means that Equation 3.8.1 is satisfied trivially. 


2. Any external body force is the gradient of a potential, such as gravity. 


3. The fluid is barotropic, that is, the density is a function of pressure only. This is commonly satisfied 
only if the fluid is homocompositional and entropy is constant in the flowfield, conditions that in aerody- 
namics often apply to inviscid flow in the absence of shocks. But zero viscosity is not strictly required for 
this condition to be met. A uniform fluid meets this requirement by definition, even if it is viscous. 


This result is sometimes referred to as Kelvin's circulation theorem. In some references (for example, Milne- 
Thomson, 1966), condition 3 is not mentioned explicitly, but it is still implicitly assumed. 

Circulation preservation of this kind supports the persistence of irrotationality in many situations. This is 
easy to see if we think of a flow that starts out irrotational, say upstream of its approach to a body or other 
source of disturbance. In this initially irrotational state, the circulation must be zero around any reducible closed 
contour. Then if the flow is circulation preserving, any circulation must remain zero as the flow progresses. 
The vorticity must also remain zero, since if any nonzero vorticity appeared, there would be nonzero circulation 
around some closed contour, which would violate our assumption of circulation preservation. So circulation 
preservation means that a flow that starts out irrotational will remain irrotational downstream. 

Preservation of irrotationality is especially simple in the special case of a uniform fluid (constant density 
and viscosity). A uniform fluid satisfies condition 3 above by definition, and an irrotational flow of a uniform 
fluid produces no net forces on fluid parcels even if the viscosity is nonzero (see Section 3.6), so that it also 
satisfies condition 1. For the kinds of compressible flows we are likely to encounter in practice, variations in 
fluid properties tend to be slow enough that condition | is effectively, though not exactly, satisfied in regions of 
irrotational flow, and condition 3 is satisfied in regions where the flow is isentropic. 

In Section 3.6, we discussed the general flow pattern that characterizes external flow past bodies at high 
Reynolds numbers (small viscosity), in which a vortical boundary layer develops close to the body surface, sur- 


rounded by essentially irrotational flow that behaves as if it were inviscid. What we have seen here regarding 
the persistence of irrotationality is consistent with that picture. Ordinarily, you should expect that when the flow 
approaching a body from upstream is irrotational, significant vorticity will appear in the flowfield only where it 
has diffused and convected from the body surface or where it has been produced by a strong, curved shock (see 
Crocco's theorem, Section 3.8.5 and the discussion of shocks in Section 3.11). 


3.8.3 Behavior of Vortex Tubes in Inviscid and Viscous 
Flows 


In Section 3.3.7, we introduced the concepts of vortex tubes and vortex filaments, and we saw that the intensity 
of a vortex filament is constant along its length (Helmholtz's second theorem). By noting that we can consider a 
vortex tube as a bundle of vortex filaments, we can define the intensity of a vortex tube as the sum of the intens- 
ities of the filaments within the tube. (For a vortex tube of finite cross section, this “sum” is actually an integral, 
because vortex filaments are of infinitesimal size.) Now we assume that the flow is circulation preserving in 
the sense defined in Section 3.8.2, that is, that the conditions for Kelvin's circulation theorem are met. In this 
instance, we are by definition concerned with flows with nonzero vorticity, so the only option that satisfies con- 
dition | for circulation preservation is for the fluid to be inviscid. Then two dynamics-based theorems apply: 


Helmholtz's third theorem: As a vortex tube evolves in time, fluid parcels cannot cross the bounding 
surfaces of the tube, and as a result the tube is always made up of the same fluid parcels. 


Helmholtz's fourth theorem: The intensity of a vortex tube is constant regardless of how the tube moves 
around. 


A proof that makes use of Stokes's theorem is given by Milne-Thomson (1966). In a rigorous sense, 1t might 
seem that these two theorems are not very informative or useful, because they deal with vorticity in inviscid 
flows, while in most cases of interest in aerodynamics the only way for vorticity to be there is through the ef- 
fects of viscosity. But if we view the inviscid case to which they apply as being the limiting case for flows with 
small viscosity, we can get some useful insight from them. Basically, what Helmholtz's third theorem tells us 1s 
that for small viscosity, the natural tendency of a vortex tube is to remain anchored to the same material tube, 
and that vorticity migrates into or out of that material tube only through viscous diffusion. In Section 3.6, we 
discussed the vorticity equation and the transport of vorticity by convection and diffusion. Helmholtz's third 
theorem provides further insight into what it means for vorticity to be transported mainly by convection. Helm- 
holtz's fourth theorem tells us something that's a bit harder to visualize: that vorticity migrates into or out of a 
vortex tube only by viscous diffusion. 

The assumption of small viscosity is not always as limiting as it might seem. Nearly all of the vorticity in 
aerodynamic flows is generated at solid surfaces, and in the thin viscous or turbulent boundary layers on those 
surfaces, the effects of viscosity are not small. So in these parts of the flow, the insights provided by these theor- 
ems are not very applicable. But when the vorticity is convected into the wake away from the body, the viscous 
and turbulent stresses are rapidly reduced, and the above insights become more useful, as, for example, in Sec- 
tion 8.1, when we consider the vortex wake behind a 3D wing. 


3.8.4 Bernoulli Equations and Stagnation Conditions 


We've seen that effectively frictionless flow conditions are frequently encountered in practice, particularly out- 
side the viscous boundary layer that develops near the surface in flows around bodies. Under these conditions, 
with a couple of additional restrictions, Bernoulli's principle defines a simple relationship between the pressure 


and the velocity magnitude. This is analogous to the many situations in classical particle mechanics in which 
the total force exerted on a particle can be expressed as the gradient of a potential that is a function of position 
only, a condition that often applies in the absence of friction effects. In these situations, the sum of the “potential 
energy” and kinetic energy is constant, which leads to a simple relation between the potential function value 
and the particle's velocity magnitude. 

To determine the requirements for the analogous situation to exist with regard to the net force on a fluid par- 
cel, we can either look directly at the energy equation or take a line integral of the momentum equation. Either 
way, we find that if: 

1. There is no net viscous force on fluid parcels; 
2. external body forces are negligible; 
3. there is no heat conduction; and 


4. the flow 1s steady, 





then the total enthalpy 
l p 
H=—>—+e+= 
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is constant along streamlines, where e is the thermodynamic internal energy per unit mass. Note that under these 
conditions p/p acts like a form of potential energy. Note also that conditions (1) and (3) amount to specifying the 
absence of entropy-generating processes (assuming bulk-viscosity effects are also negligible), so that entropy 
must also be constant along streamlines. 

Now we introduce the concept of stagnation conditions, which can be defined for general flows that don't 
necessarily satisfy the assumptions for Equation 3.8.2. From any point in a general flow, we can imagine bring- 
ing the fluid parcel passing that point to rest (stagnation, or zero velocity) by an imaginary steady-flow process 
in which the viscous forces and heat conduction that may be active in the general flow are “turned off” and for 
which Equation 3.8.2 therefore applies. Because the imaginary stagnation process is isentropic, the stagnation 
values of T, p, and p are uniquely defined, and we'll designate them by the subscript ¢ for “total.” In general 
flows, these stagnation conditions need not be constant, either along or across streamlines. It is important to 
keep in mind that the stagnation temperature, pressure, or density is not generally a real temperature, pressure, 
or density, but is only an imaginary construct defined by an imaginary process. The exception 1s a real stagna- 
tion point in a steady flow, where the stagnation condition can actually be realized. 

Returning our attention to flows that satisfy Equation 3.8.2 and introducing stagnation quantities, Equation 
3.8.2 becomes 
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along streamlines, where all of the subscript ¢ quantities are constant along streamlines as well. Now if we as- 
sume a perfect gas and impose the constant-entropy condition, we have 
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and 





so that Equation 3.8.3 becomes (see National Advisory Committee for Aeronautics (NACA) Report 1135, Ames 
Research Staff, 1953) 
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along a streamline, where the subscript t quantities are constants for that streamline. We'll call this the steady, 
compressible Bernoulli equation for frictionless, nonconducting flow of a perfect gas. If the density 1s constant, 
that is if the flow can be assumed effectively incompressible, in that the density change in response to any pres- 
sure change in the flowfield is a sufficiently small percentage (see Section 3.10), this reduces to 
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along a streamline. 


To recap, the Bernoulli principle can be simply stated: For steady, frictionless flow with no body forces 
and no heat conduction, the total enthalpy is constant along streamlines. Because such flows are also generally 
isentropic along streamlines, the total pressure and temperature are also constant along streamlines. How the 
“Bernoulli constant” and the stagnation conditions vary from streamline to streamline, however, depends on 
how the flow was established, which brings us to our next topic. 


3.8.5 Crocco's Theorem 


Assume that a flow satisfies the conditions given above for the Bernoulli principle to apply, so that Equation 
3.8.2 holds along streamlines. If we assume further that the fluid is homocompositional, that is, that it consists 
everywhere of the “same stuff,’ as would be the case if the relative species concentrations in a mixture of gasses 
were constant, then a standard thermodynamic relationship between T, S, e, p, and p (one of the familiar “TdS” 
or “tedious” equations, see Liepmann and Roshko, 1957) applies throughout the flowfield. Combining this with 
the momentum equation (remember that the flow is assumed steady and frictionless), we get 


3.8.8 VA=TVS+V xw, 


which is known as Crocco's theorem. 

We saw earlier that it is common in external flows around bodies for the flow to be “established,” that is, to 
come from upstream, in an irrotational state. If it continues to satisfy the requirements for Crocco's theorem to 
apply in the part of the flowfield outside the viscous boundary layers near the body surface, the flow will remain 
irrotational, in which case we can see that constant H implies constant S and vice versa. Thus in the effectively 
inviscid parts of many external flows, the total enthalpy, our “Bernoulli constant,” will usually be constant from 
streamline to streamline, as well as along streamlines. If such a flow passes through a curved shock (see Section 
3.11.2), however, the situation downstream of the shock will be more complicated. The portions of the shock 
through which different streamlines pass will have different obliqueness, and S will therefore vary from stream- 
line to streamline, though H will still be constant. In Equation 3.8.8, the RHS will be zero, and the varying S 
will require nonzero vorticity. Thus a curved shock inevitably produces some vorticity downstream, though it is 
typically very weak unless the shock is very strong. 

For a more detailed discussion of circulation preservation, Bernoulli's principle, and Crocco's theorem, read- 
ers would do well to consult Chapter 3 of Batchelor (1967). I hope those who do will accuse me at worst of 
oversimplifying and not of plagiarizing. 


3.9 Dynamic Similarity 


Using flows around subscale models to simulate what would happen at full scale has been a major tool 1n aero- 
dynamic development since the Wright brothers built their first wind tunnel. Subscale model testing has made 
it practical to explore many more design options and a wider range of flight conditions than could be explored 
by flight testing alone, and the savings in time, money, and test pilots' lives have been incalculable. Advances 
in CFD in recent years have made it possible for CFD to replace some subscale model simulation, but it will be 
a long time before computation replaces all of it. 

In practice, the similarity between a subscale flow simulation and the real thing is rarely even close to perfect. 
However, defining the theoretical requirements for a perfect simulation will tell us what factors are important 
and provide at least some basis for anticipating how accurate our actual simulations are likely to be. 

A perfect simulation requires that the model geometry and the model flow be geometrically scaled versions 
of the real things, with all of the detailed forces and accelerations and other internal processes reproduced in 
a properly scaled fashion. Presumably if we knew everything we needed to know to be able to set up such a 
simulation correctly, we would also know the scaling factors for converting quantities such as pressures and 
integrated forces from model scale to full scale. The early pioneers were able to deduce some of the important 
scaling factors from a combination of intuition and experience, such as the scaling of forces with surface or 
cross-sectional area times the square of the velocity, but they were not always aware of the more subtle require- 
ments for similarity. With our advanced understanding of the governing equations, we are 1n a position to derive 
these requirements with some rigor. 

The basic idea is that if we “nondimensionalize” the governing NS equations and BCs appropriately for both 
situations (model and full scale), the two flows will be similar if they represent the same solution to the dimen- 
sionless equations. Of course, this can be expected to happen only if the dimensionless equations themselves, 
and the dimensionless BCs, are the same for both situations. This sounds simple enough, but there are subtleties 
involved, as we'll see below. 

To derive the dimensionless equations, we start by replacing each dimensional quantity in the equations with 
a product of a dimensional reference constant and a dimensionless version of the original variable. The refer- 
ence constants are simply quantities appropriate to the flow situation, such as freestream flow properties (e.g., 
velocity and density) and a reference length that characterizes the body, and we naturally have to use equivalent 
definitions for these constants for both flow situations. We make these substitutions likewise for all of the inde- 
pendent spatial coordinates and all of the dependent flow variables. Then for each equation, we choose a group 
of the reference constants having the right combined dimensions so that when we divide the entire equation by 
that group, the equation becomes dimensionless. There is no unique right choice of the group of constants to use 
for this nondimensionalizing step, but the resulting equations are of course equivalent, no matter what group 1s 
chosen. 

The conventional choice is based on the observation that inviscid flows have no inherent scale dependence, 
and that it is therefore convenient to choose the nondimensionalizing group so that the nonviscous terms in the 
dimensionless equations end up looking just as they do in the original equations, with no scale-dependent coef- 
ficients multiplying them. For the momentum equations, this means dividing by Pref Uref”/Lref, and for the en- 
ergy equation in total-enthalpy form by uref pref/Lref. In our final set of dimensionless equations, the nonviscous 
convective terms look the same as they did in the dimensional equations, while the pressure gradient, viscous- 
transport, and heat-conduction terms have dimensionless “scaling” or “similarity” parameters multiplying them. 

Now we see that for the dimensionless equations to be the same for both the simulation and for full scale, the 
similarity parameters multiplying the pressure and transport terms must have the same values in both cases. 


Now let's look specifically at the dimensionless parameters that arise. We form dimensionless versions of all 
the basic variables in the equations, designating them by putting “hats” over them. We nondimensionalize the 
basic variables as shown in Table 3.9.1. 


Table 3.9.1 Basic variables 


Type of quantity 
Spatial coordinates i 





We invert these definitions (e.g., “& = +l “f “) and substitute them into the form of the equations valid for a 
perfect gas with constant specific heats. We also make use of the specific heat relationship cp = yR/(y — 1), 
where y is te ratio of specific heats and R 1s the gas constant, and of the fact that the speed of sound is given by 
a = (yp/ py! >’ In the resulting dimensionless equations, the continuity equation and the convective terms in the 
momentum equation and in the energy equation in total-enthalpy form look exactly as they did in the original 
dimensional equations. Terms in which dimensionless similarity parameters appear are as shown in Table 3.9.2, 
where Moo = Uco/aco 18 the freestream Mach number, Ry = pooUol/oo is the Reynolds number, and Pr = Lcp/k is 
the Prandtl number. 


Table 3.9.2 Terms appearing in dimensionless equations 


Type of term i we 
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Based on this, we might expect perfect similarity if we match y Mo”, Ri, and Pr. But things aren't quite so 


fa! 





simple with regard to y and Mo. One of our primary variables in the energy equation is H , which is defined in 
terms of other dimensionless variables as 
ar 
H = ——T +- = 2 i”. 
3.9.1 y-1 . 
So the internal energy (first term) and kinetic energy (second term) in the total enthalpy depend on y in dif- 
ferent ways, and true similarity requires that we match y and Mw separately. 








To summarize, similarity parameters that must be matched for perfect similarity are: 


Ratio of specific heatsy=cp/ey 


Reynolds number RL = Pootcol/Loo 


Prandtl number 
Pr = Ucp/k 





Note that some of these are fluid properties (y and Pr) and some (Mw and RL) depend in addition on the 
reference flow conditions, in this case the far field. Now let's consider the major scaling and similarity issues 
in detail: compressibility effects, the effects of viscosity, scaling of pressure forces, and the consequences of 
failing to match the similarity requirements. 


3.9.1 Compressibility Effects and the Mach Number 


Throughout the flowfield the interaction between the pressure and density for compressible flows is governed 
by implicit relationships built into multiple equations, and y and Ma correspondingly appear in several places. 
We saw above that similarity requires us to match y and Mw separately. 

Of the flow-dependent parameters, Mach number has the most straightforward physical interpretation. It de- 
termines how large the pressure changes in the flowfield will be, relative to the absolute pressure, and thus how 
important pressure-induced density changes will be, a topic we'll consider further in Sections 3.10 and 3.11. The 
Prandtl number is important with regard to thermal effects in boundary layers, and we'll consider it further in 
Section 4.6.2. 


3.9.2 Viscous Effects and the Reynolds Number 


The Reynolds number is also widely familiar, but its proper interpretation is more subtle. Reynolds number is 
sometimes described as measuring the “ratio of inertial forces to viscous forces,” which is incorrect, or the “rel- 
ative importance of inertial forces and viscous forces” in the flow, which is not as bad, but not quite right either. 
Actually, the term 1/R multiplies the viscous terms in the momentum and energy equations, but it does not dir- 
ectly reflect the relative magnitudes of the forces locally. In any external flow with a Reynolds number above 
about 107, there will be part of the flow outside the boundary layer where the ratio of inertial forces to viscous 
forces will be practically infinite and part of the flow deep in the boundary layer where it will be practically 
zero. Thus the ratio of local forces is obviously not what the Reynolds number directly determines. 

What the Reynolds number does determine is how “fast,” relative to the flow velocity, momentum will be 
diffused in the cross-stream direction by viscosity or turbulence and thus how thick the boundary layer will 
grow relative to the dimensions of the body. As Reynolds number increases, the diffusion of momentum be- 
comes relatively slower, and the boundary layer will be thinner. This increases the velocity gradient inside the 
boundary layer, which offsets the reduction in the 1/R multiplier in front of the viscous terms. The result is that 
the variation across the boundary layer in the relative importance of viscous forces stays essentially the same, 
regardless of the Reynolds number: At the “bottom” of the boundary layer, viscous and pressure forces domin- 
ate, while at the “edge” of the boundary layer viscous forces are small, and inertial and pressure forces domin- 
ate. Thus the idea that the Reynolds number measures “relative importance” makes sense only if we apply it to 
global viscous “effects” as opposed to local viscous “forces.” As the Reynolds number increases, the boundary 
layer becomes thinner and has less effect on the rest of the flow. It is only in this indirect sense that the Reynolds 
number measures the relative importance of viscous effects. 


We've just used a simple physical argument to explain why the effects measured by the Reynolds number 
are subtle. The corresponding mathematical argument is less intuitive and starts with the observation that the 
viscous terms involve the highest order derivatives of the velocity and therefore play a unique role in the equa- 
tions. If we set 1/R to zero, the viscous terms vanish, and we have the inviscid equations. We might expect, then, 
that for small values of 1/R, a viscous solution could be obtained as a small perturbation to a corresponding 
inviscid solution. This expectation is wrong, however, because when the 1I/R terms are dropped to obtain the 
inviscid equations, the order of the equations is reduced, and with it the number of allowable BCs. The BC we 
must give up to allow nontrivial inviscid solutions is the no-slip condition. To obtain a viscous solution from 
the corresponding inviscid solution, the no-slip condition must be reinstated, and there will always be a region 
close to the wall (the boundary layer) where the perturbation imposed on the inviscid solution by viscosity 1s 
not small. In mathematical terms, this is an example of a singular perturbation, and the kind of behavior we've 
been discussing is characteristic of singular-perturbation problems. In Section 4.2.1, we'll discuss how the im- 
plementation of the singular-perturbation idea leads to a rigorous derivation of boundary-layer theory. 

So far, we've seen that the way the characteristics of a flow depend on the Reynolds number is complicated 
by the fact that a change in the 1/R multiplier in front of the viscous terms tends to be offset by a change in 
boundary-layer thickness. This complication is compounded by additional subtleties. The location on the body 
where the boundary layer transitions from laminar to turbulent usually depends on Reynolds number, and this 
can strongly affect the development of the boundary layer and thus the rest of the flow. Though the variation 
of boundary-layer thickness with Reynolds number in laminar and turbulent boundary layers is similar qualitat- 
ively, it is significantly different quantitatively. The location of laminar-to-turbulent transition strongly affects 
both the profile drag in the attached-flow regime and the resistance of the boundary layer to separation down- 
stream. Such effects can be very important in airfoil flows, as we'll see in Section 7.4, and in bluff-body drag, 
which we'll discuss in Section 6.1.6. 


3.9.3 Scaling of Pressure Forces: the Dynamic Pressure 


2 
When we divide the momentum equations by ref Uvef/Lref, the resulting dimensionless convection terms look 
just as they did in their original dimensional form. For example, the original dimensional term 
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When we nondimensionalized the equations to derive the similarity parameters, we nondimensionalized p by 
Po. But this nondimensionalization of the pressure isn't very convenient for dealing with low-speed flows. If, 
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instead, we take our reference pressure equal to what is called the dynamic pressure ret the 


original dimensional pressure-gradient term: 
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where /? ~ F’/ fl. We introduced the factor of 1/2 in the definition of our reference pressure because it arises 
when we integrate the momentum equation to obtain Bernoulli's equation, as we saw in Section 3.8.4, and g 
defined in this way is equal to the difference between local static pressure and total pressure in the incompress- 
ible, or low-Mach-number case, as in Equation 3.8.7. (A discussion of what “incompressible” means 1s coming 
up in Section 3.10.) 

Normalizing pressure differences by g is very convenient because it removes the dependence on density and 
velocity in incompressible flows, and it has therefore become second nature to aerodynamicists. And g is used 
as a reference for compressible flows as well, though outside of the low-Mach regime, it no longer removes all 
of the dependence on velocity, or Mach number. 


3.9.4 Consequences of Failing to Match All of the 
Requirements for Similarity 


In most subscale flow simulations we must tolerate a substantial mismatch in one or more of the similarity para- 
meters. How does the accuracy of a simulation suffer as a result? In this regard, the first thing to note 1s that the 
parameters are not all of equal importance and that their relative importance depends on the situation. 

In flows with sufficiently low Mach number throughout the field, matching Mach number is not important, 
while in flows where local Mach numbers approach or exceed one, matching Mach number is crucial. For- 
tunately, Mach number is formally scale independent and therefore relatively easy to match regardless of scale. 
In cases where Mach number is important, we can usually avoid having to tolerate a serious mismatch. 


Unfortunately, we can't say the same of the Reynolds number, which varies directly with scale. For a given 
flow condition, a reduction in scale (our main reason for doing subscale simulation) entails a reduction in Reyn- 
olds number. Testing at elevated pressures can alleviate the problem but can't generally eliminate it. At high 
subsonic speeds, testing at total pressures above about 5 atm is impractical because air loads on a typical air- 
plane model become too high, even for a model constructed of solid steel. The only way the full-scale Reynolds 
number of a large airplane can be matched on a small-scale model in a wind tunnel is through a combination of 
elevated pressure (up to 5 atm) and very low temperature (to — 250 °C), called cryogenic pressure testing. 

Such testing is very expensive and accounts for only a miniscule fraction of all wind-tunnel flow simulation. 
Most other subscale testing 1s done at Reynolds numbers roughly an order of magnitude below full scale. As 
we Saw previously, the dependence of a flow on the Reynolds number is complicated, with contributions from 
a number of physical factors. The sacrifice in simulation accuracy due to a mismatch in the Reynolds number 
varies greatly depending on the situation, but it 1s usually serious. A great deal of work has been devoted to the 
problem of understanding, minimizing, and accounting or correcting for the “Reynolds-number effects” associ- 
ated with testing at sub-full-scale Reynolds numbers, with mixed success (see Bushnell, 2006, for a review). 

In our derivation of the terms in which similarity parameters appear (Table 3.9.2), we assumed a perfect gas. 
The idea of a perfect dynamic similarity is problematic if we consider the more complicated behavior of real 
gases over wide ranges of conditions. In the general case, the pressure and temperature, as well as the compos- 
ition of the gas, would have to be matched, in which case there could be no difference in density, and no differ- 
ence in scale would be allowed. Fortunately, air under conditions of interest to us in aerodynamics behaves very 
much like a perfect gas, with practically constant y and Prandtl number, so that mismatches in parameters other 


than the Mach and Reynolds numbers tend to have very minor effects (see White, 1991). For example, because 
of the nonlinear form of the Sutherland law for the dependence of viscosity on temperature, perfect similarity 
would require matching temperature. However, because the temperature variation within a given flow is usually 
not large on an absolute scale, the effect of even a large mismatch in temperature at a given Reynolds number is 
very small. 


3.10 “Incompressible” Flow and Potential 
Flow 


In early theoretical work in fluid mechanics, the fluid was generally assumed to have constant density regardless 
of the pressure, and thus to be “incompressible,” an idealization that greatly simplifies the mathematics and 1s 
actually justified at sufficiently low Mach numbers. The term “incompressible,” however, is a potential source 
of confusion. Literally speaking, saying that something 1s “incompressible” means that it cannot be compressed. 
In this sense, the term is only approximately applicable to liquids and not at all applicable to gases. Yet even 
with reference to gases, we often talk loosely about “incompressible flow.” This is sloppy terminology. We don't 
mean what the term literally says (1.e., that the flow cannot be compressed). What we really mean is that the 
flow happens not to be getting compressed significantly in this particular situation and that it therefore behaves 
as if it were incompressible. 

In gases at low Mach numbers, a flow can act as if it were incompressible, in that we can make very accurate 
predictions using equations in which we have assumed that the density is constant. But how can constant density 
be a reasonable assumption for gases, which are by definition highly compressible? Even in flows at low Mach 
numbers, where velocity differences, pressure differences, and density differences are all small, the density dif- 
ferences are of the same order as the pressure differences. How can the pressure changes be important and the 
density changes not be? The answer lies in the different roles that pressure and density play in affecting the 
motion. In the application of Newton's second law to a fluid parcel, only a pressure difference can apply an un- 
balanced force, while the density itself provides the resisting inertia. In the momentum equation, this is reflected 
in the fact that the pressure appears inside a derivative (gradient operator), and the density does not. While the 
small velocity and pressure differences are crucial players in the momentum balance, the small density differ- 
ences have a comparatively much smaller effect on the flow. 

Note that to justify treating the flow as incompressible we do not have to assume small disturbances in the 
sense of small velocity changes relative to the freestream velocity. The changes in velocity can be large relative 
to freestream, as long as the velocity itself remains small relative to the speed of sound. 

So the equations of motion for an “incompressible” fluid are valid for flows of highly compressible gases, in 
the limit of low Mach number. If we also assume that the flow is steady and inviscid, and that the onset flow is 
irrotational, we have met the conditions for classical potential flow or ideal flow theory. As we saw in Section 
3.8.2, a constant-density inviscid flow is circulation preserving, so that irrotational onset flow means the flow 
will be irrotational everywhere. Then, as we saw in Section 3.3.6, the velocity field can be represented as the 
gradient of a scalar potential function: 


3.101 ¥ = V9, 


and the continuity equation is Laplace's equation for 9: 
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Assuming the pressure is related to the velocity by the “incompressible” Bernoulli equation (Equation 3.8.7) 
with a constant total pressure, any flow that satisfies Equation 3.10.2 satisfies both the continuity and mo- 
mentum equations. Under these conditions, the energy equation is not needed, and Equation 3.10.2 with appro- 
priate BCs represents the complete system describing the inviscid, irrotational flow of a constant-density fluid. 
The problem has been reduced to a single linear equation with a single scalar as the dependent variable. 

Because the incompressible potential-flow equation is linear, solutions can be constructed by superposition 
of more than one known solution. The velocity potentials for elementary building-block flows around simple 
singularities are often used for this purpose. Some examples are illustrated in Figure 3.10.1. A vortex Figure 
3.10.la can be used to produce circulation, and sources or sinks Figure 3.10.1b can be used to simulate the 
thickness effect of a body. The combination of a source and sink of equal strength results in flow pattern Figure 
3.10.1c. In the limit as the separation distance between them goes to zero, keeping the product of the source/ 
sink strength and the separation distance constant, we have a doublet Figure 3.10.1d. Combining a doublet with 
a uniform flow produces a pattern Figure 3.10.le in which the outer part is the potential flow around a circular 
cylinder. And many other combinations are possible. 


Figure 3.10.1 Examples of streamline patterns associated with simple potential-flow singularities that can be 
used in constructing other potential-flow solutions by superposition. (a) Vortex. (b) Source or sink. (c) Com- 
bination of source and sink of equal strength. From Durand, (1932). Used with permission of Dover Publica- 
tions, Inc. (d) Doublet. From Durand, (1932). Used with permission of Dover Publications, Inc. (e) Combina- 
tion of a doublet and a uniform flow yielding a circular cylinder. From Durand, (1932). Used with permission 
of Dover Publications, Inc. 





For 2D flows, conformal mapping provides solutions for flows over convex and concave corners. Knowing 
the character of these solutions is useful just for thinking about the local effects of corners in general, and in 
Section 4.3.2 we'll see that they provide the outer-flow BCs for a family of special solutions to the boundary- 
layer equations. The basic idea is that a simple analytic mapping can transform the x axis into a line that is bent 
at the origin through an angle 2$/2. The uniform flow in the upper half plane, illustrated in Figure 3.10.2a, is 
then transformed into corner flows like those illustrated in Figure 3.10.2b—d. The velocity distribution along the 
surface is given by u ~ x", where m =— B/(B — 2). 


Figure 3.10.2 Illustrations of potential flows over concave and convex corners that can be produced by con- 
formal mapping of the uniform flow in the upper half plane. (a) Uniform flow. (b) A convex corner: — 1 < B < 
0. (c) A concave corner: 0 < B < 1. (d) A 90-degree concave corner: B = 1 


(c) (d) 


Velocity distributions that go with the different types of corners are illustrated in Figure 3.10.2. In Figure 
3.10.2b, we see that a convex corner produces infinite velocity at the corner and decelerating flow away from it. 
In Figure 3.10.2c, a concave corner produces zero velocity (a stagnation point) at the corner. Combine this flow 
with its reflection about the x axis, and we have the flow off of a wedge-shaped trailing edge. This demonstrates 
that in potential flow, an airfoil with a nonzero trailing edge angle must have a trailing-edge stagnation point. In 
Figure 3.10.2d, we have the special case of B = 1, a 90° corner with a stagnation point and linear acceleration 
of the flow away from the corner. Combine this flow with its reflection about the y axis, and we have the pro- 
totype for flow attaching itself to a smooth wall at a stagnation point. Conformal mapping can also be used to 
transform solutions for very simple shapes such as circles into solutions for more-complicated shapes. With the 
conformal-mapping approach, analytic solutions are possible for a wide variety of shapes. 


We can obtain numerical solutions for 2D flows either by discretizing the conformal-mapping problem or 
by the “panel method,” that is solving for a distribution of unknown singularity strength along a set of discrete 
panels representing the body surface. When the panel method is extended to 3D flows, not only the surface of 
the body must be “paneled,” but also any vortex wakes that exist in the field, which, as we saw in Section 3.3.8, 
can be idealized as surfaces of discontinuity in the velocity field. 

As we saw in Sections 3.6 and 3.8.2, external flows around bodies at high Reynolds numbers remain ef- 
fectively irrotational outside of a thin viscous boundary layer that develops near the surface. Potential flow is 
therefore not a bad approximation for many of these flows and can often provide very reasonable predictions 
of pressure distributions and integrated forces at low Mach numbers. The pioneers of theoretical aerodynamics 
took full advantage of this, and before large-scale computation of nonlinear flow problems became practical, 
much of our quantitative prediction capability depended on incompressible potential-flow theory, most often 
implemented in the form of panel methods. It remains a powerful tool both for conceptualizing and for quantit- 
ative calculations. 


3.11 Compressible Flow and Shocks 


“Compressibility” deals with changes in fluid density in response to changes in pressure. In the previous section, 
we discussed how these changes are insignificant at low Mach numbers. As Mach number increases, however, 
we can no longer ignore the density changes, and when the local Mach number exceeds one, their effects on 
the flowfield become so strong that the character of the flow changes drastically, as reflected for example in the 
response of the flow to changes in cross-sectional area, which is essentially reversed between the subsonic and 
supersonic regimes. 

There are small-disturbance flow regimes, both subsonic and supersonic, in which the effects of compress- 
ibility are treated approximately, in the context of linear equations. These theories have such narrow ranges of 
applicability, however, that they are usually not very useful, and they have been marginalized in recent years by 
the availability of higher-fidelity computational methods. We'll touch on the linear theories again in Chapter 9 
when we survey broad classes of theoretical approximations. In this section, we explore what happens when we 
attack the problem of compressibility head on, without assuming that disturbances are small. 

In compressible flows, the changes in density bring with them changes in local temperature and in all of 
the fluid properties that are affected by temperature. These changes have significant local effects on the flow, 
and accurate flow prediction requires solving for the altered local fluid properties in detail. As we discussed 
in Section 3.1, the viscosity, thermal conductivity, and local speed of sound are usually assumed to depend on 
temperature only. 

A general feature of most transonic and supersonic flows is the appearance of compression shocks, regions 
in which significant amounts of deceleration and compression (pressure increase) take place over very short 
distances, on the order of the mean free path of the gas molecules. The extremely large gradients inside a shock 
are beyond the strict range of validity of the NS equations. As we've already noted in Section 3.1, the NS equa- 
tions don't predict the internal structure of a shock accurately, but because the equations also enforce the integ- 
rated conservation laws, they do capture the “jump” conditions across shocks correctly. Shocks tend to form in 
most transonic and supersonic flows because sustained deceleration and compression in supersonic regions 1s 
usually not possible, especially when the deceleration goes all the way from supersonic to subsonic. There are 
exceptional situations in which sustained gradual deceleration can occur, but they are not common in practical 
situations. In most ordinary situations, infinitesimal compression waves in a supersonic region tend to pile up 


on each other and to coalesce into a shock. Details on this process can be found in Shapiro (1953) or Liepmann 
and Roshko (1957). 

Shocks in steady flows can be locally perpendicular (normal) or oblique to the flow. Figure 3.11.1 illustrates 
several examples of steady flowfields with shocks. In Figure 3.11.la we have a transonic airfoil flow, with a 
“bubble” of supersonic flow over the upper surface, terminated by a weak shock that is generally curved and 
slightly oblique. This global flow pattern is typical of jet airliner wings in high-speed subsonic cruise (The sim- 
plified sketch omits the details of the interaction between the shock and the boundary layer, which we'll discuss 
in Section 7.4.8). In Figure 3.11.1b we have flow over a sharp wedge or cone at a sufficiently high superson- 
ic freestream Mach number to produce an attached oblique shock propagating from the nose. In this case, the 
flow downstream of the shock 1s still supersonic. In Figure 3.11.lc we have supersonic freestream flow past a 
blunt-nosed body with a detached, curved bow shock standing off the nose. In this case, there is a “bubble” of 
subsonic flow downstream of the part of the shock that is close to normal, and supersonic flow downstream of 
the more oblique parts of the shock. Of course, more complicated patterns are possible, especially in 3D, with 
shocks from different parts of a body intersecting, or with shocks being affected by expansion waves from other 
parts of the body. In Section 3.11.2, we'll discuss the local relationships the flow through a shock must obey. 


Figure 3.11.1 Examples of flowfields with compression shocks. (a) A transonic airfoil flow with a bubble of 
supersonic flow over the upper surface, terminated by a shock. (b) Supersonic flow over a sharp wedge or 


cone, with an attached oblique shock. (c) Supersonic flow over a blunt-nosed body with a detached bow shock 
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With or without shocks, compressible flow is inherently more complicated than incompressible flow. The full 
system of equations for compressible flow, even if assumed inviscid, is nonlinear and entails all the difficulties 
we discussed in Section 3.5. Except for a very few simple cases, analytic solutions for flows in more than one 
dimension are not possible, and the prediction of a multidimensional flowfield generally requires large-scale 
computation. 

But all is not lost. There 1s still a great deal that can be learned about steady multidimensional compressible 
flows without resorting to computation, based on the combination of continuous 1D flow theory and the re- 
lations governing normal and oblique shock waves. Most real flows are not close to being 1D, and the 1D 
flow theory/shock relations are therefore not globally predictive, but they are still powerful tools because they 
provide simple ways to calculate useful relationships between the various flow quantities, applicable along 
streamlines of steady multidimensional flows. Their general usefulness is thus analogous to that of Bernoulli's 
equation for incompressible flow. 


3.11.1 Steady 1D Isentropic Flow Theory 


If the flow 1s steady, the flow in any given slender streamtube in a flowfield is effectively 1D. If we assume that 
the flow is inviscid and isentropic, as it usually would be outside the boundary layer near solid surfaces, and 
that the fluid is a perfect gas, then the 1D equations of motion can be integrated analytically. Note that these 
assumptions satisfy the requirements for the compressible Bernoulli equation (Equation 3.8.6) to hold, so that 
stagnation conditions are constant along our streamtube. Remember also that if we further assume that the fluid 
is homocompositional, such that Crocco's theorem (Equation 3.8.8) holds, and that the onset flow is irrotation- 
al, then the stagnation conditions will be the same in all of the streamtubes in the field, at least upstream of all 
shocks. Downstream of any shock, the stagnation pressure is reduced, as we'll see next. 

The result of the steady 1D flow theory for a perfect gas is a set of relationships defining dimensionless ratios 
of flow variables. This set of relationships can be expressed in a variety of ways. An exhaustive compilation 
is given in NACA Report 1135 (Ames Research Staff, 1953). Derivations can be found in Shapiro (1953) and 
Liepmann and Roshko (1957). 

In a commonly used form of these relations, the local temperature, pressure, and density are related to their 
stagnation values, the streamtube area and flow velocity are related to conditions that would apply at Mach one, 
and everything is expressed in terms of local Mach number: 
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where A is the streamtube area, a is the speed of sound, and the star refers to conditions that would apply at 
Mach 1. These relations are plotted in Figure 3.11.2, and they highlight several very useful facts regarding 
steady inviscid flow of a perfect gas: 
1. Streamtube area is at a minimum, and therefore mass flux per unit area is maximum, at a local Mach 
number of one. Thus for a flow to pass from a subsonic condition to supersonic, it must pass through 
Mach one at a local minimum in streamtube area. If geometric constraints force the flow area to go 
through a minimum at some location, as is the case in flows in confined passages or “ducts,” a maximum- 
mass-flux flow condition at Mach one at that location is often referred to as “choked.” To produce a 
steady supersonic flow in a duct, as in a rocket exhaust or a supersonic wind tunnel, a choked nozzle 
somewhere upstream is generally required. 
2. A change in streamtube area has opposite effects in subsonic and supersonic flows. An area increase 
slows a subsonic flow down and increases the pressure and temperature but does the opposite to a super- 
sonic flow. 
3. Dimensionless ratios of flow quantities depend only on Mach number and y. Changing the /evel of 
pressure or temperature has no effect on dimensionless ratios if Mach number is kept the same. 
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Steady, 1D flow analysis can be extended to include effects of bulk-averaged heating or cooling, or viscous 
friction. Treating such effects in a 1D, bulk-averaged sense can be a reasonable approximation for flows in 
ducts that are sufficiently slender relative to their length. The inclusion of these effects means that entropy and 
total pressure will no longer be constant. A flow with simple viscous friction and no heat transfer will still have 
constant total temperature, but a flow with heat transfer will not. Analytic solutions such as those leading to 
Equations 3.11.1—3.11.5 are possible for constant streamtube area and simple friction or simple heat transfer. 


Numerical solution is generally required in cases with variable streamtube area and/or combined effects. Equa- 
tions applicable to these situations are derived in Shapiro (1953, Chapters 6—8) 


3.11.2 Relations for Normal and Oblique Shock Waves 


In solutions to the NS equations, compression shocks appear as continuous compressions in which all flow 
quantities vary smoothly. The predicted shock thickness of a bit over one mean free path is roughly correct, but 
the detailed distributions of flow quantities are not, because the NS equations don't model the molecular trans- 
port terms accurately when changes take place over such short distances. The “jumps” in flow quantities across 
a shock are accurately predicted because they don't depend on the internal structure of the shock, and the NS 
equations enforce all the relevant physical conservation laws that must be obeyed in the total jump across the 
shock. 


Note that most practical numerical solutions to the NS equations are done on grids that are far too coarse to 
resolve the correct thickness of a shock. Numerical dissipation smears shocks over a distance of at least several 
grid intervals, which is typically several orders of magnitude larger than the actual thickness of a shock, and 
the degree of smearing increases when the shock front is oblique to the grid. Still, except to the extent that the 
smearing encroaches on flow gradients upstream and downstream, jumps across the shock can be predicted ac- 
curately in spite of the smearing. 

The “shock relations” are a set of analytic formulas describing the effects of shocks in terms of the jumps 
in flow quantities from upstream to downstream. In this discussion, we will consider only steady flows, but the 
results of the steady theory can in principle be transformed to apply to unsteady flow because the flow in the 
neighborhood of a shock can be rendered locally steady by a Galilean transformation that cancels the motion of 
the shock normal to itself. The shock relations are “1D” in that they describe what happens only to an individual 
streamtube as it goes through the shock, and in this sense they are similar to the 1D isentropic flow relations 
we described in Section 3.11.1. The only multidimensional consideration that comes into play is turning of the 
streamtube through the shock, which must be accounted for when the shock is oblique to the local flow direc- 
tion. Because jump conditions through a shock do not depend on the detailed internal structure of the shock, the 
shock relations can be derived in an essentially inviscid framework. 


To derive the shock relations, we enforce conservation of mass, momentum, and energy across the shock 
discontinuity. Conservation of the component of momentum tangential to the shock requires that the jump in 
tangential velocity is zero. Because the tangential velocity is the same upstream and downstream, it can be re- 
moved by a Galilean transformation, and any shock can be viewed as a normal shock. The derivation is there- 
fore carried out in two stages. First we derive the jump conditions for a normal shock, and then we derive the 
jump conditions for an oblique shock by transforming the normal-shock relations (1.e., by reversing the Galilean 
transformation that removed the tangential velocity). 

For a normal shock, once we enforce conservation of mass, momentum, and energy; assume a perfect gas; 
and stipulate that entropy cannot decrease, the relationships between downstream and upstream flow quantities 
are uniquely defined. The resulting relations are plotted in the form of dimensionless ratios in Figure 3.11.3. A 
shock obeying these relations is often referred to a Rankine-Hugoniot shock (see Liepmann and Roshko, 1957). 
The following general characteristics of normal shocks are noteworthy: 

1. The Mach number jump is always from supersonic to subsonic, with accompanying increases in pres- 
sure, temperature, and density, and a decrease in velocity. 


2. Total temperature and total enthalpy are unchanged, but because passage through a shock is a dissip- 
ative process, there is a loss in total pressure and an increase in entropy. A jump from subsonic to su- 


personic (an “expansion shock”) would require a decrease in entropy and thus violate the second law of 
thermodynamics. 

3. The “strength” of a shock, as indicated by the loss in total pressure, increases rapidly with upstream 
Mach number. In applications where low drag is important, really strong shocks are to be avoided by 
design, and the strength of relatively weak shocks becomes an important issue. An important fact to re- 
member in that regard is that for upstream Mach numbers near 1.0 (the weak shock limit), the total-pres- 
sure loss goes as (M — 1)° , 

4. Because of the loss of total pressure, the minimum area, A*, through which the flow can be forced, is 
increased by passage through the shock. 

5. Dimensionless ratios between downstream and upstream are unique functions of either upstream or 
downstream Mach number and y. 


Figure 3.11.3 Normal-shock relations 
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Of course, the normal-shock relations apply to oblique shocks as well, provided we resolve the velocity and 
Mach number in the direction normal to the shock. So the basic physics of an oblique shock is no more com- 
plicated than that of a normal shock. With oblique shocks, however, there is a surprising degree of apparent 
additional complexity, introduced by what amounts to only one additional free parameter in the problem (the 
tangential velocity). Across an oblique shock, the normal velocity is reduced, and the tangential velocity is un- 
changed, so that the flow is turned, or deflected, through some angle as it passes through the shock. For a given 
upstream Mach number, which must be supersonic, there is a maximum deflection angle. For any deflection 
angle less than the maximum, there are two possible shock inclination angles and two different possible down- 
stream flow conditions. For deflection angles close to the maximum, both of the possible downstream condi- 
tions are subsonic, as would be the case with a normal shock, but for deflection angles less than the maximum 
by more than a fraction of a degree, one of the possible downstream conditions is supersonic. These complex- 


ities are best understood through study of the various oblique-shock charts in Shapiro (1953) or NACA Report 
1135 (Ames Research Staff, 1953). 


The normal and oblique shock relations we've just discussed are essentially 1D, but they apply in multidi- 
mensional flows. When supersonic flow is present in steady-flow solutions of the multidimensional inviscid 
equations (the Euler equations), shocks are usually also present and appear as surfaces imbedded in the field, 
across which flow quantities are discontinuous. Flow quantities can also be discontinuous across imbedded vor- 
tex sheets, or singularities can exist at discontinuities in the slope or curvature of bounding surfaces, but every- 
where else in the field flow quantities are continuous and differentiable, and the inviscid equations of motion are 
satisfied. The locations and shapes of imbedded shock surfaces are not known or specified a priori; all that can 
be specified is that if a shock arises as part of the solution, the jump across it must satisfy the shock relations. In 
mathematical jargon, solutions with such imbedded surfaces of discontinuity are called “weak” solutions. Note 
that the Euler equations, which are satisfied in the regions of continuous flow, are not sufficient to determine the 
solution if any shock discontinuity is present and that the shock relations must be imposed explicitly across the 
discontinuity. In numerical solutions, shock discontinuities can either be represented explicitly (shock fitting) 
or approximated automatically in a somewhat smeared-out way in the numerical grid (shock capturing). We'll 
discuss these options further in Section 10.4.1. 


Chapter 4 


Boundary Layers 


In Section 3.6, we saw that the basic effect of viscosity is to exert shear stress, and that when that is combined 
with a no-slip condition at a solid surface, it leads to vorticity that originates at the surface, 1s convected down- 
stream, and diffuses outward. (In Section 4.2.4, we'll look at this in detail and see that some parts of a surface act 
as sources of vorticity, and others act as sinks, but that the net result 1s that there is always some vorticity con- 
vected along next to a solid wall.) In Section 3.8.2, we saw that irrotational flow tends to remain irrotational until 
viscosity has had a chance to work on it. The upshot is that flows around bodies at high Reynolds numbers ac- 
quire a natural global structure: an inner vortical region where viscosity is important, consisting of the boundary 
layer next to the surface and the wake downstream, surrounded by an outer flow that is effectively irrotational and 
that behaves as if it were inviscid. This basic pattern was not generally understood until Prandtl (1904) explained 
it and proposed his approximate theory for the flow in the boundary layer. 

In this chapter, we'll take a detailed look at the physics of the flow in the boundary layer, in preparation for a 
more global discussion of the whole flowfield in Chapter 5. This ordering of the discussion is convenient because 
it turns out that the boundary layer and the outer flow interact only through a relatively simple set of boundary 
conditions at their interface, and everything of interest in the physics of the boundary layer can be discussed in 
general terms without our having to know much about the outer flow. On the other hand, our later discussion 
of the global flow will refer often to what goes on in the boundary layer. Boundary-layer specialists like to joke 
(only half joke, actually) that the boundary layer is the most important part of the flow because it's the only part 
that touches the body. 

When the Reynolds number is sufficiently high, the boundary layer remains relatively thin, unless it separates 
from the body “prematurely,” that 1s, ahead of the tail of a body or the trailing edge of a wing. Within the thin at- 
tached boundary layer, Prandtl's simplified versions of the Navier-Stokes (NS) equations often apply, along with 
special boundary conditions through which the boundary layer and the outer inviscid flow interact. These equa- 
tions and the many methods for solving them constitute the subdiscipline of fluid mechanics called boundary- 
layer theory. The terminology can be confusing, for example, when it is sometimes implied that the viscous flow 
near the surface ceases to be a “boundary-layer flow” when it fails to satisfy the assumptions of boundary-layer 
theory. We won't take that narrow view here. Most of the physical boundary-layer phenomena we'll discuss in this 
chapter don't depend on the validity of boundary-layer theory. However, the theory does provide helpful insights 
into why boundary layers behave as they do, and it is sufficiently accurate, enough of the time, that it is useful 
for quantitative analysis as well. And, of course, the theory played a huge role in the history of our understanding 
of viscous effects at high Reynolds numbers. 

In this chapter, we'll limit our attention to steady boundary-layer flows. We'll get to the specifics of the theory 
in Section 4.2, after discussing the general physical aspects of 2D and 3D boundary-layer flows in Section 4.1. 
In much of the literature, 2D boundary layers and 3D boundary layers are treated as separate universes. To em- 
phasize the common ground these two universes share, we'll take a different tack here, and in the discussion in 
Sections 4.1 and 4.2, we'll keep 2D and 3D integrated. Topic by topic, we'll introduce ideas in 2D and then dis- 


cuss what has to be added to go to 3D. Then, in the remaining sections, we'll deal with transition and turbulence, 
control of flow separation, heat transfer, compressibility, and surface roughness. 


4.1 Physical Aspects of Boundary-Layer 
Flows 


The boundary layer is a thin sheet of flow close to the surface, so it's natural to imagine following its progress 
along the surface and using time-like terminology to describe its development even when the flow is actually 
steady. And we'll use this kind of time-like terminology to refer to the boundary layer as a whole, even though 
fluid parcels flowing along at different distances from the surface move at different speeds and thus experience 
different transit times. This is an example of the “pseudo-Lagrangian viewpoint” we discussed in Section 3.4.7. 
Imagining a steady flow in this way is especially appropriate in the case of a boundary layer, because, as we'll 
see in Section 4.2, a boundary layer flow has the character of an initial-value problem. You'll note that many 
discussions of boundary-layer flows, including those in this chapter, use time-like terminology freely. 


4.1.1 The Basic Sequence: Attachment, Transition, 
Separation 


Let's look first at the major milestones that mark the development of a 2D boundary layer as it flows along 
the surface of a body. The development starts where the flow first attaches at or near the front of the body and 
ends where the boundary layer finally leaves the body and becomes part of the viscous wake. In between, the 
boundary layer will usually undergo transition from laminar to turbulent, except in special cases at low Reyn- 
olds number, and it may also undergo intermediate separation and reattachment. 

The least complicated type of 2D attachment occurs at a stagnation point, at or near a blunt leading edge, as 
shown in Figure 4.1.la. The velocity ue just outside the boundary layer initially increases linearly in both direc- 
tions from such a stagnation point, and the boundary layer in the neighborhood is always laminar. In the region 
of linear acceleration, the boundary-layer development is described by one of the special similarity solutions 
to the equations, as described in Section 4.3.2. The thickness of the boundary layer remains constant until ue 
deviates from its initial linear distribution. 


Figure 4.1.1 Types of initial boundary-layer attachment in 2D. (a) At or near a blunt LE (b) Directly on a 
sharp LE. (c) Near a sharp LE 





If the leading edge is sharp, the situation is not always simple. For supersonic flow or for one particular angle 
of attack in subsonic flow, the flow can attach directly to the sharp leading edge, as shown in Figure 4.1.1b. 
At other angles of attack in subsonic flow, the attachment will be either above or below the leading edge, as 


in Figure 4.1.1c, with local behavior around the attachment point like that of Figure 4.1.la, and there will be a 
separation at the leading edge with at least a short bubble of recirculating flow downstream. 

The final separation of the boundary layer from a 2D body can take place either from a sharp trailing edge, 
as in Figure 4.1.2a, or from a smooth part of the surface, as in Figure 4.1.2b. Separation from a smooth surface 
raises interesting physical issues that we'll discuss further in Section 4.1.4. Between the initial attachment and 
the final separation, a boundary layer usually undergoes transition from laminar to turbulent, a process we'll 
discuss in some detail in Section 4.4.1. 


Figure 4.1.2 Types of final 2D boundary-layer separation in 2D. (a) Separation from sharp TE. (b) Single sep- 
aration from smooth surface and into the wake. (c) Double separation from smooth surface and into the wake 





(a) (b) 


Separation with subsequent reattachment can take place from a sharp leading edge, as in Figure 4.1.1c; from 
a smooth surface, as in Figure 4.1.3a,b; or from a sharp corner at the edge of a “cove,” as in Figure 4.1.3c. Ina 
smooth-wall separation bubble like that in Figure 4.1.3a, the reattachment is usually precipitated by transition to 
turbulent flow, with its associated rapid increase in shear stress. For reattachment to happen without transition, 
the separation bubble would generally need to occupy a dip in surface, as in Figure 4.1.3b, because reattach- 
ment of a laminar boundary layer requires a favorable pressure gradient, as would occur as the flow exits the 
dip. Pressure gradients also play an important role in smooth-wall separation, as we'll see below. 


Figure 4.1.3 Types of boundary-layer separation with subsequent reattachment in 2D. (a) Laminar separation 
from a smooth surface with reattachment triggered by transition to turbulent flow. (b) Laminar separation and 
laminar reattachment in a dip in the surface. (c) Separation at the sharp edge of a “cove” 
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Note that the word separation connotes two different aspects of a flow's topological structure. First, separa- 
tion in general involves some flow leaving the surface and forming a shear layer that is at least somewhat “sep- 
arated” from the surface. Second, a line of separation on the surface divides the surface into “separate” regions, 
from which the flow along the surface converges from different directions (By “flow” we are referring either to 
flow a short distance off the surface or, loosely, to the surface shear stress). This is an idea we'll return to 1n our 
discussion of 3D separation in Section 4.1.4. 





3D boundary-layer flows run the gamut from nearly 2D at one extreme to very different from 2D at the other. 
In some situations, we can view a 3D flow in cross sections and see the same patterns of attachment and sep- 
aration that we saw in 2D flows in Figures 4.1.1—4.1.3. But the additional dimension in 3D makes many other 
patterns of attachment and separation possible as well. 

Note that in 2D flows the attachment and separation points are actually lines that extend to plus and minus 
infinity in the “spanwise” direction. In 3D flows, we can have singular points of attachment and separation, and 
these are often accompanied by what we commonly call attachment lines and separation lines that can form 


complicated patterns on the surface of the body. These terms are very useful for discussions of flow patterns, 
but we should note that they aren't mathematically precise. In real flows around finite bodies, the flow struc- 
tures that we commonly call attachment lines and separation lines are not uniquely defined lines or curves in the 
mathematical sense. They are actually bands of finite width, loosely defined by either strong flow convergence 
toward the body (attachment) or strong diverge away from it (separation), but for practical purposes, they are 
often sufficiently narrow that it isn't grossly inaccurate to refer to them as “lines.” 

This issue will become clearer when we discuss what the boundary-layer velocity field looks like in more de- 
tail locally in the neighborhood of separation in Section 4.1.4. Then we'll discuss the global topology of points 
and “lines” of attachment and separation on the body surface in some detail in Section 5.2.3. 


4.1.2 General Development of the Boundary-Layer 
Flowfield 


Now let's look at the general features of the boundary-layer velocity field, time-averaged if the flow 1s turbulent. 
The velocity in the boundary layer is nearly parallel to the surface. It varies relatively slowly along the surface, 
but much more rapidly in the direction normal to the surface, in a distribution called the velocity profile. 

Typical velocity profiles for 2D laminar and turbulent flow are shown in Figure 4.1.4. In both cases, the ve- 
locity starts at zero at the surface, in keeping with the no-slip condition, and gradually approaches a distribution 
consistent with an inviscid outer flow. In the first-order theory that we'll discuss in Section 4.2.1, and therefore 
in this discussion, we'll ignore the slope of the inviscid velocity distribution in the outer flow and assume that 
the slope of the boundary-layer profile goes to zero. The boundary between the viscous boundary layer and the 
effectively inviscid outer flow is often referred to as the boundary-layer “edge,” though in fact it is indistinct. 
Any definition of the boundary-layer edge is thus arbitrary. The usual choice is the point at which the velo- 
city reaches 99% of its outer-flow value, and the term boundary-layer thickness usually refers to the distance 
from the surface to this 99% point. Because the slope of the velocity profile is small there, the determination 
of boundary-layer thickness in this way is sensitive to small errors in the determination of the velocity profile, 
especially in the turbulent case, and the difficulty is compounded if the distribution of velocity in the outer flow 
has significant slope. 


Figure 4.1.4 Typical 2D boundary-layer velocity profiles. (a) Laminar. (b) Turbulent 





In a laminar constant-property boundary layer the viscosity u is constant, so that the shear stress in 2D is just 
proportional to Ou/dy. Given the shape of a laminar velocity profile in Figure 4.1.4a, we can see that the shear 
stress starts at zero at the outer edge of the boundary layer and increases as the wall is approached. The value 
of wou/dy at the wall itself is the shear stress tw transmitted from the flow to the surface and is referred to as 
the skin friction. This terminology is a bit misleading, because it evokes an image of relative motion between 


the fluid and the surface, as in mechanical friction. But remember that with the no-slip condition at the surface, 
there 1s no relative motion, and that this 1s “friction” only in the sense that a shear force is exerted on the surface. 


In a compressible boundary layer or a turbulent boundary layer the relationship between t and Ou/Oy 1s more 
complicated, but tw is still proportional to Cu/dy at the wall. Though a turbulent boundary layer is typically 
much thicker than a laminar one, the turbulent layer has a much larger gradient Ou/Oy at the wall and thus much 
higher skin friction. We'll consider turbulent boundary-layer flow in detail in Section 4.4.2 and compressible 
flow in Section 4.6. 

A streamwise pressure gradient Op/Ox causes additional acceleration that affects the shape of the velocity 
profile. Because Op/Ox tends to be nearly the same across the thickness of a boundary layer, it contributes nearly 
the same incremental material (Lagrangian) acceleration to fluid parcels regardless of distance from the wall. In 
Equation 3.2.2, we saw that a given Lagrangian acceleration requires larger Ou/Ox when u is small than when u 
is large. Thus the bottom of the boundary layer, where u is small, responds to Op/Ox with a more rapid spatial 
rate of change of u (larger Ou/Ox) than does the outer part of the boundary layer. As a result, a positive pressure 
gradient reduces the velocity at the bottom of the profile more than at the top, as shown in Figure 4.1.5a, and 
pushes the flow closer to separation, which we'll discuss in Section 4.1.4. Because of this association with the 
approach to separation, a positive pressure gradient is often called an adverse pressure gradient. A negative, or 
favorable pressure gradient has the opposite effect, as illustrated in Figure 4.1.5b. With these changes in profile 
shape come an increase in skin friction in a favorable gradient and a decrease in an adverse gradient. 


Figure 4.1.5 Effects of pressure gradient on 2D laminar-boundary-layer velocity profiles. (a) Adverse (posit- 
ive) pressure gradient. (b) Favorable (negative) pressure gradient 
Adverse dp/dx 





Favorable dp/dx 





In Section 3.6, we discussed the general effects of viscosity and used a notional boundary-layer velocity pro- 
file as an example in Figure 3.6.2c. There we noted that the negative second derivative 6 uldy* corresponds 
to a negative shear-stress gradient Ot/Oy, which constitutes a net viscous force on fluid parcels tending to slow 
the parcels down. The boundary conditions on Ou/dy require that every boundary-layer velocity profile have at 
least one region of negative 67 u/dy* . And in fact, in the usual situation in most boundary-layer flows, negative 
6 u/dy* and negative Ot/Cy dominate, with the result that fluid parcels in the boundary layer slow down, and the 
boundary layer grows thicker as it flows along. Of course net viscous forces are not the only thing affecting this 
boundary-layer growth; the pressure gradient also plays a role. An adverse pressure gradient tends to slow fluid 
parcels down and hasten the growth the boundary layer, while a favorable pressure gradient tends to slow the 
growth down and can even reverse it. In Section 4.2.2, we'll see how these effects are quantified in terms of the 
integrated momentum balance. 

The tendency toward positive boundary-layer growth is usually quite pronounced. A boundary layer starting 
at a stagnation point as in Figure 4.1.la starts with nonzero thickness, and it is not uncommon for the thickness 


to grow by a couple orders of magnitude over the length of a body. Streamlined bodies usually have regions of 
adverse pressure gradient over their aft portions that contribute strongly to the overall boundary-layer growth. 


The general tendency of viscous forces to slow fluid parcels down in the boundary layer has an important ex- 
ception, and that is at the bottom of a boundary layer in an adverse pressure gradient. Note that the velocity pro- 
file in the adverse pressure gradient in Figure 4.1.5a has a positive second derivative 6 uldy* close to the wall 
and a negative second derivative farther out, with an inflection point in between. When we get to the quantitat- 
ive theory, our discussion in connection with Equation 4.2.7 will explain why this is, but for now the important 
point is that positive 6 uldy* at the bottom of the boundary layer corresponds to a positive shear-stress gradi- 
ent Ot/Oy, which constitutes a net viscous force on fluid parcels pushing them along in the flow direction rather 
than impeding them. This “favorable” viscous force is the main mechanism by which the flow at the bottom of 
the boundary layer resists being slowed in an adverse pressure gradient and thus resists separation, which we'll 
discuss further in Section 4.1.4. 

A boundary layer flow must obey conservation of momentum, both locally and in an integrated sense, 
something we'll discuss in detail in Section 4.2.2. One result of this is that a boundary layer maintains a 
“memory” of what it was subjected to upstream, and this memory typically persists over some distance down- 
stream. As an example, consider two turbulent boundary-layer flows, A and B, that are subjected to the same 
outer flow and differ only in that flow B is subjected to a short patch of surface roughness near the upstream end, 
that 1s not present in flow A. For reasons we'll discuss in Section 6.1.8, the roughness 1n flow B will increase 
the skin friction locally and thicken the boundary layer, relative to flow A. Conservation of momentum requires 
that the additional boundary-layer thickness in flow B persists downstream for some distance, but not forever. 
Downstream of the roughness patch, the skin friction in flow B will be lower than that in flow A because of the 
increased boundary-layer thickness. The boundary layer thickness in flow B will therefore grow more slowly 
and asymptotically settle back toward the thickness in flow A. Thus when a boundary-layer flow is perturbed 
in some way, it “remembers” the perturbation and then gradually “forgets.” We'll see a computational example 
that illustrates this effect in Figure 6.2.4 in connection with a more detailed discussion of surface roughness. 

Note that although the idea of “memory” is applicable in boundary-layer flow, “premonition” is not. In at- 
tached boundary-layer flow, “influence” is heavily skewed in the downstream direction, that is, flow conditions 
at one location along the surface strongly influence what happens downstream but have only very weak influ- 
ence upstream. In Sections 4.2.1 and 4.2.2, we'll see that in the idealized theory for both 2D and 3D flows the 
direct upstream influence is predicted to be zero. 

In 3D boundary layers, the velocity profile takes on an additional dimension, becoming a two-component 
vector function. An instructive way of visualizing a 3D velocity profile is to resolve it into components parallel 
and perpendicular to the outer flow, as shown in Figure 4.1.6. The component in the direction of the outer flow 
at the local boundary-layer edge is called the streamwise profile, and it looks qualitatively like the velocity pro- 
file in a 2D boundary layer, as in Figures 4.1.4 and 4.1.5. The component in the direction perpendicular to the 
local outer flow is called the cross-flow profile and is different from the streamwise profile in several ways. 
First, the cross-flow velocity goes to zero by definition at the boundary-layer edge. Second, cross-flow profiles 
are of different types depending on what “drives” them. If the wall is in motion so as to provide a shearing 
action in the cross-flow direction, as for example, on a propeller spinner, the cross-flow profile appears as in 
Figure 4.1.7a and is said to be shear-driven. If the wall is stationary, the cross-flow velocity must go to zero 
there, and the cross-flow profiles appear as in Figure 4.1.7b,c. Cross-flow of this type is said to be pressure 
driven, because a pressure gradient in the cross-flow direction is required to set it in motion. And, of course, it 
is possible for cross-flow to be shear driven and pressure driven simultaneously. 


Figure 4.1.6 Isometric view of a 3D velocity profile and its resolution into streamwise and cross-flow com- 
ponents 
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Figure 4.1.7 Profiles of cross-flow velocity uc in a 3D boundary layer. (a) Shear-driven cross flow produced 
by motion of the wall. (b) Pressure-driven cross flow increasing. (c) Pressure-driven cross-flow profiles revers- 


ing, including one profile of the cross-over type 
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Given the essentially inviscid momentum balance that pertains in the outer flow, a pressure gradient in the 
cross-flow direction requires outer-flow streamline curvature in the cross-flow direction. So pressure-driven 
cross flow is always associated with situations in which there is outer-flow streamline curvature in the cross- 
flow direction. Because the cross-flow direction is parallel to the local body surface, “curvature in the cross- 
flow direction” refers to the curvature of the streamline as viewed in the local tangent plane of the body surface, 
as distinct from the part of the curvature that is due to the curvature of the surface itself. We'll explore what this 
entails in greater detail when we address the theory in Section 4.2.1. The simple way to think of it 1s that it is 


the “lateral” component of the outer-flow streamline curvature as seen in a local “plan view” that is associated 
with pressure-driven cross flow in a 3D boundary layer. 

The effect of the cross-flow pressure gradient on the flow is similar in some ways to the effect of a stream- 
wise pressure gradient that we discussed above in connection with 2D flow in Figure 4.1.5. Like a streamwise 
pressure gradient, the cross-flow pressure gradient tends to have nearly the same strength regardless of depth in 
the boundary layer, and it has stronger effects on velocity gradients in the low-velocity fluid deep in the bound- 
ary layer than it does in the outer flow. These effects can take the form of rapid changes in flow direction, as, 
for example, when a boundary layer with little cross-flow flows into a region with a strong cross-flow pressure 
gradient. In this situation the cross-flow rapidly increases, as in Figure 4.1.7b, and the streamline curvature deep 
in the boundary layer is much greater than in the outer flow. 

Just as we saw in 2D, these effects of pressure gradient in 3D are resisted by viscosity. As we saw for a 
2D boundary layer, the tendency of an adverse pressure gradient to slow the flow is resisted by viscous forces 
produced by the positive second derivative of the velocity profile close to the wall (see Figure 4.1.5a). In a 
3D boundary layer with a cross-flow pressure gradient, the tendency of the cross-flow to increase as in Figure 
4.1.7b is resisted by viscous forces produced by the negative second derivative of the cross-flow profile typic- 
ally spanning a region starting at the wall and including the peak of the cross-flow profile. The growth of the 
cross-flow profile often stops when these two tendencies come into equilibrium. 

Of course, inertia also plays a role in the evolution of the cross-flow profile. If the cross-flow pressure gradi- 
ent disappears, the cross-flow profile lags behind and persists for some distance downstream. If the cross-flow 
pressure gradient reverses sign, the cross-flow profile reverses first at the bottom of the boundary layer and goes 
through an intermediate stage with a profile of the cross-over type illustrated in Figure 4.1.7c. Because it is a 
transient state accompanying a reversal in sign, the cross-flow velocity magnitudes associated with a cross-over 
profile tend to be small. 

The distribution of flow direction in the boundary layer can be expressed in terms of a direction profile, 
measured by the flow angle fs relative to the outer-flow direction. The direction profile consistent with a 3D 
velocity profile like that of Figure 4.1.6 is shown in Figure 4.1.8. In the limit as y approaches zero, this flow 
direction is the same as the direction of the shear stress at the wall, Bsw, as indicated in Figure 4.1.6. 


Figure 4.1.8 The direction profile in a 3D boundary layer, in terms of fs, the flow angle relative to that at the 
edge 
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Curves constructed parallel to the wall-shear-stress direction are called skin-friction lines, or limiting stream- 
lines, or even wall streamlines, though this last term is misleading, because the velocity at a stationary wall is 
zero, and no real streamline can be defined. Figure 4.1.9 shows skin-friction lines and outer-flow streamlines in 
a 3D turbulent boundary layer on a flat wall of a curved duct of rectangular cross-section. The turning of the 


outer flow is accompanied by a radial pressure gradient that forces the flow deep in the boundary layer to turn 
inward much more strongly than the outer flow does, as we would expect based on Figure 4.1.6. 


Figure 4.1.9 Skin-friction lines and outer-flow streamlines in a 3D turbulent boundary layer on a flat wall of a 
curved duct of rectangular cross-section, plotted from measurements by Vermeulen (1971) 
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The presence of cross flow in a 3D boundary layer often significantly affects the momentum transport and 
thus the development of the flow compared with what it would be 1n a 2D boundary layer subjected to the same 
streamwise pressure distribution. In a 2D boundary layer, the streamwise momentum deficit is convected in only 
one direction: It comes from upstream and is carried downstream. In a 3D boundary layer, convection is in the 
direction of the local flow, which varies through the boundary layer, and cross-flow thus plays a direct role in 
the development of the flow, by transporting momentum “laterally.” Convergence or divergence of the flow in 
the boundary layer also plays an important role. Figure 4.1.10 illustrates what convergence and divergence look 
like in cross-flow profiles at locations that are a short distance apart in the cross-flow direction. Note that con- 
vergence and divergence don't require a change in sign of the cross-flow velocity, just an increase or decrease. 
The cross-flow gradient associated with convergence or divergence affects the velocity component normal to 
the wall through continuity, and thus affects momentum transport indirectly. 


Figure 4.1.10 Examples of cross-flow convergence and divergence. (a) Divergence to either side of a location 


with zero cross flow. (b) Divergence in cross-flow all of one sign. (c) Convergence to either side of a location 
with zero cross-flow. (d) Convergence in cross-flow all of one sign 
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The cross-flow gradient also transports mass, which can alter the displacement effect of the boundary layer, 
as we'll see in Section 4.1.3. And the additional degree of freedom in 3D boundary-layer flowfields opens up 
the possibilities regarding how the flow can separate from the surface, as we'll see in Section 4.1.4. 

The oil-flow-visualization technique is often used in wind-tunnel testing to visualize the skin-friction lines. 
Oil is applied to the model surface, and the shear stress drags it along, forming streaks in the direction of the 
shear-stress lines. The streaky pattern can be made visible by pigment in the oil or by fluorescent dye illumin- 
ated by ultraviolet light. Figure 4.1.11 1s an example of a fluorescent-oil-flow photo of a swept wing in a wind 
tunnel. (We'll discuss the specifics of swept-wing boundary layers in Section 8.6.2.) In general, such photos 
are useful for diagnosing 3D boundary-layer separation patterns, which we'll discuss in detail in Sections 4.1.4, 


5.2.2, and 5.2.3. It can also give an indication of the kind of cross-flow convergence or divergence we discussed 
above. 


Figure 4.1.11 Fluorescent oil-flow photo of a swept wing in a wind tunnel 





As useful as it is, however, oil-flow visualization has a downside: The streaks can give the misleading 1m- 
pression that they represent the general flow direction over the surface. We should always keep 1n mind when 
looking at oil-flow pictures that the streaks represent only the surface-shear-stress direction and that the flow 
direction can be very different only a very short distance above the surface, as indicated by the direction profile 
Figure 4.1.8. The change in direction above the surface can be especially rapid in a turbulent boundary layer in 
a strong pressure gradient, as, for example, near the trailing edge of the wing in Figure 4.1.11. The general flow 
over the surface there doesn't turn outboard nearly as strongly as the oil streaks indicate. The exaggerated turn- 
ing of the streaks in oil-flow pictures often leads observers to overestimate the importance of “spanwise flow” 
in swept-wing boundary layers, something we'll discuss further in Section 8.6.2. 


4.1.3 Boundary-Layer Displacement Effect 


Flow separation like that depicted in Figure 4.1.2b obviously alters the effective shape of the body as seen by 
the outer flow. What is less obvious is that even an attached boundary layer has a displacement effect, though it 
tends to be much more subtle than the effect of separation. 

We've seen how the no-slip condition and viscous diffusion act in combination to slow the flow in the bound- 
ary layer, compared with a corresponding inviscid flow. It follows from the reduced velocity that streamtubes 
within the boundary layer are thicker than they would have been in the inviscid flow. As a result, the flow out- 
side the boundary layer is displaced outward, away from the body, relative to what would happen in the inviscid 
case. The effect of this attached-flow displacement effect on the outer flow varies widely depending on how 
sensitive the outer flow is to small changes in the effective shape of the body. For example, boundary-layer dis- 
placement can have dramatic effects on airfoil pressure distributions in transonic flow, as we'll see in Section 
7.4.8. But even when the effect on the surface pressures is subtle, it can make a significant contribution to the 
viscous drag of a body. We'll look at this contribution to pressure drag further in general in Sections 6.1.3 and 
6.1.5, and specifically in the case of airfoils in Section 7.4.2. 

Note that we just described the displacement effect as the displacement of the outer flow relative to the ideal 
inviscid flow that would adhere to the contour of the body. In the viscous case, the displaced outer flow acts 


like a fictitious inviscid flow around a body whose contour has been displaced outward by some amount. In the 
theory, this fictitious inviscid flow is called the equivalent inviscid flow. Outside the boundary layer and vis- 
cous wake, the equivalent inviscid flow is the same as the actual outer flow, while in the region occupied by the 
boundary layer in the real flow, it 1s an inviscid extrapolation of the outer flow. 

The most widely familiar measure of the displacement effect is the displacement thickness, 5, which varies 
along the surface, mostly increasing in the flow direction, and reflects how far outward the boundary layer has 
displaced the effective surface as seen by the equivalent inviscid flow. We'll look at using 6 as way to quantify 
the effect in Section 4.2.3. In 2D flow, the slowing of the flow in the boundary layer generally results in positive 
5 , except in cases with strong cooling at the surface, which can produce a sink effect that overrides the slowing 
effect. 

So, if we shift the body contour outward by the distance 5 , we define a new body, the equivalent body seen 
by the equivalent inviscid flow. But what determines how ine 5 must be to have the required effect? First, 
the equivalent body surface must meet the flow-tangency requirement, that is, 1t must be a stream surface of 
the equivalent inviscid flow. But this requirement by itself is not sufficient to determine a unique value of 5 
because the equivalent inviscid flow has an infinity of stream surfaces where a solid surface could be placed, 
all presumably having the same effect on the remainder of the outer flow. How do we choose which of these 
possible stream surfaces is the “right” one for our purposes? To illustrate the problem, Figure 4.1.12 shows 
schematically the streamlines of the equivalent inviscid flow around a 2D airfoil-like body. It is clear that there 
is only one streamline that splits at a closed “leading edge” and becomes two streamlines that pass the body 
on opposite sides (shown as heavier lines than the others). If we choose streamlines farther from the body than 
these, the equivalent body must extend upstream to infinity, and if we choose streamlines closer than these, the 
equivalent body must spew mass out of part of its leading edge. These other options would work, in principle, 
but they'd be inconvenient, to say the least. 


Figure 4.1.12 Schematic of streamlines of the equivalent inviscid flow around a 2D airfoil-like body. Only the 
streamline indicated by the heavier curve forms an equivalent body that is closed at the front and is therefore 
the preferred streamline to define the displacement thickness of the boundary layer of the actual viscous flow. 
Distances from the body are exaggerated for clarity 








The preferred equivalent-outer-flow streamline forms a blunt nose that mimics the shape of the body. The 
displacement thickness is therefore defined and nonzero at the stagnation point. We'll look at how this works in 
more detail in Section 4.2.3. 

Thus the preferred choice of equivalent-outer-flow streamline to define 8 is the one for which the equivalent 
inviscid flow has no mass-flow missing (because the equivalent body extends upstream to infinity) and has no 
extra mass- -flow spewing from its leading edge. With this definition of 5 , the equivalent inviscid flow between 
the 8 surface and any point outside the boundary layer has the same mass flux as the actual viscous flow does 
between the wall and the same point. In Section 4.2.3, this will be our basis for defining 5 in 2D asa function 
of the local velocity profile. 


Now note that the equivalent-inviscid-flow streamlines defining 5° behind the body “neck down” somewhat, 
but never close off. This reflects the fact that the viscous wake downstream of the body always carries some 
velocity deficit and therefore retains some displacement effect. 

The simple mass-flux argument we made using Figure 4.1.12 to define 5 in 2D flow doesn't apply in general 
in 3D. In 3D flows, cross flow can have a major influence on the displacement effect and can decouple it from 
the local streamwise velocity profile, so that while 2D 8 can be inferred from the local velocity profile, 3D 5 
cannot. Cross-flow convergence can “pile up” fluid in the boundary layer and increase the displacement thick- 
ness. Likewise, cross-flow divergence carries fluid away laterally and decreases the displacement thickness. The 
general slowing of the flow in the boundary layer still tends to produce mostly positive 5 , In the absence of 
strong cooling. In limited regions of strong cross-flow divergence, however, 5 can be nesativd In this situation, 
the boundary-layer fluid that has been carried away laterally by the divergence must be replaced by the outer 
flow, and it then appears to the outer flow as if the effective body contour has been locally carved away rather 
than thickened. Of course, the fluid that is carried away from a region of divergence has to go somewhere, and 
as a result, regions of negative 5° are always flanked by areas of unusually large positive 5 

The general interpretation of the 8 surface as an effective solid-wall boundary for an equivalent inviscid 
flow is a useful mental model, but it is sometimes leads to misunderstanding. The 5° surface represents an ef- 
fective solid wall only for the flow situation that produced the boundary-layer flow with that particular distri- 
bution of 8 . In a different flow situation, say because some part of the body geometry elsewhere changes, the 
entire flow will change, including the 5 surface, and the flow will not respond as if the original 8° surface were 
a solid surface. 

A misunderstanding in this regard has arisen in connection with wind-tunnel half-model testing. This is the 
testing of a model of half of an airplane mounted on a solid wind-tunnel wall that is supposed to enforce a 
symmetry-plane boundary condition, so that the flow is equivalent to symmetrical flow around a full model. 
In the idealized situation, the nominal symmetry plane of the model coincides with a flat wall of the tunnel, 
as illustrated in Figure 4.1.13a. In inviscid flow, this arrangement provides perfect simulation of the full-mod- 
el flow. In a real viscous flow, however, the symmetry-plane boundary condition is rendered imperfect by the 
boundary layer on the tunnel wall. An idea that is intuitively appealing, and that has frequently been put into 
practice, is that using a “standoff” spacer to move the model off the wall by a distance equal to 5 of the empty- 
tunnel boundary layer, as illustrated in Figure 4.1.13b, 1s the right thing to do on physical grounds, since the 
8 surface should be the effective location of the solid wall. But this is erroneous physical reasoning. Although 
the empty-tunnel 5 represents an effective solid-wall boundary condition for the empty tunnel, it doesn't do so 
in the presence of flow changes introduced by the model. A computational fluid dynamics (CFD) study (Mil- 
holen, Chokani, and McGhee, 1996) looked at a range of standoff heights and several strategies for controlling 
the tunnel-wall boundary layer. The results indicated that the best simulation of full-model conditions should 
be achieved when no model offset is used, and the wall boundary layer is thinned by tangential jet blowing just 
upstream of the model. (The authors did not calculate the case of zero height, but extrapolation of their results 
indicates that it would be best.) 


Figure 4.1.13 Schematic illustrations of a half model mounted on the wall of a wind tunnel. (a) Nominal sym- 
metry plane of the model coincides with a flat, solid wall of the tunnel. (b) A spacer is used such that the nom- 
inal symmetry plane of the model is spaced away from the wall at the empty-tunnel 6 surface, which 1s often 


erroneously thought to provide a better reflection plane 
AV 
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4.1.4 Separation from a Smooth Wall 


So far, we've seen what 2D separation from a smooth wall looks like topologically in Figure 4.1.2b. Now we'll 
look at the physics of separation. First, what determines whether a boundary layer separates, and if so, where it 
separates? 

Whether the boundary layer separates or stays attached is determined by what happens to the low-velocity 
fluid at the bottom of the boundary layer. In an attached-flow region, all of the fluid is moving in the general 
direction of the outer flow, which, if we take the outer-flow direction to be positive, means that the velocity in 
the entire boundary layer is positive, except at the wall itself, where it must go to zero. This, of course, requires 
the slope of the velocity profile to be positive at the wall. Just downstream of separation, there must be a region 
of reverse flow next to the wall, which requires a negative slope of the velocity profile at the wall. So going 
from attached flow to separated flow requires a decrease in the slope of the velocity profile at the wall from 
positive to negative, and the slope must go through zero at the separation point. This sequence 1s easiest to see 
and understand in the case of laminar flow, as sketched in Figure 4.1.14a. 








Figure 4.1.14 Progression of velocity profiles in a 2D boundary layer going through separation. (a) Laminar 
flow. (b) Turbulent flow 
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As we saw in Section 4.1.2, an attached boundary layer usually thickens as it flows downstream, as viscosity 
and the no-slip condition act in concert to decelerate the flow. The slope of the velocity profile at the wall thus 


naturally tends to decrease gradually. But viscosity and the no-slip condition are not sufficient to make the slope 
go through zero and become negative as in Figure 4.1.14a. For that, an adverse pressure gradient (rising pres- 
sure) is also required. How do we know that an adverse pressure gradient is needed? Note that for the negative 
velocity slope at the wall to appear, there must be a region in which the velocity profile is concave “forward” 
(positive 6*uldy’) somewhere within the boundary layer. When such a region appears, it generally appears first 
adjacent to the wall, as it does in the sequence sketched in Figure 4.1.14a. In a laminar boundary layer, or at the 
bottom of a turbulent boundary layer, positive 6" u/dy* means that the shear stress gradient Ot/Oy is also positive. 
In our discussion of the theory in connection with Equation 4.2.7, we'll establish that positive Ot/Oy at the wall 
requires an adverse pressure gradient. 

So it takes an adverse pressure gradient to cause separation of the boundary layer. But an adverse pressure 
gradient also activates a mechanism by which the boundary layer tends to resist separation, enabling it to remain 
attached at least for some distance into the region of adverse pressure gradient. As we saw above, an adverse 
pressure gradient results in a positive Ot/Oy close to the wall. Of course, Ot/Oy is the net viscous force on a fluid 
parcel (see the discussion in Section 3.6 and the flow examples in Figure 3.6.2), and positive Ot/Oy thus consti- 
tutes a net viscous force opposing the pressure gradient. In effect, the fluid at the bottom of the boundary layer 
experiences a “favorable” viscous force. One way to think of it is to imagine the higher-velocity fluid farther 
from the surface acting through the viscous stress to drag the lower-velocity fluid along, fighting against the 
pressure gradient that is trying to slow the fluid down. 

Boundary-layer separation thus involves a tug-of-war between the adverse pressure gradient and an opposing 
viscous force. At any given station along a surface subjected to an adverse pressure gradient, the favorable vis- 
cous force will generally be overmatched, and you'll see the pressure gradient winning the tug-of war, slowing 
the fluid near the wall, and reducing the velocity slope at the wall. How far the boundary layer will be able to 
persevere into the adverse pressure gradient before it separates depends on the rate at which the pressure gradi- 
ent wins and the velocity slope at the wall decreases. Although the favorable viscous force is generally over- 
matched locally, its presence is vital. As we'll see below, if it weren't for the favorable viscous force, a boundary 
layer starting into an adverse pressure gradient would separate immediately. With the favorable viscous force, 
the rate of approach to separation is finite, and the distance from the onset of the adverse gradient to separation 
is nonzero. Until the slope of the velocity profile at the wall is brought to zero, the boundary layer remains at- 
tached, just as the corresponding inviscid flow would under the same conditions. Separation occurs only when 
the adverse pressure gradient has acted over a long enough distance to produce reversal of the velocity profile. 
How long that distance is depends on a number of factors that we'll discuss in Section 7.4.3 in connection with 
the maximum lift of airfoils. 

Thus we've established viscosity as a source of resistance to separation, which seems contradictory because 
we also tend to think of viscosity as one of the main causes of separation. Without viscosity and the no-slip 
condition, a flow can remain attached over the entire length of a body, surviving the adverse pressure gradient 
all the way to an aft stagnation point without separating. But with viscosity and the no-slip condition, there must 
be a boundary layer with zero velocity at the surface, which introduces the possibility of separation when the 
flow encounters an adverse pressure gradient. So separation is a possibility only because viscosity and the no- 
slip condition have introduced a viscous velocity profile. On the other hand, once an adverse pressure gradient 
sets in, viscosity 1s the only source of resistance to separation. If you turned off the viscous stresses at the start 
of the adverse pressure gradient (1.e., switched to Euler equations with the incoming boundary-layer velocity 
profile as the upstream boundary condition), the flow would separate immediately. The flow near the wall, with 
near-zero velocity, has near-zero capacity to proceed into a pressure rise without the favorable viscous effect 
that we discussed above. So viscosity is both an enabler of separation and a source of resistance to separation. 
The key to this seeming contradiction is that the net viscous force 1s Just Ot/Oy, which is negative in most of the 
boundary-layer flowfield, slowing the fluid parcels down. Then at the start of an adverse pressure gradient, Ot/ 


Oy switches to positive at the bottom of the boundary layer and helps that part of the flow overcome the pressure 
gradient, at least for a while. This favorable viscous effect acts only in the bottom part of the boundary layer in 
an adverse pressure gradient, while viscosity everywhere else has an adverse effect. 


A laminar boundary layer cannot withstand much of a pressure rise without separating (see White, 1991, 
section 4-2), because the favorable viscous force that resists separation comes only from molecular shear stress, 
which tends to be small. The amount of pressure rise that can be withstood by a turbulent boundary layer, on 
the other hand, is much greater, because the favorable net viscous force is much stronger. At first glance it's 
tempting to think that this is stmply because the eddy viscosity and the turbulent shear stress are so much larger 
than their molecular counterparts (we discussed turbulent shear stress and the eddy viscosity in Section 3.7), but 
the correct explanation is more complicated. If the eddy viscosity were simply larger, but uniform throughout 
the boundary layer, we would have the equivalent of a laminar boundary layer at a lower Reynolds number, and 
separation resistance would be no greater. The key is that a turbulent boundary layer has a thin sublayer next 
to the wall, in which the eddy viscosity is effectively zero, and that the eddy viscosity increases rapidly with 
distance from the wall outside this sublayer. We'll look in some detail at the physics of the sublayer and the role 
it plays in the greater separation resistance of turbulent boundary layers in Section 4.4.2. 

In the regions of pressure rise that frequently occur in practical flows around bodies, a turbulent boundary 
layer is usually required if separation 1s to be avoided. Streamlined bodies, which we'll discuss in greater detail 
in Sections 5.2 and 6.1.6, must generally have a region of pressure rise at the rear, often referred to as a pres- 
sure recovery, and turbulent flow is generally required to prevent premature separation there. Even so-called 
laminar-flow airfoils (Section 7.4.6) are generally designed to have laminar flow over only part of the airfoil 
chord, with the boundary layer transitioning to turbulent before it tries to proceed too far into the region of the 
pressure recovery. 

Avoiding premature separation is important in many applications. In external flows, there are the pressure 
recoveries on airfoils and other streamlined bodies. In internal flows, ducts that serve to provide pressure recov- 
ery are often called diffusers, and they are important in propulsion inlets, wind tunnels, and many other flow 
systems. In such applications, the designer's objective is often to maximize the recovery that can be achieved in 
a given length or to minimize the length for a given recovery. In this regard, the performance of a flow device 
with a pressure recovery is strongly dependent on the details of the pressure distribution in the recovery region. 
We'll consider this issue in some detail in connection with the maximum lift of airfoils in Sections 7.4.3 and 
7.4.4, and in Section 4.5 we'll look at general strategies for delaying or preventing separation. 

Now let's look further into what happens as a 2D flow approaches separation. Velocity profiles of flows 
going through separation are illustrated in Figure 4.1.14, for laminar flow and turbulent flow. We've already 
noted that the basic flowfield topology requires that downstream of separation there be reverse flow close to 
the wall, which requires negative Ou/oy at the wall, and that the boundary between attached flow and separated 
flow is where Ou/Oy at the wall goes through zero. The surface shear stress and the skin-friction coefficient are 
therefore zero at the separation point, but we must remember that this applies only in 2D flow. 


Where Ou/Ooy goes through zero at the separation point is easy to see in plots of laminar velocity profiles, as in 
Figure 4.1.14a. In plots of mean (time-averaged) velocity profiles in turbulent flow, separation doesn't stand out 
so clearly. A turbulent velocity profile at separation can give the appearance of still having a large positive Ou/ 
Oy at the wall, because Cu/Oy can drop to zero over a very short distance from the wall, not visible on the scale 
of a plot like Figure 4.1.14b. This 1s related to the existence of the thin sublayer we mentioned above, and which 
we'll discuss in detail in Section 4.4.2. And there are other ways in which the turbulent case is complicated. 
Turbulent separation is of course unsteady on length and time scales related to the boundary-layer turbulence, 
and often on longer time scales as well. This means that turbulent separation is marked by two thresholds: the 
first appearance of intermittent reverse flow, followed downstream by reversal of the mean flow. The special 
complexities associated with separation in turbulent flow are discussed in detail by Simpson (1989). 


Separation in 3D flow is also driven by the pressure gradient, but not just by its streamwise component. We 
can still think of separation as involving the reversal of one component of the velocity close to the wall, but it 
needn't be the outer-flow-streamwise component. And a line along which separation takes place, which we'll 
call a separation line, needn't be perpendicular to the local outer flow, as it would be in 2D. 


In some situations, 1n fact, a separation line can be closer to parallel to the outer flow than to perpendicular. 
In such cases the separation is sometimes called a “cross-flow separation” (see Hirsch and Cebeci, 1977, for 
example), because it can be accompanied by reversal of the cross-flow profile. This is unfortunate terminology 
because it implies that cross-flow reversal defines the separation. Actually, cross-flow reversal occurs in many 
situations not even remotely associated with separation, and even in cases of so-called cross-flow separation, 
it isn't the cross-flow reversal that defines the separation location. Cross-flow separations are just situations in 
which the cross-flow reversal happens to occur very close to the actual separation location. 

So separation in 3D is not generally defined by a reversal of either the streamwise or the cross-flow velocity 
profile, or by zero Cf in either the streamwise or the cross-flow direction. But then what 1s it defined by? Se- 
paration in general involves some flow leaving the surface and forming a shear layer that is at least somewhat 
“separated” from the surface, and we'll look at some of the separated-flow structures that arise in 3D in Section 
5.2.2. But separation should also have a telltale signature on the surface. In 2D, that signature is zero C¢. What 
is the corresponding signature in 3D? 

In Section 4.1.1, we noted that a line of separation on the surface divides the surface into regions from which 
the flow just off the surface is converging from different directions. In the limit as we approach the wall, the 
direction of the flow just off the surface defines the direction of the surface shear stress, or the direction of the 
skin friction lines, which we defined in Section 4.1.2. A 3D separation line is thus a skin-friction line flanked by 
other skin-friction lines converging toward it from different directions, as in Figure 4.1.15. Although the mag- 
nitude of Cf is not zero, the component of Cf perpendicular to the separation line is zero, just as 1s was in 2D. But 
zero perpendicular C¢ isn't sufficient as a definition of the separation line because it is satisfied on every other 
skin-friction line as well. And the fact that other skin-friction lines converge toward it doesn't suffice either. So 
we must look at other aspects of the direction field on the surface to see what it is that makes the separation line 
different. 


Figure 4.1.15 Local view of a 3D separation line with other skin-friction lines converging toward it 
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Looking at the global pattern on the surface, we see that what distinguishes a 3D separation line is the longer 
term “history” of the skin-friction lines converging toward it: The skin-friction lines converging toward the sep- 
aration line from opposite sides “arrive” from locations on the surface that are far apart. Thus I propose as a 
working definition of a separation line that skin-friction lines converging toward it from opposite sides have 
different regions of origin. 

This isn't a mathematically rigorous definition, but we can use a math-like argument to elaborate on what it 
means. Consider two points close together on the surface, both of them on the same side of a separation line, 


and consider the skin-friction lines that “arrive” at these points from “upstream,” as illustrated in Figure 4.1.16a. 
The path followed by each of these skin-friction lines can be thought of as being a kind of mathematical func- 
tion in which the points on the surface map into the surface streamlines arriving at the points. This function 
can be said to be continuous in the sense that skin-friction lines 1 and 2 can be made arbitrarily close together 
anywhere along their length if we make points | and 2 sufficiently close together. In a strict mathematical sense, 
this kind of continuity also applies to two points on opposite sides of a separation line, as sketched in Figure 
4.1.16b, but there is a practical difference. Crossing the separation line, the rate of change of the “function” is 
much larger than it is elsewhere, as you can see qualitatively by comparing parts (a) and (b) of Figure 4.1.16. 
Practically speaking, the separation line is a near-discontinuity in the region of origin of the skin-friction lines, 
which is the most precise way of stating my proposed working definition of a separation line in 3D. 


Figure 4.1.16 Illustrations of the concept of the region of origin of skin-friction lines, as applied to two typical 
points on the surface. (a) Two points to one side of a separation line. (b) Two points flanking a separation line 
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generally characterized by zero Cr. We'll look at swept-wing separation patterns and at how the region-of-origin 
definition of separation applies to them in Sections 5.2.3.2 and 8.6.2. 

The region-of-origin way of defining 3D separation leads naturally into a discussion of the distinction 
between the two major types of separation lines illustrated in Figure 4.1.17. The separation line in Figure 4.1.17a 
divides a region where the boundary layer is fed by “clean” outer flow originating upstream of the body, from a 
region within a “closed” separation bubble. This is often referred to as a closed separation. If the bubble's foot- 
print on the surface ends at a sharp trailing edge, as on a wing, the separation line can be an actual discontinuity 
in the region of origin of the skin-friction lines. The other major type is the separation line in Figure 4.1.17b, 
which is flanked on both sides by boundary layers fed by “clean” outer flow. This type is often called an open 
separation. Note that although the open separation doesn't divide regions of the surface fed by different kinds 
of flow, it is still a near-discontinuity in the region of origin of the skin-friction lines. 
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Figure 4.1.17 Illustrations of the two major types of 3D separation lines. (a) Closed type (“bubble”). After 


Maskell, (1955). From Peake and Tobak, (1980). Published by NASA. (b) Open type (“‘free shear layer’). 
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Although Figure 4.1.17 is a good representation of the general distinction between open and closed separ- 
ation, there is one detail that is not quite right. It shows adjacent skin-friction lines joining the separation line 
tangentially but firmly, where skin-friction lines in actual flows do so only asymptotically. This 1s an issue we'll 


discuss further in Section 4.2.5. 


The idea of pressure recovery that we discussed in connection with 2D separation is also relevant to 3D sep- 
aration, but in 3D it is not just the streamwise component of the pressure gradient that is important. 3D effects 
can either increase or reduce the amount of recovery that can be withstood without premature separation. To 
take one important example, wing sweep generally reduces the amount of pressure recovery that the boundary 


layer can withstand, in absolute terms. We'll look at CFD calculations illustrating this effect in Section 8.6.2. 


4.2 Boundary-Layer Theory 


The idea of dividing the flow around a body, for theoretical purposes, into an outer inviscid flow and an inner 
viscous flow, both governed by simplified equations, was introduced by Prandtl (1904) and has been extens- 
ively developed in the years since. For decades, this approach provided the only means for making quantitative 
predictions of viscous flows at high Reynolds numbers. Even in the 1970s and 1980s, when CFD predictions of 
transonic inviscid flows became practical, calculations based on boundary-layer theory provided the only eco- 
nomical means for accounting for viscous effects. In Chapter 10, we'll discuss some of the methods by which 
coupled CFD solutions are obtained for inner and outer flow regions. In this section, we'll concentrate on the 
theory and what we can learn from it about the flow in the boundary layer itself. 


4.2.1 The Boundary-Layer Equations 


Prandtl's, 1904 derivation of the equations of motion for the flow in the boundary layer was based on physical 
reasoning and an order-of-magnitude analysis applied to the 2D NS equations. The boundary-layer equations 
can also be derived by a more formal procedure based on the method of matched asymptotic expansions (Van 
Dyke, 1964). In this procedure, the flow is divided into overlapping inner and outer regions, and the NS equa- 
tions are expanded in terms of a small parameter (R * for laminar flow). The same boundary-layer equations 
arrived at by Prandtl arise as the equations governing the first-order solution for the inner-region flow. The 
equations that arise for the outer-region flow are inviscid, as was also proposed by Prandtl. This is an inter- 
esting example of sound physical intuition being reinforced much later by rigorous mathematical analysis. The 
more rigorous later theory has provided a basis for going beyond the original theory, as, for example, in higher 
order elaborations of boundary-layer theory, and asymptotic analyses of flows in regions where the first-order 
boundary-layer approximations break down, for example, at trailing edges of plates and airfoils. 

To see how Prandtl's original 2D boundary-layer equations arise and how they relate to the corresponding NS 
equations, consider the boundary-layer flow developing along the surface of a body as shown in Figure 4.2.1. 
The flow is described in a curvilinear coordinate system with x along the body surface and y perpendicular to it, 
but because of the thinness of the boundary layer, we ignore the curvature of the surface and of the coordinate 
system. The boundary layer (the region in which the effect of viscosity is significant) is assumed to have a small 
thickness 0(x), and x derivatives of flow quantities are assumed to be much smaller than y derivatives. The 2D 
NS equations for constant-property, steady, laminar flow are given below. The strikethroughs indicate the terms 
that were eliminated by Prandtl's order-of-magnitude analysis to yield the boundary-layer equations: 
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Figure 4.2.1 2D boundary layer developing on the surface of a body, and the coordinate system used in the 2D 
boundary-layer equations 





For Reynolds-averaged turbulent flow, we would have to add a Reynolds-stress term to represent turbulent 
transport in the momentum equation, and then to have a complete set of equations, we would have to incorporate 
a turbulence model, as discussed in Section 3.7. For compressible flow (variable properties), we would also 
need to include the thermal energy equation, an equation of state, and an equation defining u as a function of 
other fluid properties (usually temperature only). The momentum and continuity equations for incompressible 
flow will suffice for purposes of this discussion. Note that Equation 4.2.3 stipulates that, according to this set 
of first-order boundary-layer assumptions, the pressure p is constant in the y direction and is thus effectively a 
function of x only. The pressure in the x-momentum equation is thus imposed unchanged across the entire thick- 
ness of the boundary layer and 1s, in effect, an environmental condition imposed on the boundary layer from the 
outside. 

So, what does the above simplification accomplish? Remember from Section 3.2 that the NS equations are 
hyperbolic-elliptic 1n space, which means that the solution at any one point depends on the solution everywhere 
else. This, combined with the nonlinearity of the equations, effectively precluded the economical computation 
of numerical solutions for most purposes, even in 2D, until the 1980s. The boundary-layer equations, on the oth- 
er hand, are parabolic because all second derivatives with respect to x have been eliminated, which means that 
they in effect represent an initial-value problem in x. Starting with an initial condition (initial velocity profile) 
at an initial x station, we can determine the solution in a one-pass marching sequence from upstream to down- 
stream, and the required computational effort is much less than that required for a corresponding NS solution. 
Economical numerical solutions to the full boundary-layer equations were obtained in the 1960s. 

And further simplifications apply in special cases. There are special situations in incompressible laminar 
flow, which we'll consider in Section 4.3.2, for which the pioneers of boundary-layer theory were able to find 
similarity transformations that reduce the boundary-layer equations from partial-differential field equations 
(PDEs) to an ordinary differential equation (ODE), greatly reducing the effort required to generate numerical 
solutions. 

Even in more general situations in which the similarity transformations don't apply, incompressible laminar 
flows can often be simplified in another way. Equations 4.2.1-4.2.4 can be transformed into a dimensionless 
form in which the Reynolds number appears only in the definition of the transformed vertical velocity v and 
not explicitly in the equations themselves (White, 1991, Section 4.2). Thus the dimensionless development of 
an incompressible laminar boundary layer can be independent of Reynolds number, provided it starts with an 


initial condition that is consistent with the Reynolds number. A common situation that meets this requirement 
is when the boundary layer starts at a stagnation point of the 2D outer flow. The 2D stagnation-point boundary 
layer 1s one of the similarity situations we alluded to above, and it scales in the right way with Reynolds number 
so that the development of the rest of the flow downstream will be independent of Reynolds number even if it 
is nonsimilar. One consequence of this is that incompressible laminar separation depends only on the pressure 
distribution, not on the Reynolds number, provided the flow starts at a stagnation point. 

In Section 4.1.4, we discussed how separation is determined by a tug-of-war between the viscous shear-stress 
gradient Ot/Oy and the pressure gradient dp/dx. So how can separation in laminar flow be independent of Reyn- 
olds number? Doesn't a change in Reynolds number change the viscous stress? Well, yes, but the change in 
boundary-layer thickness compensates for it so that the balance between the shear-stress gradient and the pres- 
sure gradient remains the same. To see how this works, consider two flows in which the body shape and the flow 
quantities p and Uc are the same, but the viscosity u is different. At comparable locations in the boundary layers 
in the two flows (same station on the body, at the half-way point in the boundary-layer thickness, for example), 
we would have t ~ ut, if Ou/Oy were the same. But Ou/Oy is not the same. In a laminar boundary layer that dis- 
plays the kind of global Reynolds-number independence we're talking about, the boundary-layer thickness 6 ~ 
haa , so that Ou/dy ~ i , and t = wou/dy ~ uw? . Then taking Ot/Oy introduces another factor of oe , so that 
Ot/Oy 1s independent of wu. 

The flow around a circular cylinder is a classic example of this kind of behavior in which laminar separation 
takes place at a fixed location in the pressure distribution, independent of Reynolds number. Assuming ideal 
potential flow as the outer-flow input (the effect of the separated wake is not accounted for), an early series 
solution predicted separation at the 108.8° location on the cylinder (see Schlichting, 1979). It has since been 
realized that the series solution converges poorly near separation, and more recent numerical solutions give sep- 
aration at 104.5° (see White, 1991). 

In the world governed by the parabolic boundary-layer equations, “information” is directly transmitted in the 
downstream direction but not the upstream direction. If the pressure distribution p(x) imposed on the boundary 
layer is held fixed, the flow at any one point within the boundary layer depends on, but has no direct influen- 
ce on, the flow at points upstream, while the flow at one point influences the flow everywhere downstream. 
Upstream influence can happen only indirectly through interaction with the outer inviscid flow, which would 
be felt through changes in p(x). There are two important points to note about this. First, the complete asym- 
metry between the directions (influence travels downstream but not upstream) reflects the fact that the boundary 
layer is largely a dissipative viscous flow and is therefore irreversible. Second, the absence of direct upstream 
influence is an idealization resulting from the theoretical assumptions: the neglect of the streamwise viscous 
diffusion term in the x-momentum Equation 4.2.1 and the assumption that pressure is an imposed environment- 
al condition that doesn't vary with y. These assumptions don't hold exactly in any real flow. But they are good 
approximations in many situations, and even in many situations in which they are significantly violated locally, 
the effects of the violation tend to be localized. For example, if a disturbance in the form of a small bump on the 
surface 1s introduced into a real boundary-layer flow, it will, of course, have some direct influence on the flow 
upstream, primarily through its disturbance pressure field, which will vary in y. But its significant upstream in- 
fluence will be confined to within a few bump lengths or heights of the bump itself, and if the bump 1s small, 
it's direct upstream influence will be limited to a short distance. Thus the idea that direct upstream influence 
in boundary-layer flows is limited, but that downstream influence can be far reaching, is a useful insight. Of 
course, this applies only to boundary layers that remain thin and attached to the surface. If the boundary layer 
separates from the surface, its upstream influence through the outer flow becomes much stronger. 

While the boundary-layer equations represent an initial-value problem in x, they also require boundary con- 
ditions, some of which can vary with x. At the inner boundary, the usual solid-wall no-slip boundary condition 
isu=v=Oat y = 0, just as it was for the NS equations. For the normal velocity v, this inner condition is all that 


we can impose, because the highest order y derivative is Ov/Oy. The outer boundary conditions on the rest of 
the solution are a little more complicated. We've already observed that the pressure is effectively one boundary 
condition imposed on the boundary layer from the outside, but we still must apply a condition to the tangential 
velocity u. Note that Ou/Oy should tend toward zero for large y, so that the x-momentum Equation 4.2.1 reduces 
to the 1D inviscid momentum (Euler) equation: 

7 Ou dp 
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with the pressure independent of y as stipulated in Equation 4.2.3. Thus for large y, u should tend toward a value 
Ue (x) that is independent of y and consistent with Equation 4.2.5. We can either impose p(x) as the “outer” 
boundary condition and let Ue (x) “fall out” as an implicit result of Equation 4.2.5, or we can impose Ue (Xx) as 
the boundary condition and use Equation 4.2.5 explicitly to determine a consistent p(x) to impose in Equation 
4.2.1. In any case, we will usually use either p(x) or Ue (x) as an outer-flow matching condition, that 1s, we 
will require it to match the distribution for some outer inviscid flow. Solving the boundary-layer equations with 
either p(x) or Ue (x) as an explicit boundary condition is referred to as the direct mode. Solving the equations 
with a boundary-layer flow variable such as the displacement thickness (which we'll define in Section 4.2.3) or 
the skin-friction coefficient Cf imposed and allowing p(x) and Ue (x) to “fall out” is referred to as an inverse 
mode. 

The terms in the streamwise momentum Equation 4.2.1 have straightforward physical interpretations easily 
related to our physical discussion of Section 4.1.2. The two convective-acceleration terms represent the steady- 
flow part of the Lagrangian acceleration of fluid parcels as they pass through the flowfield, in the manner we 
discussed in Sections 3.2 and 3.4.6, and the pressure gradient and shear-stress terms represent the internal fluid- 
stress gradients that provide the acceleration. 

Some further discussion of the order-of-magnitude argument is called for here. At first, it might seem sur- 
prising that neither of the convective acceleration terms was eliminated in going from the NS equations to the 
boundary-layer equations. After all, the boundary layer is a thin region, and because v in the boundary-layer 
coordinate system must be zero at the wall, v will be small everywhere in the boundary layer. So why can't we 
neglect vou/Oy? It turns out that although v is small, vou/oy and ucu/Ox are of the same order, as Prandtl's order- 
of-magnitude analysis showed. 

To see how this comes about, consider what determines the magnitude of v in a boundary layer. Although v 
must of course obey a vertical momentum balance, the momentum balance isn't actually operative in determ- 
ining v, because all of the terms in the y-momentum Equation 4.2.2 are negligible in the boundary-layer ver- 
sion, Equation 4.2.3. Instead, to determine v we must resort to the continuity equation. Integrating the continuity 
Equation 4.2.4 in the y direction, noting that v = 0 at y = 0, we get 
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So in boundary-layer theory, the vertical velocity v is of the same order as yOu/Ox. Furthermore, it 1s proper 
to think of v as being primarily a result of the combination of continuity and Ou/Ox, and to think of the small Op/ 
Oy in the vertical momentum balance as adjusting to accommodate the v distribution that continuity imposes. 


Now because v is of the same order as you/Ox, we must keep both convective terms in the x-momentum 
equation. And so even in a boundary-layer flow, uou/Ox isn't the only significant contributor to the Lagrangian 
acceleration. Because of the vou/Oy term, the actual Lagrangian acceleration of fluid parcels in a boundary layer 
is typically considerably smaller, in an absolute-value sense, than udu/Ox. Even though v is small, it makes an 
important contribution to the boundary-layer's streamwise momentum balance. 


At the bottom of the boundary layer, the relative importance of the terms in the momentum balance is differ- 
ent from what it is at the edge. We saw above that according to Equation 4.2.5, the momentum balance at the 
edge of the boundary layer involves only the pressure gradient and the longitudinal acceleration. At the bottom 
of the boundary layer, the situation is very different, and the gradients of the shear stress and the pressure dom- 
inate. In the limit as the wall 1s approached, the convective terms in the x - momentum Equation 4.2.1 vanish, 
and the equation reduces to 

dp Ou OT 


f - = —_—.,. 
f Oy chy 





4.2.7 tt 7 


This equation applies only to a very limited region at the bottom of the boundary layer, but it still provides 
interesting insights. It requires, for example, that in an adverse (positive) pressure gradient 6 u/dy* and 0t/Oy 
must be positive at the wall. We looked at the implications of this in the physics of flow separation in Section 
4.1.4. Because the second derivative 1s always negative in the outer part of the layer, this means that the velocity 
profile of a laminar boundary layer in an adverse pressure gradient must always have an inflection point. We'll 
see in Section 4.4 that this has implications for the stability of the laminar boundary layer. 

The 3D boundary-layer equations are analogous to the 2D Equations 4.2.1—4.2.4, with two velocity com- 
ponents, u and w, parallel to the surface, in place of just u, and an additional component of momentum to be 
accounted for. The momentum Equation 4.2.1 thus becomes a two-component vector equation, or two equa- 
tions. And in place of just one coordinate, x, along the surface, we must now have two, x and z. If the body 
surface has compound curvature, the x-z coordinate system that is laid out in the surface must be curvilinear, 
and coordinate metrics and curvature terms must be introduced. And it is often convenient to make the coordin- 
ate system in the surface nonorthogonal. In this respect, the 3D boundary-layer equations are no different from 
most implementations of the 3D NS equations, and tensor notation provides the least error prone way to derive 
equations in curvilinear, nonorthogonal coordinates. For the boundary-layer equations, details can be found in 
the book by Nash and Patel (1972). 

The x-z coordinate system for the 3D boundary-layer equations can be laid out arbitrarily on the body sur- 
face. The velocity profiles described in an arbitrary coordinate system can look quite different from the stream- 
wise and cross-flow profiles that we considered earlier. Figure 4.2.2 shows what the velocity profiles of Figure 
4.1.6 would look like in an arbitrary boundary-layer coordinate system xb,Zb. 


Figure 4.2.2 Isometric view of the 3D boundary-layer velocity profiles of Figure 4.1.6, as resolved in an arbit- 
rary 3D boundary-layer coordinate system xb, Zb 
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In some situations, particular choices of coordinate alignment can make life much easier. For example, if a 
numerical solution is to be generated in a marching sequence, it may be necessary to align one coordinate or the 
other roughly in the dominant flow direction. Sometimes it may be advantageous to align one of the coordinate 
families with the streamlines of the outer flow, a choice referred to as streamline coordinates, though this is 
not often done in practice. Sometimes one coordinate line is aligned with a line along which the initial condi- 
tions for the boundary-layer flow are known or can be easily generated as solutions to the plane-of-symmetry 
boundary-layer equations, simplified versions of the 3D equations that we'll discuss in Section 4.3.4. Examples 
of this would be to align one coordinate line with a plane of symmetry of a body or the leading-edge attachment 
line on a swept wing, as shown in Figure 4.2.3. 


Figure 4.2.3 Examples of aligning one coordinate of a 3D coordinate system with a line along which plane-of- 
symmetry equations apply 
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Leading-edge attachment line on a swept wing 


In Section 4.1.2, we discussed in physical terms how the presence of cross flow affects the momentum bal- 
ance in a 3D boundary layer, and how cross flow is related to the pressure gradient in the cross-flow direction. 
We also saw how a pressure gradient in the cross-flow direction requires outer-flow streamline curvature in the 
cross-flow direction. What this means is illustrated in Figure 4.2.4. The outer-flow streamline projected onto 
the body surface forms a 3D space curve that at any given point has a unit tangent vector t. When the curvature 
dt/ds (the of change of t along the curve) is nonzero, we can define a unit curvature vector k = (1/«)dt/ds. Then 
the curvature vector «kk can be decomposed into components perpendicular and parallel to the local tangent 
plane of the body surface. The component perpendicular to the local tangent plane 1s due to the curvature of the 
surface and is not active in cross-flow production. The cross-flow pressure gradient that is active in producing 
cross flow is proportional to the component of «k parallel to the local tangent plane of the body surface. 


Figure 4.2.4 Illustration of outer-flow streamline curvature. The unit tangent vector to the local outer-flow 
streamline projection 1s t, and the wnit curvature vector K 1s given by (1/k) dt/ds. The component of the 
curvature vector kk projected in the local tangent plane is the component that 1s active in producing cross flow 
in a 3D boundary layer 
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Another way to look at the active component of the curvature is to project the outer streamline into the tan- 
gent plane instead of into the surface itself. The perpendicular component of kk, which is the one due to surface 
curvature, is then lost (zero), but the parallel component is the same as before. So the curvature we're interested 
in is the curvature of the outer streamline as viewed in the local tangent plane. For a curve that lies in a curved 
surface, the mathematical term for this part of the curvature 1s intrinsic curvature. So pressure-driven cross flow 
is always associated with situations in which the intrinsic curvature of the outer-flow streamlines is nonzero. 


When a curve lying in a surface has zero intrinsic curvature it is called a geodesic of the surface. Longitude 
lines and great-circle routes on a spherical globe are examples of geodesics of a spherical surface. Sedney 
(1957), using the 3D boundary-layer equations in streamline coordinates, showed that if the outer-flow stream- 
lines are geodesics of the body surface, the 3D boundary layer will have no tendency to generate cross flow. 
While this sounds like a very general result, the only common examples of it are axisymmetric and 2D planar 
flows. 

Earlier in this section, we discussed how a solution to the 2D boundary-layer equations at one station along 
the surface depends only on what happens upstream and influences only what happens downstream, when the 
imposed pressure p(x) is held fixed. The same principle applies in solutions to the 3D boundary-layer equa- 
tions, but we must generalize the meaning of “upstream” and “downstream” in a particular way. For the 3D 
boundary-layer solution along a particular column normal to the surface at a point P, when the imposed pressure 
p(Xb, Zb) is held fixed, the upstream zone of dependence and the downstream zone of influence are curvilin- 
ear wedge-shaped regions defined by the widest range of streamline directions passing through the column, as 
shown in Figure 4.2.5 (see Wang, 1971). The flow at every point on the column at P depends on everything 
inside the upstream zone of dependence and influences everything in the downstream zone of influence. In the 
lateral regions outside of both of these zones, the flow neither directly affects, nor is directly affected by, the 
flow at points along the column at P. As was the case in 2D, this all-or-nothing dependence/influence situation 
is an idealization resulting from the theoretical assumptions. The earlier comments regarding indirect influence 
through interaction with the outer flow and the effects of local disturbances like small bumps on the surface ap- 
ply in 3D as they did in 2D. In 3D we have the additional effect of viscous diffusion in the lateral cross-stream 
direction, which is ignored in the idealized theory but which can have direct effects in the real world that are 


significant over short distances, on the order of the boundary-layer thickness. Thus it would be reasonable to 
think of direct dependence and influence in the real world as a kind of blurred version of the picture in Figure 
4.2.5. 





Figure 4.2.5 Zones of dependence and influence in the 3D boundary-layer equations 
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4.2.2 Integrated Momentum Balance in a Boundary Layer 


Looking at the streamwise momentum balance in the boundary layer in an integrated sense is informative for 
purposes of physical understanding and is the basis for a whole class of boundary-layer prediction methods, the 
integral methods. If we take the 2D momentum Equation 4.2.1, substitute Equation 4.2.5 for the dp/dx term, and 
integrate in the y direction from y = 0 to any height h that is everywhere outside the boundary layer, we obtain 
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Then if we integrate the continuity Equation 4.2.4 from y = 0 to y = y and substitute for v in Equation 4.2.8, 
recognizing that 
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is called the boundary-layer shape factor, and 5 is the displacement thickness, a measure of the displacement 
effect of the boundary layer, which we'll discuss below (Equation 4.2.13). The second term on the right-hand 
side defines the effect of the pressure gradient and is shown in two alternate forms, one using due/dx and the 


other using dp/dx directly. The two forms are related by the Euler momentum Equation 4.2.5. A version of 
Equation 4.2.10 valid for compressible flow can be found in Schlichting (1979). 

The momentum thickness is a measure of the flux of momentum deficit and can be related to drag in ways 
we'll discuss in Section 6.1.4. The flux of momentum deficit isn't the easiest of concepts to appreciate intuit- 
ively, but it leads to the most direct way to express the integrated momentum balance, as in Equation 4.2.10. 

As we'll see in Section 4.4, Equation 4.2.9 holds for turbulent flow as well as laminar, so that the momentum 
integral Equation 4.2.10 1s valid for turbulent flow, even though we derived it from the field equations for lam- 
inar flow. 

The momentum integral equation quantifies and illuminates several important relationships in boundary-lay- 
er flows. It clearly shows how the growth of the boundary layer, represented by d0/dx in Equation 4.2.10, de- 
pends on the skin friction and the pressure gradient, relationships we discussed in physical terms in Section 
4.1.2. It dictates that the growth rate d0/dx will be positive unless the pressure-gradient term is sufficiently neg- 
ative (1.e., there is a sufficiently strong favorable pressure gradient) to offset the skin-friction contribution. It 
also shows how the effect of the pressure gradient depends on the shape of the velocity profile, through the 
shape factor, H. Referring to the laminar velocity profiles with low and high H sketched in Figure 4.2.6, we can 
see that the higher H is, the higher will be the proportion of the flux of momentum deficit that is accounted for 
by low-velocity fluid. As we argued in Section 4.1.2, low-velocity fluid responds to the pressure gradient with a 
larger local velocity gradient Ou/Ox. A high-H boundary layer is thus more susceptible to the effect of a pressure 
gradient, an effect that Equation 4.2.10 quantifies in terms of d0/dx. 


Figure 4.2.6 2D laminar velocity profiles illustrating low and high shape factor H 
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The momentum integral equation has four independent variables: ue, 8, H, and C¢. Generally at least one of 
these must be imposed as a “boundary condition.” Solving the equation with ue imposed is referred to as the 
direct mode, just as it was in solving the field Equations 4.2.1—4.2.4. And just as before, solving with any of 
the other three imposed instead is referred to as an inverse mode. Whatever mode the equation is solved in, it 
has three unknowns, and two additional closure relationships are therefore required in order to have a complete 
system. One relation is almost always a skin-friction law that defines Cf as a function of other boundary-layer 
quantities. In many methods, the other closure relation requires making an assumption about the velocity pro- 
files, usually in the form of a family of profile shapes with a range of shape factor H. The skin-friction law 
and the other closure relation will of course depend on whether the flow is laminar or turbulent. A wide variety 
of integral boundary-layer methods have been developed for both laminar and turbulent flow. Two of the best- 
known examples are the lag-entrainment method of Green, Weeks, and Brooman (1977) for turbulent flow, and 
the laminar and turbulent methods developed by Drela and Giles (1986) for airfoil calculations. White (1991) 
discusses some of the earlier methods. 


Three-dimensional integral boundary-layer equations can be derived in the same way, starting with the 3D 
boundary-layer momentum equations in general curvilinear, nonorthogonal coordinates. There are then two 
equations corresponding to Equation 4.2.10, one for each in-surface coordinate direction. Because there are 
two surface-parallel velocity components carrying momentum, the equations contain two “displacement” thick- 
nesses and four momentum thicknesses. There are also two skin-friction coefficients to solve for. Compared 
with 2D, we thus have five more basic unknowns and only one more basic equation, so more closure relations 
are required. Tying the additional integral thicknesses together generally requires making empirical assump- 
tions about the velocity profiles. The usual approach is to develop relations based on empirical models for the 
streamwise and cross-flow velocity profiles and then to transform the relations into the general coordinate sys- 
tem. The resulting equations are very complicated. The 3D integral method by P.D. Smith (1972) is one of the 
better-known examples. 


4.2.3 The Displacement Effect and Matching with the Outer 
Flow 


In early applications of boundary-layer theory, p(x) or we (x) was usually assumed to be known a priori from 
a potential-flow solution for inviscid flow around the body, the boundary-layer equations were solved in the 
direct mode, and any influence of the boundary layer on the outer flow was ignored. Later development of the 
rigorous theory showed that this is all that is required for the formal first-order theory. But in many practical 
applications, the effect of the boundary layer on the pressure distribution is significant, and the first-order theory 
is not enough. 

In higher fidelity viscous-inviscid matching calculations, the objective is to impose the boundary layer's 
displacement effect on the outer flow and thus to have the outer-flow calculation provide a prediction of the 
boundary-layer's effect on the surface pressure. Note that the correction for displacement is formally a second- 
order effect (see White, 1991, Section 4.11), but that when we introduce it we still usually ignore all other 
second-order effects. Interactive schemes that impose the displacement effect always require some iteration, 
with both the boundary-layer and the outer-flow solutions repeated several times. 

The most obvious matching strategy is simple repeated cycling: Calculate the inviscid flow, then input the 
resulting uve (x) to a boundary-layer calculation, and then add the resulting 5° to the body shape, and repeat 
to the inviscid calculation. This scheme works some of the time, but it doesn't always converge, especially if 
the boundary-layer calculation gets close to separation. Many other schemes have been developed in attempts 
to achieve more robust convergence, to extend matching calculations into regions of separation, and to reduce 
computing time. In these schemes, the inviscid flow and the boundary layer are calculated either in series or in 
parallel with each other, or simultaneously in systems where the boundary-layer and outer-flow equations have 
been combined. In various nonsimultaneous schemes, the boundary-layer equations may be solved in the direct 
mode, an inverse mode, or a combination. In all cases, the matching of p(x) or ue(x) is either imposed explicitly 
as a condition on each boundary-layer solution or is sought as an outcome of the iterative coupling. We'll dis- 
cuss such schemes further in Chapter 10. 


In Section 4.1.3, we discussed the displacement effect and the idea of a displacement thickness 1n qualitative 
terms. We also discussed the concept of an equivalent inviscid flow, which has the same velocity field and pres- 
sure distribution outside the boundary layer as the actual flow, but remains inviscid and irrotational across all 
or part of the near-wall region occupied by the boundary layer in the actual flow. Implementing this concept 
requires altering the boundary conditions on the outer inviscid flow solution so that 1t matches the desired equi- 
valent inviscid flow. There are two ways to do this: (1) to apply the usual no-through-flow boundary condition 


on an “effective” body contour that has been moved outward by the displacement thickness or (11) to apply a 
nonzero normal velocity, or transpiration, on the actual body contour. These ideas are illustrated in Figure 4.2.7. 





Figure 4.2.7 Illustration of the 2D boundary-layer displacement effect. (a) Actual boundary-layer flow. (b) 
Equivalent inviscid flow with displacement thickness. (c) Equivalent inviscid flow with surface transpiration 
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In Section 4.1.3, we saw that in 2D flows the most convenient value for the displacement thickness 5 can be 
defined by requiring that the equivalent inviscid flow between y = 5° and any value of y outside the boundary 
layer, as shown i in Figure 4.2.7b, have the same mass flow as the actual flow, as shown in Figure 4.2.7a. The 
result is that 5 is given by an integral of the local velocity profile: 
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Given this definition, the imaginary displacement surface at y = 5° is a streamline of the equivalent inviscid 
flow and is therefore a surface along which the equivalent inviscid flow would satisfy a solid-wall boundary 
condition. 

But this is not our only choice for a definition of 5. As we noted in Section 4.1. 3, we can apply a solid-wall 
boundary condition at any nearby streamline of the equivalent inviscid flow, and it will have the same effect 
on the rest of the flow. Thus the value of 8° defined by Equation 4.2.13 isn't the only choice that would have 
the effect we seek. However, as we saw in Figure 4.3.12, the drawback to the other choices for 8 is that the 
displacement surface won't have a simple, closed shape at the nose of the body. As a practical matter, this is a 


major drawback, and alternative definitions of 2D 5 are hever used in practice. But the alternative derivation is 
useful because it shows the way toward a definition of 8 that works in 3D. 

The alternative derivation of 2D 8. that captures the other possible choices 1 iS. based on flow tangency and 
the continuity equation. Recall from Section 4.1.3 that the purpose of defining 5 is to produce the right flow 
inclination v/uwe in the equivalent inviscid flow where it contacts the displacement surface. We can calculate v(y) 
in both the real flow and the equivalent inviscid flow by integration of the continuity equation in the y direction. 
For the real flow, we start with v = 0 at y = 0 and integrate outward to a point outside the boundary layer. For the 
equivalent inviscid flow, we start with v matching the real flow outside the boundary layer and integrate inward 
to any y inside the boundary layer, thus defining v(y) for the equivalent inviscid flow. The 5° surface we seek 
to define must be tangent to the streamlines of the equivalent inviscid flow and thus must have a slope dd “Idx 
equal to v(y)/ue. The result is an equation for 8: 
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where ~ OK © 1g defined by the local integral given by Equation 4.2.13. Equation 4. 2. 14 is singular at a stagnation 
point, but useable elsewhere. When Equation 4.2.14 is integrated in x, it defines 5 only to within a constant of 
integration. In effect, the value of 5 at any initial x station other than the stagnation point is free to be chosen 
arbitrarily. 

At the stagnation point of attachment, we seem to have a problem: Equation 4.2.14 is singular there, and be- 
cause u(y) = Ue = 0, Equation 4.2.13 doesn't seem to define anything either. But the 2D stagnation-point bound- 
ary layer is one of the laminar similarity solutions we'll discuss in Section 4.3.2, and in the neighborhood of 
the stagnation point OK © ig constant, including in the limit as the origin is approached. So i" = oc is the 
definition of 8 that works at the stagnation point and is consistent with Equation 4.2.14 elsewhere. 

In 3D flow, the mass-flux argument leading to Equation 4.2.13 doesn't generally apply. Instead, we must use 
the same matching and tangency analysis that led to Equation 4.2.14, extended to 3D flows (see Nash and Patel, 
1972). The result is a PDE in two dimensions on the body surface, involving two local integrals of the velocity 
profiles, which in Cartesian coordinates is given by 
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The 3D 8 equation is hyperbolic, and solving it numerically must follow the usual rules for hyperbolic equa- 
tions. Initial conditions (initial values of 8° ) must be defined at upstream boundaries, in a manner analogous to 
defining a starting value of 5 in 2D in Equation 4.2.14. Three-dimensional analogs to starting at a 2D stagna- 
tion point are starting at a singular point of attachment, as at the blunt nose of a fuselage, or at an attachment 
line, as at the leading edge of a swept wing. A similarity solution applies at a singular point of attachment, and at 
a leading edge attachment line, we can usually use the plane-of-symmetry boundary layer equations discussed 








in Section 4.3.4. Both of these situations define starting values of 5 analogous to starting with 0 
2D. 

We mentioned earlier that an alternative to using 5 to produce the displacement effect is to use an equivalent 
transpiration boundary condition applied at the actual body surface. We derive this condition by applying the 
same process of integrating the continuity equation that led to Equations 4.2.14 and 4.2.15. The 2D result is 
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Imposing this as a boundary condition on the outer flow simulates the boundary-layer displacement effect by 
enforcing the flow situation illustrated in Figure 4.2.7c. Note that because this is a boundary condition on the 
outer inviscid flow, pw is the density of the inviscid flow at the wall, not the viscous boundary-layer flow. 


4.2.4 The Vorticity “Budget” in a 2D Incompressible 
Boundary Layer 


In Section 3.6, we discussed the vorticity equation governing the creation or destruction, convection, stretching, 
and diffusion of vorticity. In a 2D planar flow, only the transverse component of the vorticity can be nonzero, 
and there is no vortex stretching. Furthermore, if the density is constant, there can be no creation or destruction 
of vorticity in the interior of the field. With these restrictions, there 1s only convection and diffusion, and the 
vorticity equation looks like a conservation equation for a passive scalar (the transverse component of the vorti- 
city). We should keep in mind, however, that it isn't an independent physical equation in its own right, but really 
just a sort of rearrangement of the momentum equation. 

In a 2D, incompressible, steady flow, the conservation of vorticity in control-volume form is quite simple. It 
reduces to a balance between two fluxes integrated over the boundaries of the control volume: the convective 
flux @V « n and the diffusive flux vOw/on, where v is the kinematic viscosity, and @ 1s the vorticity component 
in the transverse direction, the only component that can be nonzero. 

In Figure 4.2.8, we set up a control volume for a boundary-layer flow in the x direction and indicate the relev- 
ant diffusive flux to or from the wall and the convective fluxes through the upstream and downstream boundar- 
ies. The diffusive fluxes at the upstream and downstream boundaries are negligible. Even if flow passes through 
the outer boundary, there is no diffusive or convective vorticity flux there because the vorticity is effectively 
zero outside the boundary layer. So the diffusive flux in or out at the wall must balance the difference between 
the convective flux in (upstream) and out (downstream). 


Figure 4.2.8 Control volume for tracking the vorticity “budget” of a 2D incompressible boundary layer 
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The diffusive flux of vorticity to the wall is given by 
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and for a thin boundary-layer flow, the convective flux reduces to 


a th Oy ete th 
| uty as — | u—dy = — - | udu = = 
4.2.22 /0 Jo (OY /0 . 


So we see that the net vorticity flux in a thin boundary layer 1s given approximately by half the square of the 
edge velocity. This is analogous to the result that the integrated vorticity in a thin shear layer is approximately 
equal to the velocity jump across the shear layer, which we discussed in Section 3.3.8. 

Now we see that the vorticity balance requires flux to or from the wall only when ue 1s changing with x. The 
wall acts as a source or sink for wore depending on the rate of change of ue. Increasing Ue requires vorti- 
oy _ from the wall, or negative 6? uldy’, while decreasing Ue requires vorticity flux to the wall, or positive 
Gr way”. This is of course consistent with our finding in Equation 4.2.7, based on the momentum equation, that 
6" u/dy* at the wall is proportional to dp/dx. 

A flat-plate boundary layer, for which ue is constant in x, 1s an interesting special case. When ve is constant, 
there is no diffusive vorticity flux to or from the wall, only a constant integrated convective flux along the flow 
direction. Of course, the vorticity in the field becomes more diffuse as the boundary layer thickens. 


4.2.5 Situations That Violate the Assumptions of 
Boundary-Layer Theory 


Remember that in deriving Prandtl's boundary-layer equations, 1t was assumed that the boundary layer is thin 
and that x derivatives are much smaller than y derivatives. Any boundary-layer situation in which the x derivat- 
ives become comparable to the y derivatives obviously violates these assumptions. Figure 4.2.9 illustrates a few 
common examples of flows that constitute boundary layers in the physical sense but violate the assumptions of 
first-order boundary-layer theory. 


Figure 4.2.9 Illustrations of situations that violate the assumptions of first-order boundary-layer theory. (a) 
Corners. (b) Steps. (c) Large longitudinal curvature. (d) Separation from a smooth wall. (e) Separation from 
trailing edge. (f) Interaction with shock. (g) Sudden change in surface roughness. (h) Protuberances. (1) Sharp 
leading edge 
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Situations such as these can raise interesting issues for boundary-layer theory. For example, separation from 
a smooth wall (Figure 4.2.9d) is something we discussed from a physical point of view in Section 4.1.4, where 
we saw that the skin-friction coefficient Cy goes through zero at a separation point in 2D flow. Because the neg- 
ative values reached by Cf downstream of separation are generally quite small in magnitude, in the real world 
governed by the full NS equations Cy usually goes through zero with a small slope (Figure 4.2.10a). Direct- 
mode solutions to the first-order boundary-layer equations don't predict this behavior correctly at all. The Cr 
calculated in the direct-mode goes through zero at separation with infinite slope (Figure 4.2.10b), a square-root 
singularity that in the context of boundary-layer theory is called the Goldstein singularity. Numerical methods 
for solving the equations in the direct mode generally fail before reaching the singularity. Solving the equations 
in any of the inverse modes removes the singularity but does not alleviate the problem that x derivatives in the 
neighborhood of separation are usually still large enough to violate the boundary-layer-theory assumptions. 


Figure 4.2.10 Illustrations of how Cf goes through zero at separation in 2D. (a) Small slope according to NS 
equations. (b) Infinite slope (Goldstein singularity) according to direct-mode BL equations 


Cy C; 
x x 
(a) (b) 


Continuing the solution downstream of the separation point raises another set of difficulties having to do 
with dependence and influence. Upstream of separation, a one-direction marching sequence can be used in the 
numerical scheme, because the solution at any point depends only on the solution upstream. Downstream of 


separation, the reversed flow at the bottom of the boundary layer convects information from the general down- 
stream direction, and for the numerical scheme to deal with this properly, something more complicated than 
one-direction marching is required. 

Separation from a sharp trailing edge (Figure 4.2.9e) raises a different set of issues. For this situation, an ap- 
proximate correction to boundary-layer theory has been developed, called triple-deck theory (see White, 1991, 
for a summary). Of course, the full NS equations handle all of these situations “exactly,” and without any of the 
previous theoretical difficulties, just a large increase in required computational effort. 

Separation from a smooth wall in 3D raises the same issues for direct-mode solutions to the boundary-layer 
equations as it does in 2D. The component of Cy perpendicular to the separation line goes through zero with a 
square-root (Goldstein) singularity just as 2D Cydoes. This means that adjacent skin-friction lines converge to a 
distinct separation line parabolically, rather than just asymptotically converging toward each other, as they do 
in solutions to the NS equations. This distinction 1s illustrated in Figure 4.2.11. Some illustrations of the topo- 
logy of separation, such as the one in Figure 4.1.17, show the separation line in the form of a distinct, singular 
line, as in Figure 4.2.11b, even though separation in real flows takes the form in Figure 4.2.1 1a. 





Figure 4.2.11 Skin-friction lines in neighborhood of a 3D separation line. These are the 3D analogs of the 2D 
Cf distributions of Figure 4.2.10. In 3D, the component of Cf perpendicular to the separation line follows qual- 
itatively the same pattern as in 2D. (a) Continuous variation according to NS equations (asymptotic conver- 
gence). (b) Singular behavior according to direct mode BL equations (parabolic convergence) 





4.2.6 Summary of Lessons from Boundary-Layer Theory 


To summarize the conclusions we've gleaned from boundary-layer theory: 
1. Because a boundary layer tends to remain thin, the pressure variation across it tends to be small and is 
ignored in the first-order theory. The pressure plays the role of an environmental condition imposed on 
the boundary layer from outside, that is, by the outer inviscid flow. 


2. The local flow in the boundary layer depends on everything that happens upstream and influences 
everything that happens downstream. 


3. The local boundary layer's influence on what happens upstream is much less direct. If the boundary 
layer is attached, it has little direct influence upstream (none in the first-order theory), and only a weak 
indirect influence through the outer flow. If the boundary layer separates, its upstream influence through 
the outer flow becomes much stronger. 


4. The development of a 2D laminar, incompressible boundary layer, expressed in dimensionless terms, 
is independent of Reynolds number, provided the boundary layer starts at a stagnation point. 


Except for point (4), the above conclusions apply to both 2D and 3D boundary layers. 


4.3 Flat-Plate Boundary Layers and Other 
Simplified Cases 


In this section, we look at flat-plate flows and a number of other situations in which boundary-layer behavior is 
simplified in one way or another. These idealized cases provide valuable physical insights that are not as eas- 
ily seen in more general situations. And in many situations, they are close enough to reality to provided useful 
quantitative information. 


4.3.1 Flat-Plate Flow 


When the outer-flow pressure and velocity are constant in 2D flow, the flow development predicted by 
boundary-layer theory is particularly simple. This idealized situation is referred to as flat-plate flow because a 
thin flat plate aligned with the flow actually produces something very close to it, except in the neighborhood of 
the leading edge. 

Two-dimensional incompressible laminar flow at constant pressure 1s one of the similarity situations we men- 
tioned in Section 4.2.1, and the similarity solution is credited to Blasius (see Schlichting, 1979). Treating the 
corresponding incompressible turbulent flow requires empiricism to account for the effects of turbulence, which 
we'll consider in more detail in Section 4.4.2. 


Expressions for the average skin-friction coefficient CF over the length of a flat plate are widely used in 
preliminary drag estimation, as we'll see in Section 6.2.1. A question mark that hangs over any such formula 
has to do with the leading-edge region and what details had to be glossed over there. If we start the boundary 
layer at zero thickness and ignore the singularity of the boundary-layer equations in that situation, not to men- 
tion the fact that the assumptions behind the equations aren't valid there, the result from the Blasius solution for 
incompressible laminar flow is 
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The most widely used corresponding formula for incompressible turbulent flow is the Prandtl-Schlichting 
relation (see Schlichting, 1979): 


These laminar and turbulent “fi relations are plotted for a wide range of length Reynolds number in Figure 


4.3.1. First, note that in both laminar and turbulent flow Cf decreases with increasing Reynolds number, in 
spite of the fact that the boundary-layer thickness, as a fraction of body length decreases and Cu/Oy at the wall 
(in dimensionless terms) increases. The rate of increase of dimensionless Ou/Oy at the wall is not enough to can- 


cel the direct effect of the Reynolds-number increase. Also note that laminar fi decreases faster than turbulent 


oF , becoming a smaller and smaller fraction of turbulent oF : 


Figure 4.3.1 Average skin-friction coefficients for incompressible laminar, transitional, and turbulent flow on 
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The curves for cases with transition in Figure 4.3.1 were calculated for idealized “instantaneous” transition, 
in which the boundary layer is assumed to become turbulent immediately at the assumed transition location, 
and the turbulent boundary layer is assumed to start with the same momentum thickness reached by the laminar 


boundary layer at the transition point. (We'll see immediately below how 0 and CF are related, in Equation 
4.3.3.) Note that when the length Reynolds number exceeds the assumed transition Reynolds number, and the 
boundary layer becomes turbulent on the downstream part of the plate, the average Cy increases rapidly at first 
and then asymptotically approaches the all-turbulent curve from below. 


Because the pressure gradient is zero in flat-plate flow, only the skin friction contributes to the growth in 


momentum thickness along the plate. @ is then just proportional to the cumulative average Cr, as can be seen 
in the momentum-integral Equation 4.2.10, which reduces to 
~ 24 2 Ra 

C;(z) = C* = 
4.3.3 = ite 











This result is valid for either laminar or turbulent flow and is not limited to incompressible flow. 
The local Cy is often also of interest, and we can obtain expressions for it by differentiating «! CK , where 


Cr comes from Equations 4.3.1 and 4.3.2. The result for incompressible laminar flow is 
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and for incompressible turbulent flow: 


43.5 Ca = Cp(L =z) —0.51(log R,)~*™. 
These relations are plotted in Figure 4.3.2. The first thing to note about local Cg 1s that it decreases with 
increasing R and is therefore less than Of (for flat-plate flow; in more general situations, this is not always the 


case). The cases with transition are the same ones for which Ch was plotted in Figure 4.3.1, but when viewed 
in terms of Cf in Figure 4.3.2, they look very different. After transition, local turbulent C4 jumps well above 
the all-turbulent curve. This is because the laminar run results in a thinner boundary layer (smaller @) at the start 
of the turbulent boundary layer than would be there in the all-turbulent case. Thus the local-Cy curves in Figure 


4.3.2 approach the all-turbulent curve from above, while the Cf curves of Figure 4.3.1 approach it from below 
because the average includes the low-C¢ laminar run. 


Figure 4.3.2 Local skin-friction coefficients for incompressible laminar, transitional, and turbulent flow on a 
flat plate 
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The above formulas for “fi and Cri as functions of x are not always accurate for real-world flat-plate flows 
because the development of a real flow near the leading edge doesn't generally follow the idealized flat-plate 
assumptions. Away from the leading edge, real-world flows with constant pressure should settle into agreement 
with the local-C¢ formula, but with a shift in the apparent origin in x. In the “fully developed” flat-plate bound- 
ary layer the local relationship between Cf and Ro is independent of what happened near the leading edge. For 
laminar flow, we can derive this formula by inverting Equation 4.3.4 (expressing x as a function of Cf) and 
substituting it into Equation 4.3.3, with the result: 


43.6 Ci = 0.6647 Re 


For turbulent flow, Equation 4.3.5 cannot be inverted in closed form, and the turbulent counterpart to Equa- 
tion 4.3.6 must be determined by numerical interpolation of tables of C(x) from Equation 4.3.5 and Re (x) from 
Equations 4.3.2 and 4.3.3. The resulting Cg(Re) relations are plotted in Figure 4.3.3. 


Figure 4.3.3 Local skin-friction coefficients for incompressible laminar and turbulent flow on a flat plate, as 
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Compressibility and/or heat transfer can alter flat-plate skin friction significantly. For cases with heat trans- 
fer, numerical calculations are generally required. For the case of an adiabatic wall, the situation is simpler, and 
many approximate theories have been developed, with predictions that vary widely (see White, 1991). Experi- 
mental data from numerous sources also show considerable scatter. White states that the curve labeled “Frankl- 
Voishel” in his Figure 7.22 fits “typical data” fairly well. A simple formula that approximately fits this curve, 
over the range of Mach numbers below 2.0, is given by 
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which 1s plotted in Figure 4.3.4. Note that at Mach 1.0, compressibility reduces Or by only about 10%. 


Figure 4.3.4 Effect of Mach number on adiabatic turbulent skin friction on a flat plate, Equation 4.3.7 
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4.3.2 2D Boundary-Layer Flows with Similarity 


Similarity in a boundary-layer flow refers to a situation in which the velocity profile in nondimensional form, 
with an appropriately scaled vertical coordinate, is the same for all x. Strictly speaking, similarity exists only in 
laminar flow, but there are examples of near-similarity in turbulent flow. 


For incompressible laminar flow, it can be shown that similar solutions exist when we ~ x". In such cases, a 
similarity transformation can be defined that reduces the boundary-layer Equations 4.2.1 and 4.2.4 to a single 
ODE. A derivation is given in Schlichting (1979). The ODE has solutions for a range of values of m, which 
are usually referred to in terms of the value of the similarity parameter B = 2m/(m + 1). Negative values of B 
correspond to adverse pressure gradients, with a boundary layer 1n a constant state of incipient separation (zero 
Cf for all x) corresponding to B = — 0.19884. Zero B corresponds to a flat-plate flow (constant ue, the Blasius 
solution), and positive B corresponds to favorable pressure gradients. Velocity profiles for three representative 
values of B are plotted in Figure 4.3.5. 


Figure 4.3.5 Velocity profiles for several laminar similarity solutions with outer-flow velocity distributions 
corresponding to the potential flow over wedges with included angles zf, as illustrated in Figure 4.3.6. 1 is the 
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The laminar similarity solutions are often referred to as wedge flows because the outer-flow velocity distri- 
bution ue ~ x” arises in the potential flow around a wedge of included angle zB. These are the same corner-flow 
solutions that we saw in Section 3.10 can be obtained by conformal mapping. Figure 4.3.6 illustrates how they 
work for the different types of flow. The decelerating flows correspond to wedges with negative included angles, 
or potential flow over a convex corner, as in Figure 4.3.6a. Flat-plate flow corresponds to a zero included angle, 
as in Figure 4.3.6b, and B = 1 corresponds to a 2D stagnation-point flow, as in Figure 4.3.6c, often referred to as 
Hiemenz flow. Note in Figure 3.10.2d that the edge velocity in the stagnation-point flow increases linearly away 
from the stagnation point, and the laminar-boundary-layer thickness is constant because the usual tendency of a 
boundary to grow is canceled by the strong favorable pressure gradient. 


Figure 4.3.6 Illustration of the potential-flow wedge angles zB corresponding to the laminar-boundary-layer 
similarity solutions. (a) Decelerating flow with negative included angle. (b) Flat-plate flow with zero included 
angle. (c) Accelerating flow in the special case of a 2D stagnation point flow (B = 1.0) 
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Of all of these similarity solutions, only two tend to appear in actual flowfields in something close to their 
ideal form: the flat-plate flow and the 2D stagnation-point flow. However, the other members of the laminar 


similarity family provide fairly good matches to local velocity profiles in more general laminar flows. As we 
discussed in Section 4.2.2, integral methods for predicting boundary-layer development often use assumptions 
about velocity-profile shapes. The laminar method developed by Drela and Giles (1986), for example, uses 
curve-fits based on the laminar similarity solutions. 

In 2D incompressible turbulent flow, the velocity profiles themselves are never really similar, but the 
velocity-defect profiles, properly scaled, can be similar under the right conditions. Clauser (1954, 1956) found 
that if the ue(x) distribution is properly tailored, the velocity-defect profile in the form (we — u)/u; can be held 


constant, where “™ ~ W 'w/ is the friction velocity, which we'll discuss further in Section 4.4.2. The de- 


fect profile is a function only of a turbulent pressure-gradient parameter, 
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not to be confused with the laminar similarity parameter of the same name. Note that the laminar and turbulent 
versions of f§ have opposite signs: laminar accelerating flows have positive B, while turbulent accelerating flows 
have negative B. The turbulent similarity relation has the form 
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Clauser (1954, 1956) coined the term equilibrium turbulent boundary layer for the special boundary-layer flows 
that obey the above relation. In principle, an equilibrium turbulent boundary layer is strictly similar only in the 
defect-profile sense of Equation 4.3.9, but because ur; varies slowly with x, it will appear nearly similar in terms 
of the conventional velocity profile, provided the range of Reynolds number considered is not too wide. 
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Several researchers have succeeded in setting up experimental flows that were very close to “equilibrium” in 
the sense of Equation 4.3.8, ranging from strongly accelerating flows (negative B) to strongly decelerating (pos- 
itive B). Flat-plate flow is a member of this family. The best-known of the non-flat-plate cases was published 
by Stratford (1959b), a turbulent boundary layer held in a constant state of incipient separation (nominally zero 
Cf), which we'll see in Section 7.4.3 has played a role in the theory of airfoil maximum lift. 

Like the laminar similarity solutions, the family of equilibrium turbulent-boundary-layer velocity profiles 
has provided velocity-profile assumptions for integral methods of calculating boundary-layer development. The 
turbulent method by Drela and Giles (1986) is one example. 


4.3.3 Axisymmetric Flow 


Axisymmetric flow generally requires an axisymmetric body shape and zero angle of attack. Flow quantities 
by definition depend on only two coordinates (x, 7), so that axisymmetric flow is 2D, mathematically speaking. 
What makes axisymmetric flow different from the usual planar 2D flow is that the body radius can vary with 
x, contracting or stretching the boundary layer circumferentially, as illustrated in Figure 4.3.7. The flux of mo- 
mentum deficit in the boundary layer is thus distributed over a varying distance circumferentially, so that the 
radius variation joins with the skin friction and the pressure gradient as an additional factor that affects the evol- 
ution of the boundary-layer thickness along the body. When the radius grows, stretching the boundary layer 
circumferentially, the rate of boundary-layer growth is reduced or even reversed, and when the radius shrinks, 
the rate of boundary-layer growth 1s increased, compared with what it would be in a planar flow with the same 
pressure gradient. 


Figure 4.3.7 Illustration of the effect of a change in body radius on a boundary layer in axisymmetric flow 
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These effects of radius variation are similar to the effects of convergence and divergence in 3D boundary 
layers that we discussed in Section 4.1.2, with the major difference that in more general 3D flows the effects 
are often enhanced by pressure-driven cross flow in the boundary layer. In axisymmetric flow, symmetry does 
not allow a pressure gradient in the cross-flow (circumferential) direction, so there is nothing to drive pressure- 
driven cross flow. Thus unless the body is spinning, producing shear-driven cross flow, there is no cross-flow 
in axisymmetric flow. In mathematical terms, the outer-flow streamlines are along longitude lines of the body 
surface. They are therefore geodesics of the body surface, defined by having zero intrinsic curvature, so that 
there is no tendency to produce pressure-driven cross flow, as we discussed in Section 4.2.1. 

An axisymmetric body can either be simply connected, with no “holes,” or it can be in the form of a ring with 
an open duct along the axis, as illustrated in Figure 4.3.8. If the leading “lip” of a ring-type body is blunt, the 
attachment of the flow to the lip will look locally like flow attachment to the leading edge of a 2D airfoil, and 
the boundary layer at the attachment will closely resemble the 2D similarity solution for B = 1 (Hiemenz flow, 
Section 4.3.2). If the nose of a simply connected body is blunt, flow attachment will be at a singular point of at- 
tachment, from which the flow spreads out radially, as in Figure 4.3.9. In the axisymmetric case, the boundary- 
layer equations for this flow can be transformed to the same ODE as for the similarity solution for 2D planar 
flow with 6B = 1/2, and the velocity profiles are therefore the same (see Schlichting, 1979). The boundary-layer 
thickness 6 is constant in the neighborhood of the stagnation point, as it is in 2D planar flow. 


Figure 4.3.8 Illustration of two types of axisymmetric body. (a) Simply connected. (b) With open duct along 
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Figure 4.3.9 The boundary layer at a singular axisymmetric stagnation point of attachment 





4.3.4 Plane-of-Symmetry and Attachment-Line Boundary 
Layers 


Flows that have a nominal plane of symmetry are common in applications, as, for example, in the flow along 
the keel line of a ship hull at zero yaw or the flow along the vertical symmetry plane (crown and keel lines) 
of a fuselage at zero yaw. Another class of examples comes from the world of boundary-layer experiments. 
Many boundary-layer measurements have been made in special-purpose flow channels (air or water tunnels). 
The measurements are usually made in the boundary layer along the centerline of the channel floor or ceiling, 
and the channel sidewalls are usually parallel, with the intention of producing a 2D flow on the “test wall.” But 
such setups usually produce significant unintended three dimensionality because of boundary-layer growth on 
the sidewalls and 3D flow effects in the corners. 

Plane-of-symmetry flows generally differ from 2D planar flows in that the flow off of the plane of symmetry 
is either diverging away from the plane or converging toward it. In an axisymmetric flow, the flow along every 
longitude line of the body is a special case of plane-of-symmetry flow, with convergence or divergence if the 
body radius is changing with x. Why is axisymmetric flow a special case? Recall from Section 4.3.3 that an 
axisymmetric flow produces no pressure-driven cross flow. General plane-of-symmetry flows don't have this 
restriction: Off of the plane of symmetry, there can be pressure-driven cross flow. Whether the convergence or 
divergence involves cross flow or not, 1t can have a significant effect on the development of the boundary-layer 
velocity field, as we discussed in Section 4.3.3 in connection with axisymmetric flow. 

The general situation in an idealized plane-of-symmetry flow is sketched in Figure 4.3.10. On the symmetry 
plane, the velocity profile is parallel to the plane and looks like a 2D boundary-layer profile. Off of the sym- 
metry plane, divergence is shown, accompanied by pressure-driven cross-flow in which the divergence velocity 
component peaks inside the boundary layer with a larger value than at the edge. 


Figure 4.3.10 Sketch of a boundary-layer flow with convergence or divergence about a plane of symmetry 
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The 3D boundary-layer equations are of course applicable to such flows, but we can take advantage of the 
symmetry to reduce the equations to a spatially 2D set that applies just on the symmetry plane itself. Starting 
with the 3D equations, the z-momentum equation degenerates to 0 = 0 on the symmetry plane because the w ve- 
locity component 1s zero, and 0/dz of all flow quantities other than w is zero. But differentiating the equation in 
the z direction turns it into a nondegenerate equation for the convergence/divergence profile Ow/0z. For laminar 
constant-property flow in Cartesian coordinates, the resulting equations are: 


°* x momentum: 
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Note that the x-momentum equation is the same as the 2D boundary-layer momentum Equation 4.2.1 with 
no explicit appearance of Ow/dz. The solution for the u velocity profile is thus affected by Ow/oz only through 
the continuity equation and its effect on the vertical velocity v. 


Although the plane-of-symmetry x-momentum equation is the same as in 2D flow, the momentum-integral 
equation picks up an additional term representing the effects of convergence/divergence. Recall that the deriva- 


tion of the 2D momentum-integral Equation 4.2.10 made use of the continuity equation. In plane-of-symmetry 
flow, Ow/Oz appears in the continuity equation and thus shows up in the integrated momentum equation 
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as the integral of Ow/0z weighted by the velocity defect ue — u. This is consistent with the intuitive idea that the 
streamwise momentum balance is affected when momentum-deficient air is carried away by flow divergence. 
However, this intuitive interpretation applies directly only in the integrated sense of Equation 4.3.13. In the dif- 
ferential form of the momentum Equation 4.3.10, convergence or divergence is “felt” only “indirectly” through 
the vou/Oy term. 

Throughout the development of CFD codes for turbulent flows, the unintended three dimensionality (con- 
vergence/divergence) of nominally 2D boundary-layer experiments has been a problem for developers of tur- 
bulence models. Discrepancies between calculated and measured boundary-layer quantities are not always the 
fault solely of the turbulence model, but are also contributed to by convergence/divergence in the experimental 
flow. It takes only small amounts of convergence/divergence, usually undetectable by flow-direction measure- 
ments, to have a significant effect on the momentum thickness 0(x), for example. Mellor (1967) argued that 
fairer comparisons between his turbulence model and “2D” boundary-layer measurements could be made if he 
assumed that any discrepancy between calculated and measured 0(x) was due to convergence/divergence, and 
he “corrected” his calculations for it. To make the “correction,” he solved the plane-of-symmetry equations in- 
stead of the 2D equations and set the code up to solve for the unknown Owe/dz boundary condition so as to 
match the measured 0(x). The turbulence model was then evaluated on how well it predicted Cx), H(x), and 
detailed velocity profiles. One uncertainty in applying this calculation strategy is what to assume about the Ow/ 
0z(y) profile at the initial x station, which can affect the solution over surprisingly long distances downstream. 

One of the most important applications of plane-of-symmetry boundary-layer equations is to the band of 
flow attachment along the leading edge of a swept wing, usually called the attachment line. (In Section 5.2.2, 
we'll discuss how on a wing of finite span a single attachment line cannot be uniquely defined, but can for all 
practical purposes be pinned down very closely.) On most swept wings, the intrinsic curvature of the attach- 
ment line is small enough, and the boundary layer is thin enough, that cross flow on the attachment line itself is 
negligible, and the plane-of-symmetry equations apply. (See Section 4.2.1 for a definition of intrinsic curvature 
and a discussion of its relationship to cross flow.) In swept-wing attachment-line flow, the flow divergence 1s 
typically so strong that it keeps the boundary layer orders of magnitude thinner than it would become on a flat 
plate with the same streamwise distribution of edge velocity. We'll look further at the attachment-line boundary 
layer in our discussion of swept wings 1n Section 8.6.2. 


4.3.5 Simplifying the Effects of Sweep and Taper in 3D 


Boundary layers on swept wings, which we'll discuss in some detail in Section 8.6.2, are generally highly 3D 
in the sense of having strong pressure-driven cross-flow. In the most general cases, an adequate simulation of 
the flow requires solving the 3D boundary-layer equations. However, on well-designed swept wings, the isobars 
tend to line up along the constant-percent-chord lines, and simplified equations can then provide reasonably ac- 
curate predictions of boundary-layer development. 

The simplest boundary-layer equations that include the effects of sweep are the infinite-span-swept-wing 
equations. We assume a boundary-layer coordinate system in which the two in-surface coordinates Xsp and Xch 
are aligned parallel and perpendicular to the general sweep of the wing, as illustrated in Figure 4.3.11. We as- 
sume that all flow quantities are unchanging in the spanwise (Xsp) direction, as would be the case if the wing 


had a uniform section and infinite span. When this assumption is applied to the outer inviscid flow instead of 
the boundary layer, we have simple sweep theory, which we'll discuss in Section 8.6.1. 


Figure 4.3.11 Coordinate system assumed 1n the infinite-span swept-wing boundary-layer equations 





Figure 4.3.12 Coordinate systems assumed in the infinite-span swept-wing boundary-layer equations with 
taper. (a) A conical, orthogonal system. (b) A non-orthogonal system 





Starting with the 3D boundary-layer equations in Cartesian coordinates and setting all 0/Oxsp terms to zero, 
we obtain a set of equations that still has three velocity components, but only two space coordinates. These 
are the infinite-span-swept-wing boundary-layer equations, sometimes referred to as the “2.5D boundary-layer 
equations.” Because the spanwise coordinate xsp no longer appears in the equations, the solution domain is only 
two-dimensional, along a single “cut” across the planform, in the perpendicular (Xch) direction from leading 
edge to trailing edge. For purposes of numerical solution, the same upstream-to-downstream marching sequence 
can be used as in a 2D flow. Coding is simpler, and computing time is much shorter than for a full 3D calcula- 
tion. 

An important property of flows governed by these equations is that the influence that the flow in the spanwise 
(Xsp) direction has over the velocity profile in the chordwise (Xch) direction is either loose or nonexistent, de- 
pending on the situation. The spanwise velocity usp doesn't appear in the chordwise momentum equation or in 
the continuity equation, so that the only way the spanwise flow can affect the chordwise flow is through the 
density and the viscosity (and eddy viscosity in the turbulent case). Thus for the spanwise flow to have any 
effect on the chordwise flow, the flow must be either compressible or turbulent. Thus in the case of laminar 
flow with constant properties, we have the /aminar independence principle: The uch profile can be determined 
independently of the usp profile. This doesn't work in reverse, however. The usp profile depends strongly on the 
Uch profile. 


In turbulent and compressible flows, a rough kind of independence still tends to hold, in which the influence 
of the Usp profile on the uch profile is weak. In Section 8.6.2, we'll look at CFD calculations that illustrate the 
typical magnitude of the deviation from independence in turbulent swept-wing flows. 

The basic “2.5D flow” idea can be extended to wings with taper, with the help of an additional assumption. 
On a portion of a swept tapered wing, we can construct a coordinate system consisting of the constant-percent- 
chord lines (“spanlines”) and either “chordwise” arcs constructed perpendicular to them, as shown in Figure 
4.3.12a or conventional “rib” cuts, as in Figure 4.3.12b. Outer-flow quantities are assumed to be unchanging 
along the spanlines, but we can't assume boundary-layer quantities are. So we can't just set the 0/Oxsp terms to 
zero as we did in the case without taper. But we can make an assumption about them that will still give us an 
equation set that is spatially “2D,” with no xsp dependence that we have to solve for. 

We assume a kind of conical similarity in which the distribution of the outer-flow velocities (both the span- 
wise and chordwise components) is the same along every chordwise arc, and thus the only thing that changes 
for the boundary layer as we move spanwise from one arc to another is the chord Reynolds number. We also 
assume that the velocity profiles are similar from one arc to another, and that only the boundary-layer thickness 
changes, in the same way that it would change with chord and Reynolds number in a flat-plate flow. This as- 
sumption will not be satisfied exactly in turbulent flow, and we thus incur some loss of physical fidelity. 

To derive the equations for tapered-wing boundary layers, we start with the 3D boundary-layer equations. 
For the conical coordinate system of Figure 4.3.12a, we use the curvilinear orthogonal form, and for the rib-cut 
system of Figure 4.3.12b, they must be in nonorthogonal form. Next we introduce a scaled normal coordinate 
n = y/o. Now our assumption that the velocity profiles are similar from one arc to another is equivalent to as- 
suming that boundary quantities are unchanging with Xsp, provided we follow them at constant ny rather than 
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constant y. For 6, we assume a flat-plate power law: 0 Ca” for laminar flow and ° ™ a’ for turbulent 


flow, where Ca is the “chord” measured along the chordwise coordinate. 
We'll skip the details, but when all is said and done, the infinite-span-swept-wing boundary-layer equations 
with taper have two types of terms that don't appear in the equations without taper: 
1. Terms proportional to 06/Oxsp that arise from the 0/Oxsp terms in the 3D equations, which we were able 
to set to zero when there was no taper. Using the power law for d(ca), we can express these in terms of 
OCa/Xsp. 
2. Terms related to the curvature of the “chordwise” arcs (Figure 4.3.12a) or the rate of change of the 
skew of the coordinates (the sweep of the spanlines in Figure 4.3.12b), which is proportional to the rate 
of taper OCa/OXxsp. 


So we've replaced the 0/Oxsp terms with terms containing Oca/OXsp. There are no other xsp-derivative terms, 
and we can solve the equations along a single “chordwise” cut from leading edge to trailing edge, just as we 
could in the case without taper. Note that if the solution is obtained along a curved arc like those shown in Fig- 
ure 4.3.12a, it can be transformed to apply to any other cut across the wing (a rib cut, for example), by use of 
the similarity assumption and the power law for (Ca). 

Solutions to the 2.5D boundary-layer equations are almost as simple to compute as 2D solutions, but they 
can predict the effects of sweep and taper on the pressure-driven cross-flow with reasonable accuracy. In Sec- 
tion 8.6.2, we'll compare such predictions with those made with the full 3D equations. 


4.4 Transition and Turbulence 


In Section 3.7, we discussed in physical terms how turbulence affects the development of a flow and how the 
effects can be modeled in calculations (turbulence modeling). So far in this chapter we've taken a quick look 
at a turbulent mean-velocity profile (Figure 4.1.4b) and we've seen how turbulent skin friction on a flat plate 
varies with Reynolds number (Figures 4.3.1 and 4.3.2). In this section, we'll look at how a laminar boundary 
layer transitions to turbulent, and then at the details of the turbulent boundary layer. 


4.4.1 Boundary-Layer Transition 


Laminar-to-turbulent transition in a boundary layer can involve a sequence of possible stages, as illustrated in 
Figure 4.4.1. Not all of these stages are always present, and discussions in the literature often refer to two pro- 
totypical paths to transition: (1) Natural transition, 11 which small disturbances grow larger through dynamic 
instabilities, eventually resulting in a breakdown into turbulence and (11) Bypass transition, in which some large 
disturbance leads directly to turbulence, “bypassing” the process of instability growth and breakdown. 


Figure 4.4.1 Possible stages in the laminar-to-turbulent transition process in a boundary layer. From White 
(1991). Used with permission of McGraw-Hill Companies 
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For the small-disturbance-growth phase of natural transition to be observable, the disturbance environment 
(freestream turbulence and acoustic levels) must be sufficiently quiet, and the surface must be smooth enough 
so that its interaction with the outer flow doesn't produce large disturbances. Of course, there are middle ranges 
of disturbance environments and surface quality in which the distinction between the two types of transition is 


blurred. And, of course, for transition to happen at all, conditions must be such that a turbulent boundary layer 
can be sustained downstream. This means that the Reynolds number must be high enough, and an extremely 
favorable pressure gradient cannot be present. 

There is no quantitative theory for bypass transition, other than direct numerical simulation (DNS), which is 
almost never practical. Prediction of bypass transition is therefore practically always empirical. One application 
is the sizing of artificial boundary-layer tripping devices (trips), which are used in wind-tunnel testing to pre- 
vent excessive runs of laminar flow, given the low Reynolds numbers at which tests are usually run, but rarely 
used on actual flight vehicles. Empirical criteria for sizing trips can be found in Braslow and Knox (1958). 

Predicting natural transition is important in the design of many types of flight vehicles. In Section 7.4.6, we'll 
look at laminar low-drag airfoils, and in Section 8.6.4, we'll consider laminar flow on swept wings. In such ap- 
plications, theoretical calculations are now routinely used, but the calculations don't model the entire process, 
so that prediction must still rely heavily on empirical correlations. 

The available theoretical framework for natural transition consists of receptivity theory, which models the 
process by which small environmental disturbances provide the initial amplitude for the growth of instabilities, 
linear stability theory, which predicts the relative rate of growth of the instabilities, once started, and nonlinear 
stability theory, or the theory of secondary instabilities, which extends the growth prediction into the nonlinear 
range. There is no quantitative theory for the final breakdown into turbulence. 

Linear stability theory evolved in two major stages. In the earliest inviscid stability theories, realistic laminar- 
boundary-layer velocity profiles were assumed, but the flow was treated as inviscid for purposes of calculat- 
ing the stability. The inviscid theory predicts that a laminar boundary layer is stable unless the velocity pro- 
file has an inflection point of the type produced by an adverse pressure gradient, as in Figure 4.1.5a. So flows 
with favorable pressure gradients are stable, flows with adverse pressure gradients are unstable, and flat-plate 
flow is neutrally stable, according to the inviscid theory. The later viscous stability theory predicts that viscosity 
stabilizes inviscidly unstable flows at very low Reynolds numbers, destabilizes inviscidly stable flows over a 
middle range of Reynolds numbers, and has a vanishing effect on stability at very high Reynolds numbers. The 
destabilizing effect of viscosity is crucial in the disturbance growth that leads to transition, so practical trans- 
ition prediction has always relied on the viscous stability theory. 

The e” method was the first successful prediction method for natural transition, and elaborations on it are 
still the state of the art for practical prediction. In the e” method, calculations are made only for the viscous 
linear-stability-theory phase of the disturbance growth, and transition is predicted to take place when the calcu- 
lated amplitude ratio reaches a predetermined threshold value of e”. In the early development of the method, the 
single value n= 9 was thought to be a good compromise for all flows. The resulting “e” method” was reasonably 
successful for 2D low-speed flows, but was much less so when applied later to more-complicated flows. The 
current state of the art is called variable n-factor, in which a variable value of 7 is used to account for the initial 
receptivity phase, and for the final non-linear growth and breakdown (see Crouch, 2008). Receptivity theory is 
not used directly for any calculations, but it does provide guidance in the determination of the empirical cor- 
relations that determine n. For example, receptivity theory suggests that the effects of surface roughness should 
be correlated with the roughness size normalized by boundary-layer thickness, not by a roughness Reynolds 
number. In the variable n-factor method, receptivity plays this important secondary role, while only the linear 
stability theory is used directly in the calculations. 

In the viscous linear stability theory, small-amplitude velocity and pressure disturbances (u’, v’, w’, p’) are 
assumed to be superimposed on a base steady flow with a known velocity profile. This assumption is substitu- 
ted into the NS equations, terms higher than first order in the disturbances are dropped, and terms containing 
only the base flow are subtracted out, given that the base flow is assumed to satisfy the equations. The resulting 
small-disturbance equations are homogeneous and are linear in the disturbances. At the usual kinds of boundar- 
ies (a solid wall with a no-slip condition or a freestream in which disturbances are assumed to die out far away), 


the boundary conditions on the disturbance velocities are also homogeneous. The problem is thus an eigenvalue 
problem, which will be satisfied by some sequence of eigenfunctions. 


There are several different ways in which the disturbance equations have been simplified and used. The dis- 
turbance pressure is generally eliminated, reducing the number of equations. One of the steps is generally to 
assume that the disturbance velocities can be decomposed into a series of normal modes that are oscillatory in 
directions parallel to the surface and in time, leaving only the dependence on y to be determined, for example: 
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This reduces the problem to that of finding combinations of eigenvalues (spatial wave numbers a and £, and 
temporal frequency w) for which physically meaningful solutions exist, and solving for corresponding eigen- 
functions v(y) that satisfy the equations. Assuming that growth takes place only in time (real a and B, complex 
w) leads to the temporal stability problem, while assuming growth takes place in space (complex a and f, real 
w) leads to the spatial stability problem. We can simplify the spatial stability problem further by lining the x dir- 
ection up with the direction in which disturbance growth takes place, so that there is no growth in the z direction 
(B is real), thus reducing the number of parameters in the problem. Now @ and £ (both real) are the only remain- 
ing free parameters that specify a particular disturbance mode. Once o and f are specified, a is constrained by 
the eigenvalue problem, in that meaningful solutions exist only for particular values. Once o is determined, the 
real part defines the wavelength of the disturbance, and the imaginary part is the spatial growth rate. 

Numerical solutions to the eigenvalue problem are used to determine a and thus the wavelength and the 
damping or growth rate of a particular disturbance. Because the theory is linear, the rate predicted is a relative 
rate, proportional to the current disturbance amplitude. Disturbances that are damped are referred to as stable, 
those that are amplified are referred to as unstable, and those on the boundary between the two are neutral. At 
a sufficiently low Reynolds number, all disturbances are stable. As Reynolds number increases, a critical Reyn- 
olds number is reached at which one particular disturbance crosses the neutral boundary from stable to unstable. 
As Reynolds number increases further, disturbances over a wider range of frequencies become unstable, and 
growth rates increase for the unstable disturbances. 

For boundary layers, three basic types of potentially unstable modes (Figure 4.4.2) arise from the theory: 

1. Tollmien-Schlichting (TS) waves, in which the velocity disturbances are “2D” in the direction perpen- 
dicular to the wave fronts. In 2D incompressible flat-plate flow, the fastest-growing modes propagate in 
the flow direction, while compressibility, pressure gradients, or 3D effects can bring oblique modes into 
play. A streamwise mode in 2D flow 1s illustrated in Figure 4.4.2a. 


2. Cross-flow (CF) instabilities, usually in the form of stationary disturbance vortices, as illustrated in 
Figure 4.4.2b, which can occur in boundary layers with moderately strong cross-flow velocities. As pure 
disturbances, CF vortices are counter-rotating, but when the background mean vorticity of the boundary 
layer 1s superimposed on them, they appear to be co-rotating. Traveling cross-flow disturbances can be 
significant, but only in wind-tunnel environments with high turbulence levels. 


3. Taylor-Goertler instability, counter-rotating streamwise vortices, which can occur in boundary layers 
on concave surfaces, as illustrated in Figure 4.4.2c. 


Figure 4.4.2 The three basic types of potentially unstable disturbance modes (see Reed and Saric, 1989). (a) 
Tollmien-Schlichting waves (2D disturbances). A case of streamwise propagation in a 2D flow is illustrated 
here in terms of velocity profiles and the streamline pattern. From White, (1991). Used with permission of 
McGraw-Hill Companies. (b) Cross-flow instability (stationary disturbance vortices, shown as a schematic 
streamline pattern in a plane perpendicular to the axes of the vortices). From Reed and Saric, (1989). Used 
with permission of Annual Review of Fluid Mechanics. (c) Zaylor-Goertler instability (streamwise vortices on 


concave surfaces). 
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More than one mode may coexist in a given flow, and when their amplitudes are small, they grow independ- 
ently of each other, according to the linear theory. Often, one or the other grows faster and leads to transition, 
as if the other mode hadn't existed. It is also conceivable that if two modes are of comparable strength when 
they reach the nonlinear range, that nonlinear interactions between them can play a role in transition. Figure 
2.1c shows a smoke-flow visualization of the flow past a rotating body of revolution in which signs of both TS 
waves and vortices due to CF instability can be seen. 

Real instability growth in an otherwise steady boundary layer is spatial, not temporal. But the spatial stability 
problem is harder to solve than the temporal problem, and in the early decades of stability research, only the 
temporal problem was dealt with. This turns out to have been forgivable, since the two problems lead to the 
same result at the neutral boundary, and early work concentrated only on determining the neutral boundaries for 


various boundary-layer velocity profiles. Later transition-prediction methods that depend on calculated growth 
rates generally use solutions to the spatial problem. 

Another simplification is the assumption of parallel flow, in which the local variation of the boundary-layer 
velocity profiles in directions along the surface 1s ignored, and the base flow is assumed to depend only on y. 
Assuming 2D parallel flow and 2D disturbances, and eliminating u’ from the equations, results in the classic 
ODE known as the Orr-Sommerfeld equation, which was the basis of much of the early work. Details are giv- 
en by Schlichting (1979) and White (1991). More recently, methods have been devised for taking into account 
non-parallel-flow effects, which are found to increase the predicted instability. But practical transition predic- 
tion still often uses the parallel-flow assumption and accounts for the difference through calibration. 

The stability or instability of a laminar boundary layer, and the growth rates when there are unstable disturb- 
ances, depend both on the Reynolds number and on the shape of the velocity profile. Stability is thus strongly 
dependent on the pressure gradient, as we saw above in our discussion of the inviscid and viscous stability the- 
ories, and can also be affected by distributed surface suction. In the viscous world, a favorable pressure gradient 
reduces or eliminates TS instability, but if the favorable gradient leads to stronger cross-flow, it increases CF 
instability (growth of stationary disturbance vortices). An adverse gradient usually results in such strong TS in- 
stability that CF instability is not the critical issue. Distributed surface suction changes velocity-profile shapes 
so as to reduce both TS and CF instabilities. 

Figure 4.4.3 schematically illustrates how TS instability varies with Reynolds number and profile shape. 
Neutral stability boundaries are shown in terms of disturbance frequency versus displacement-thickness Reyn- 
olds number for favorable, zero (flat-plate) and adverse pressure gradients. Each neutral curve has a vertical tan- 
gent at its particular critical Reynolds number, below which all disturbances are stable. Above the critical Reyn- 
olds number, the range of unstable disturbance frequencies at first widens to both lower and higher frequencies. 
However, the high-frequency branch of each curve quickly levels off and comes down, and the high frequencies 
become stable again. The behavior of all three curves at high Reynolds numbers approaches the predictions of 
the inviscid stability theory. The favorable-gradient curve crosses the horizontal axis and disappears, reflecting 
inviscid stability. The flat-plate curve approaches the horizontal axis asymptotically, reflecting inviscid neutral 
stability. Only the adverse-pressure-gradient case retains a range of unstable disturbances at very high Reynolds 
numbers, in keeping with its inviscid instability. 


Figure 4.4.3 Schematic illustration of neutral stability boundaries for 2D (TS) disturbances in laminar bound- 
ary layers with favorable, zero, and adverse pressure gradients 
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Disturbance growth of the kind predicted by linear stability theory is well established as part of the natural- 
transition process in boundary layers and planar 2D channels, but not in all “thin” viscous shear flows. Flows 


in circular pipes are unconditionally stable according to the theory, and yet there is still an upper limit to the 
Reynolds number at which laminar flow can exist. Why the theory doesn't work for pipe flow is not understood. 

Predictions of the stability theory for a flat-plate boundary layer (the neutral-stability boundary and the fre- 
quencies of amplified disturbances) were convincingly verified in a wind tunnel with very low turbulence levels 
at the U.S. Bureau of Standards in about 1940, but the results were not published to the international aero- 
dynamics community until after World War II (Schubauer and Skramstad, 1947). The e” transition-prediction 
method was first developed in the early 1950s (Smith and Gamberoni, 1956; Smith, 1981), and it was not until 
the 1970s that computers and solution algorithms progressed to the point that the spatial stability problem could 
be routinely solved for arbitrary pressure distributions. 

The e” method involves marching in the direction in which disturbance growth is assumed to take place (the 
flow direction in 2D flows), and at each station solving the stability equations for a range of different modes to 
determine the growth rates. Then the growth rates are integrated in the growth direction to determine the amp- 
litude ratio A/Ao (the ratio of the disturbance amplitude to the initial amplitude at the station where the first 
instability appeared). The initial and final amplitudes themselves are not explicitly determined, only the ratio. 
This is usually expressed as In(A/Ao), which is just n in the expression A/Ao =e”. 

The integration is carried out with differing degrees of rigor in different versions of the method. The rigorous 
choice 1s to do the integration for only one mode at a time as it evolves downstream, and to repeat the integra- 
tion for a number of different modes covering the range of interest so as to determine at each station which one 
has produced the largest amplitude ratio. This generally means covering a two-parameter space, varying both B 
and @, which is laborious. A shortcut 1s to repeat the integration only for multiple values of w, using the value 
of B at each station that gives the most-rapid growth. The choice of B can be simple for incompressible flow 
because 8 = 0 (wave fronts perpendicular to the flow) usually leads to the fastest growth, but in transonic flow 
a range of B must generally be searched. Arnal and Casalis (2000) call this the “envelope method” because the 
resulting curve of 1 versus x is the envelope of multiple curves for different values of w. A further shortcut is to 
use the value of that yields the maximum amplification rate at each local station as input to the integration. I 
would call this the “locally most amplified” method. For general flows, these shortcuts have no physical basis, 
but they save computing time because they don't require generating as many stability solutions. Of course such 
shortcuts produce higher v factors at transition than a rigorous integration would. While this is compensated for 
by calibration, the lower physical fidelity of the shortcut methods should presumably result in greater scatter in 
correlations with experiments. 

Whether shortcuts are used or not, an e" calculation determines a value of n at each station that is supposed 
to reflect the cumulative growth of the most-amplified disturbance. Transition is predicted to take place when 
n passes a threshold value determined by calibration against similar flows for which the transition location is 
known. The threshold value of n thus accounts empirically for the receptivity phase and the final nonlinear 
growth and breakdown. In principle, it can account for such factors in the initial-disturbance environment as 
freestream turbulence level and surface imperfections, but only if the calibration database includes flows in 
which these things were varied. In the variable-n-factor method, it 1s possible to base the calibration on results 
from more than one type of experiment. For example, the primary calibration would come from a flow similar 
to the target application, and data from other experiments could be used to adjust for other factors, such as sur- 
face roughness. 


In 3D flows with cross-flow, TS and CF disturbances are treated as independent, in keeping with the linearity 
of the theory. There are several strategies for keeping track of the range of disturbance modes of both types 
(see Arnal and Casalis, 2000). The one that seems to produce the tightest correlations with experiments tracks 
separate n factors for TS and CF, and transition 1s predicted by whichever crosses its particular threshold first. 
Unless the surface is extremely (impractically) smooth, it is important to use a variable n-factor to account for 
the effects of surface roughness on the CF disturbances (see Crouch and Ng, 2000; Crouch, 2008). Data are 


available for flows with a wide variety of disturbance-growth progressions, leading to transition by TS growth 
in some cases and CF growth in others. For flows with low freestream turbulence and very smooth surfaces, 
typical threshold values are e® for TS transition and e° for CF. A formula relating ncF to roughness height is 
given by Crouch and Ng (2000), and one relating nts to freestream turbulence level is given by Mack (1977). 

Methods have been developed that carry out the e” method by interpolating tables of precalculated growth 
rates instead of calculating solutions to the stability equations, saving orders of magnitude in computing time. 
For 2D flows with TS transition only, the ISES and MSES codes (Drela and Giles, 1987; Drela, 1993) use a 
two-parameter table-lookup scheme that returns growth rates as functions of the boundary-layer parameters Ro 
and H. At each local station, the fastest-growing disturbance is used (as a function of @), so that this is analog- 
ous to a “locally most amplified” method. A similar scheme that handles both TS and CF was developed by 
Crouch, Crouch, and Ng (2001). The TS growth rates used in this method are maximized locally with regard 
to B, but not with regard to wm, and multiple integrations are carried out for multiple values of o. It is therefore 
analogous to an “envelope” method. 

The final stage of nonlinear growth and breakdown is complex, and the details probably vary depending on 
the nature of the disturbances that feed it, and on local conditions such as pressure gradient and freestream tur- 
bulence. In a flat-plate boundary layer, Emmons spots like the one shown in Figure 2.1d form randomly and 
grow wider, merging together until the entire width of the surface is covered by turbulent flow. 


4.4.2 Turbulent Boundary Layers 


The turbulence in a fully turbulent boundary layer is inherently 3D and is random in space and time. But being 
random doesn't mean it's a featureless jumble of random swirls. Boundary-layer turbulence contains a variety 
of coherent structures over a range of length and time scales, from small structures that are characteristic of 
the region close to the wall to larger structures that span more of the full thickness of the boundary layer. Such 
structures form at essentially random locations over the surface and at random times, and even for structures 
of a particular type, there is wide variability in detailed evolution from one instance to another. But structures 
with distinctive features show up with surprising regularity. Many of their characteristics follow fixed statistical 
distributions. 

Statistical distributions (probability distribution functions) for flow quantities within the structures are 
anchored 1n their vertical positions by the wall, and the vertical extent of the larger ones scales roughly with the 
boundary-layer thickness. In the horizontal directions there are probability distribution functions that move with 
the structures and reflect their typical shapes and sizes. And horizontal positions of structures relative to each 
other aren't completely random. There are two-point correlations that reflect “typical distances” and “typical 
cycle times,” as well as “typical life spans” (paraphrased from Spalart, 2009, private communication). 

Figure 2.le shows flow visualizations of large eddies that appear in the outer part of the layer and illustrates 
how they differ in favorable and adverse pressure gradients. In these photos, the turbulence is made visible by 
smoke that was introduced through a slot in the wall upstream. The smoke has been transported away from the 
wall mostly by turbulent convection and has been diffused relatively little by molecular action, and it there- 
fore tends to “mark” the turbulent, vortical regions. The regions empty of smoke are essentially nonturbulent 
and nearly irrotational, and we see that such regions can penetrate deep into the boundary layer. The boundary 
between turbulent and nonturbulent flow is highly convoluted but surprisingly sharp. As the structures move 
along with the general flow, the turbulence seen at fixed points in space in the outer part of the layer 1s intermit- 
tent in time. These large structures tend to scale with 6, the overall thickness of the layer. 

The main smaller structures that appear in the inner part of the layer are associated with streamwise vortices. 
A pair of counter-rotating vortices is often linked in the form of a horseshoe, as shown in Figure 4.4.4. Where 


the flow between two vortices is downward toward the wall, higher speed fluid 1s carried toward the wall in 
what is called a sweep, and where the flow between two vortices 1s upward, lower speed fluid is carried away 
from the wall in a burst. The resulting streaky structure very close to the wall was shown in Figure 2.1f. 


Figure 4.4.4 Pair of streamwise vortices forming a horseshoe pair in the inner part of a turbulent boundary lay- 
er 
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These turbulence structures have been studied intensively, both experimentally and by DNS. Comprehensive 
surveys are to be found in Robinson (1991) and Moin and Mahesh (1998). Surprisingly, this knowledge of or- 
ganized structures has had very little effect on the practical prediction of turbulent boundary-layer flows. The 
reasons for this have to do with the nature of the turbulence modeling problem, which we've already discussed 
in Section 3.7. In the RANS equations (Reynolds-averaged Navier-Stokes) governing the mean flowfield, the 
turbulent transport of momentum is represented by the Reynolds stresses, which are local (single-point) aver- 
ages of fluctuating velocity components. Of course, the Reynolds stresses in the real world reflect contributions 
from the organized structures, but for modeling purposes, incorporating information about organized structures 
into single-point averages 1s difficult. As a result, such information that has found its way into the turbulence 
models has been of a very low-order variety, such as the idea that the motions that matter have length scales that 
decrease with decreasing distance from the wall, which was around long before we gained our current aware- 
ness of coherent structures. So as interesting as organized turbulence structures are, our detailed knowledge of 
them has not had much influence on engineering practice. Of more practical interest are the mean velocity pro- 
files and the strategies that turbulence models have used for predicting them. 

Typical mean velocity profiles in 2D turbulent boundary layers are illustrated in Figure 4.4.5. The most strik- 
ing feature of turbulent profiles compared to laminar profiles is the very large velocity gradient at the wall. The 
highest gradient occupies a very thin region adjacent to the wall, too thin to be resolved on the scale of these 
plots. It is called the viscous sublayer because the shear stress there is effectively all viscous, with practically 
no contribution from Reynolds stress (see Section 3.7 for a discussion of Reynolds stress). But this lack of 
turbulent shear stress doesn't mean there is no turbulence in the sublayer. The flow in the sublayer is actually 
quite turbulent, with velocity fluctuations that are not small relative to the mean velocity. The Reynolds stress 
is nearly zero throughout a layer of finite thickness, not because the turbulence has disappeared, but because of 
the way the nearby wall constrains the turbulence, as we'll see next. 


Figure 4.4.5 Typical mean-velocity profiles in 2D turbulent boundary layers in various pressure gradients 
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So the existence of the sublayer gives a distinctive appearance to turbulent mean velocity profiles. It also 
plays a vital role in the greater separation resistance of turbulent boundary layers relative to laminar, an issue 
we discussed in Section 4.1.4. There we noted that for a boundary layer to resist separation, fluid parcels near 
the wall must experience a favorable net viscous force mesVuugied with a positive shear-stress gradient Ot/Oy. 
Ina eons nepren ty laminar boundary layer, 0t/Oy = LO *uley?, and the favorable viscous force thus requires 
positive er u/dy”. In the laminar case, the force is typically weak because pu is small. In a turbulent boundary 
layer, the rapid 1 ea in the turbulent eddy viscosity outside the sublayer can provide a strong positive Ot/ 
Oy even though er u/dy* is strongly negative there. One way to think of the turbulent boundary layer's greater 
resistance to separation is to imagine the strongly mixing part of the flow pulling the top of the sublayer along, 
keeping the velocity-profile slope and the skin friction positive. 

In much of the earlier literature, the sublayer was called the “laminar sublayer,” which wrongly implied that 
the flow in the sublayer is laminar. This unfortunate terminology has led to some misunderstandings, for ex- 
ample, in discussions of the effects of surface roughness, as we'll see in Section 6.1.8. It is clear from turbulence 
measurements and from flow visualization of the streaky structure in the sublayer in Figure 2.1f that the flow 
in the sublayer is anything but laminar. A closely related bit of misleading older terminology refers to the layer 
just outside the sublayer as the “buffer layer,” implying that 1t somehow buffers “laminar” flow in the sublayer 
against disturbances from outside. 

Many discussions of how the Reynolds stress decreases as the wall is approached use the term “damping” 
in one way or another. For example, Van Driest (1956) developed a successful and subsequently much-used 
formula for the eddy viscosity that includes a “damping factor” (see Section 3.7 for a discussion of the concept 
of eddy viscosity), and Mellor and Gibson (1966) speculated that the sublayer and part of the “buffer layer” 
constitute a region in which “all disturbances are damped.” The term is not entirely inappropriate, in that the 
combination of viscosity and the no-slip condition at the wall does have a strong damping effect on the turbu- 
lence in this region, but it is potentially misleading because it suggests that the reduction of the Reynolds stress 
is due simply to a general reduction in turbulence level. The way that the Reynolds stress decreases as the sub- 
layer 1s approached 1s actually more complicated than that. 


Fiaal 
The Reynolds stress of interest in a 2D boundary layer ts Pure” and thus dependent to the same degree on 


both v and wu velocity fluctuations. DNS simulations by Spalart (1988) show that the rms w level ss to zero 
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linearly with y, pretty much across the entire sublayer. If v did the same, we might expect that would 
go to zero as Vv, and that there wouldn't be much of a seemingly finite sublayer. But rms v goes to zero much 


faster (as y Ay than u, and mary goes to zero as y. So the Reynolds stress disappears faster than rms u’ 1n the 
sublayer and buffer layer because rms v “disappears faster. 

So why does rms v disappear faster than uw near the wall? Both are constrained to be zero at the wall, so the 
answer isn't just in the boundary conditions. At least part of the explanation has to do with the length scales of 
the turbulence structures and how viscosity affects motions with different scales. If / is the typical horizontal 
(x or z) distance over which instantaneous uw, v, and w vary, and given that uw, vy, and w obey the continuity 
equation, we can make the following loose statement about how the faacnies of ,, and v are related: 
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So for motions with length scales much larger than y, v will tend to be much smaller than either w or w, and 
there won't be much contribution to the Reynolds stress. The “active” motions (the motions that contribute to 
the Reynolds stress) tend to have length scales proportional to y, with v of the same order as uw. But viscous 
damping has an increasing effect as / decreases, and as we enter the buffer layer the intensity of motions with 
length scale / ~ y decreases. If we think of viscosity as impesute a lower cutoff on / (an oversimplification, but 
with a grain of truth to it), we would expect v to decrease as y while u decreases as y, as was observed in the 
DNS simulations. 

Outside the viscous sublayer, we find another distinctive feature of turbulent velocity profiles: the pro- 
nounced “corner,” in which the second derivative of the velocity has a large negative value. In this region, the 
eddy viscosity increases rapidly with increasing y, and the velocity gradient diminishes accordingly, to the much 
lower levels typical of the outer part of the layer. 

An interesting difference between turbulent and laminar profiles is in the appearance of inflection points in 
adverse pressure gradients. In our discussion in connection with Equation 4.2.7, we noted that in an adverse 
pressure gradient 6" u/ey? must be positive at the wall and that a laminar profile must therefore have an inflec- 
tion point somewhere away from the wall. The same applies to eo protiles, but the inflection isn't gen- 
erally visible in profile plots like Figure 4.4.5. Deep in the sublayer, er wey” must be positive, but just outside 
the sublayer it is strongly negative, and the location of the inflection point must be in the buffer layer, too close 
to the wall to be visible in ordinary profile plots. But inflection can also appear farther out in the profile. As 
indicated in Figure 4.4.5, a region of positive second derivative can appear in the outer portion of a 2D turbu- 
lent velocity profile if the pressure gradient is very strongly adverse or has been adverse over a long distance. 
Because the velocity profile tends to retain a pronounced negative second derivative both in the “corner” and 
near the edge of the boundary layer, a region of positive second derivative in the middle generally requires two 
inflections, as seen for the strongest adverse-pressure-gradient case in Figure 4.4.5. 

Many of the available turbulence models can predict these distinctive features of turbulent velocity profiles, 
but before CFD became commonplace an understanding had already been developed based on physical reason- 
ing, dimensional analysis, and simplified models. The physical reasoning starts with the idea that the boundary 
layer can be divided into an inner region and an outer region, and that in each region the velocity profile de- 
pends on the flow conditions in a distinctive way. 


At the very bottom of a laminar boundary layer, the velocity profile 1s nearly linear and depends only on 
the wall shear stress and the viscosity, and not directly on the pressure gradient or on anything that happens in 
the outer part of the layer. In a turbulent boundary layer, the same dependence applies reasonably well to the 
wall region, which spans the viscous sublayer and the “corner” region of the velocity profile. To express this 
dependence in dimensionless form, we first define dimensionless wall variables: 
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is called the friction velocity, and a unit of y" serves as a length scale often called a wall unit. The dependence 
we're assuming, that is, that the velocity profile depends only on the shear stress and the viscosity, then takes 
the form of a basic functional relationship called the /aw of the wall: 
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In the viscous sublayer, the velocity profile is nearly linear, as it effectively is at the bottom of a laminar 
boundary layer, and f takes an especially simple form: 
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Farther out in the wall region, turbulent shear stress comes to dominate relative to the viscous shear stress. 
But the velocity profile still depends on the viscosity because the presence of the viscous sublayer has a large ef- 
fect on the profile shape well into the part of the region where the turbulent shear stress has taken over. Shortly, 
we'll determine what functional form f must take in the outer part of the inner region. 

In the outer region, we assume that momentum transport is entirely turbulent and that the velocity profile 
has no direct dependence on viscosity. The velocity profile in the outer region goes to Ue at the outer edge of 
the boundary layer and must merge with the wall-region velocity profile somewhere in the neighborhood of the 
“corner” of the profile. The remaining physical arguments regarding the outer-region velocity profile then go 
as follows. The outer region is like a boundary layer with a “slip velocity” at the wall, the slip velocity just 
being the velocity provided by the wall-region profile. The processes (primarily large eddies) that determine the 
velocity profile in the outer region don't “know” how much slip velocity is being provided by the wall-region 
profile, only how much shear stress is being transmitted, which 1s assumed to at least scale with the wall shear. 
Because the flow in the outer region doesn't “know” what its velocity is relative to the wall, the velocity that 
matters is the velocity defect, Ue — u, and the outer region 1s therefore often called the defect layer. 

The velocity-defect profile 1s assumed to depend on the overall thickness 6 of the boundary layer, the wall 
shear stress, the pressure gradient, and upstream history, but not directly on the viscosity. Expressing this in di- 
mensionless form, we have 
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In a flat-plate flow, F depends only on y/d because there is no pressure gradient, and upstream history effects 
are negligible because the shape of the profile is changing slowly. Equation 4.4.8 is then known as the velocity- 
defect law, and data from many experiments on both smooth and rough walls closely define a single function F, 
provided Re¢ is above about 6000. Then there is the special class of flows that we've already noted known as the 
equilibrium turbulent boundary layers, in which the pressure-gradient parameter B defined by Equation 4.3.8 
is constant in x. In an equilibrium turbulent boundary layer, the function F is also constant in x, but with a dif- 
ferent function for each value of B, as we discussed previously in connection with Equation 4.3.9. Equilibrium 
boundary layers were first described by Clauser (1954, 1956), and further theoretical issues were investigated 
by Mellor and Gibson (1966). For more general cases, F is not universal, but the important thing for our pur- 
poses is that the defect profile is a function of y/o and not directly a function of viscosity. 

So we have inner and outer regions in which the velocity profile has different kinds of functional dependence. 
The final physical argument in this analysis is that there is a significant overlap region in which both kinds of 
dependence (Equations 4.4.6 and 4.4.8 hold. In the overlap region, we then have 
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The velocity profile thus is given by a function f of y/d multiplied by a constant, and at the same time by the 
function F(y/d) with a constant added to it. As first pointed out by Millikan (1938) (see also White, 1991), the 
only possible functional form is logarithmic. The resulting /ogarithmic law of the wall for the overlap portion 
of the inner region is usually written 


ut = » inty? }+B. 
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where k 1s called the von Karman constant and has been assigned values between about 0.38 and 0.44 depending 
on what experimental data are used. The constant B also varies depending on what value of « was chosen, 
among other things. It is typically about 4.9 for a smooth wall and is smaller when the wall is rough, as we'll 
see in Section 6.1.8. The logarithmic overlap typically extends from y ~ 0 out to about 0.156. At very high 
Reynolds numbers, the sublayer is very thin, and the log profile occupies nearly the whole region from the wall 
to 0.156, while at the lowest Reynolds numbers at which turbulent flow can be sustained, the log profile disap- 
pears. 





Outside the viscous sublayer, the velocity profile blends smoothly from the linear sublayer profile to the log 
profile. Spalding (1961) devised a composite formula that fits the entire law-of-the-wall region: 
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which is illustrated in Figure 4.4.6. There are not many reliable sets of experimental data defining the profile in 
this region, but this formula fits the available data well (see White, 1991). 





Figure 4.4.6 Velocity profile in the law-of-the-wall region of a turbulent boundary layer 
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Why does it matter that a portion of the typical turbulent velocity profile is logarithmic? Well, there are at 
least two practical applications for this bit of knowledge. One is that surface roughness causes a shift in the log- 
arithmic profile, and the shift provides a practical way of characterizing the effect on the skin friction, making 
it possible to take empirical roughness data from one flow situation and apply them to other flow situations, as 
we'll see in Section 6.1.8. Another is that the logarithmic profile provides a way to determine Cr from experi- 
mental measurements of the mean velocity profile. The viscous sublayer is typically so thin that measurements 
can't be made close enough to the wall to determine Cu/Oy directly, but the logarithmic part of the profile can 
often be measured reasonably accurately. So in principle we should be able to determine the value of Cr that 
makes the measured profile fit the logarithmic law best. Clauser (1956) proposed one very effective way of do- 
ing this. First, recast Equation 4.4.10 in terms of Cf and ue: 
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a form of the log law in which increasing Cf increases both the slope and the intercept. Then all we have to do 
is to plot Equation 4.4.12 for an array of closely-spaced values of C¢ and plot the measured profile in terms of 
u/Ue versus log(y ue/v) for comparison. Figure 4.4.7 illustrates how this works. The portion of the experimental 
profile that is logarithmic is easily identified, and the curve it lines up with best provides an estimate of Cr. This 
kind of plot is often called a Clauser plot. 


Figure 4.4.7 A “Clauser plot” of an experimental turbulent velocity profile compared with curves plotted from 
Equation 4.4.12, showing how an experimental estimate of Cfcan be made (Cf~ 0.0020 in this case). (Data 
from McLean, 1970; for a case about halfway between the onset of an adverse pressure gradient and separa- 
tion) 
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The velocity-defect form in Equation 4.4.8 isn't very convenient for general flows because the function F 
isn't universal. Coles (1956) found another way of representing the outer-region velocity profile, in which a 
single universal function suffices. Instead of referencing the velocity in the outer region to ue, he referenced it 
to the outward continuation of the inner-region profile, the law of the wall. In this formulation, the entire profile 
can be expressed in a single formula: 


rd = f(y7) + Ww (=), 


where f is the law of the wall and can be expressed as a combination of Equations 4.4.7 and 4.4.10 or in terms 
of Equation 4.4.11, and the function W is well represented by 
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The constant IT is determined by the difference between the law of the wall and the actual velocity Ue at y 
= 6. The function W resembles the velocity profile in half of a turbulent wake and is therefore called the wake 
function, and Equation 4.4.13 is called the law of the wake. Equation 4.4.13 is compared with the same meas- 
ured turbulent velocity profile used in Figure 4.4.7, first in terms of inner-region variables in Figure 4.4.8a, and 
then in terms of u/Ue versus y/d in Figure 4.4.8b. This level of agreement with data is typical. 





Figure 4.4.8 A typical experimental velocity profile compared with Coles' law of the wake. Same experiment- 
al data as in Figure 4.4.7. 2 In terms of wall variables. (b) Conventional velocity profile 





The wake function W, shown in Figure 4.4.9, is “concave” (negative second derivative) in the bottom half 
and “convex” in the upper half, with an inflection point in the middle. Thus one component of the law-of-the- 
wake profile always has an inflection point, even when the pressure gradient isn't adverse. But the general con- 
vexity of the other component, the law of the wall, tends to keep the overall profile convex unless the adverse 
pressure gradient is strong or has been in force over a long distance. This can be seen in Figure 4.4.8b, in which 
the profile shown is in a moderately strong adverse gradient, at a station about half way from the onset of the 
pressure gradient to separation, and the profile is still convex, though just barely. 


Figure 4.4.9 Wake function W(y/d), defined by Equation 4.4.14, used in the law of the wake, Equation 4.4.13 
——— —- 





The use of the word “wake” in the law of the wake is odd in a way. Coles (1956) noted the similarity of the 
function W(y/d) to half of a wake velocity profile and proposed thinking of the outer part of a turbulent bound- 
ary layer as a half-wake. While a turbulent free wake and the outer region of a turbulent boundary layer are both 
regions of velocity deficit relative to ue, the “wake component” 2IT W/k« of the law of the wake is, according to 
Equation 4.4.13, a velocity excess relative to the law of the wall. The entire law-of-the-wake velocity profile, 
including the law of the wall component, is often not at all wake-like in appearance, as, for example, in Figure 
4.4.8b. 

The skin friction is much larger for a turbulent boundary layer than for a corresponding laminar flow, as we 
saw for flat-plate flow in Figures 4.3.1—4.3.3. The difference between turbulent flow and laminar flow is even 
more dramatic when we look at it in terms of the eddy viscosity ut compared with the molecular viscosity uw. 
Away from the wall in a turbulent boundary layer, the mean-velocity gradient quickly becomes much smaller 
than it is at the wall, while the shear-stress level tends to remain high. This situation reflects levels of eddy 
viscosity that are orders of magnitude higher than the molecular viscosity. In Figure 4.4.10, we plot the ratio 
ur/u for flat-plate boundary layers over a range of Reynolds numbers, according to the eddy-viscosity model of 
Mellor (1967). The pz/u distribution in the inner region, plotted versus oe is independent of Reynolds number, 
as shown in part (a). It is the linear-looking part of the p;/u distribution that produces the logarithmic part of 
the velocity profile. The outer region eddy viscosity is proportional to Ré* as seen in part (b), a relationship 
first suggested by Clauser (1956). In some models, for example, Cebeci and Smith (1974), the eddy viscosity 
decreases toward the outer edge of the boundary layer to account for the intermittency of the turbulence. Mellor 
and Gibson (1966) claim this makes little difference in the calculated mean-velocity profiles. 


Figure 4.4.10 Levels of eddy viscosity given by Mellor and Gibson's (1966) model, for flat-plate boundary 
layers over a range of Reynolds numbers. (a) The inner region, 1n wall region variables, in which the variation 
is independent of Reynolds number. (b) Over the entire boundary-layer thickness, where the inner-region vari- 
ation of (a) takes place too close to the wall to be visible 





In Section 4.1.2, we discussed the distinctive aspects of 3D boundary layers in general terms: shear- and 
pressure-driven cross flow, cross-flow velocity profiles, direction profiles, the nature of separation in 3D, and 
the effects of cross flow on the flow development and the displacement effect. An additional complication arises 


in turbulent 3D flows, and that is that the relationship between the mean-velocity gradient and the shear stress 
becomes more complex. 

One way of looking at this relationship 1s to think of the shear stresses and the velocity gradients as vectors 
(Tx, Tz) and (Cu/Oy, Ow/Oy) defining directions parallel to the surface. In 2D flows, these vectors are in the same 
direction, the direction of the entire flow, given by the vector (ue,0). In 3D, the shear stress and the velocity 
gradient line up only in laminar flow or in the viscous sublayer of a turbulent boundary layer. In the turbulent 
part of a 3D boundary layer, they are generally skewed at an angle relative to each other, with the shear-stress 
direction often lagging behind when the velocity-gradient direction changes, as is the case in the example shown 
in Figure 4.4.11. Van den Berg (1988) compiled a list of possible physical mechanisms for the skewing, but 
what the mechanisms actually are in typical 3D boundary-layer flows has not been resolved. 





Figure 4.4.11 Examples of the skewing that can take place between the shear-stress direction B; and the 
velocity-gradient direction Bg compared to the outer-flow direction B in a 3D turbulent boundary layer in an 
adverse pressure gradient. See Bradshaw and Pontikos (1985). (a) Data from van den Berg, (1988). Published 


by AGARD. (b) Data from Bradshaw and Pontikos, (1985). 
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The skewing means that the eddy viscosity has different values in the two directions, and it can have major 
effects on the development of the mean-velocity field. Many turbulence models ignore this, however, and as- 
sume that the eddy viscosity is isotropic (the same in all directions). We'll discuss this problem further in con- 
nection with the prediction of swept-wing flows in Section 8.6.2. 

In most applications, we are not interested in the unsteadiness associated with a turbulent boundary layer, and 
we tend to measure or attempt to predict only the time-averaged (mean) flowfield. But the pressure fluctuations 
at the surface are sometimes important, as, for example, in the generation of boundary-layer noise. These tend 
to have broad-band spatial and temporal spectra reflecting the range of scales in the boundary-layer turbulence. 
In an attached boundary layer, the lowest spatial and temporal frequencies tend to be those associated with the 
passage of large structures such as those shown in Figure 2.le. Boundary-layer separation from anywhere but a 
single, sharp trailing edge will give rise to lower frequencies associated with fluctuations of the instantaneous 
separation location and of the separated shear layer, and these fluctuations can be felt far upstream. Separation 
also increases the intensity of the surface pressure fluctuations, by something on the order of 10 dB, and reat- 
tachment of a turbulent shear layer increases them even more. 


4.5 Control and Prevention of Flow 
Separation 


As we noted in Section 4.1.4, delaying or preventing separation is important in many applications. In this sec- 
tion, we'll discuss some of the strategies that are used. 


4.5.1 Body Shaping and Pressure Distribution 


Separation from a smooth surface is generally related to a rapid or sustained rise in pressure, as we saw in Sec- 
tion 4.1.4. Thus a powerful means of delaying separation is through the shaping of the body so as to control 
the pressure rises. This usually means providing a generally streamlined shape, without sharp corners or large 
surface curvatures, and with a gradual closure at the back of the body to a sharp or almost sharp trailing edge or 
tail. If the closure is too abrupt, separation ahead of the tail will occur, as in Figure 4.1.2b or c. We'll consider the 
distinction between streamlined and bluff bodies further, as well as the underlying physics issues, in Sections 
5.2, 6.1.3, 6.1.5, and 6.1.6. Even for a streamlined body, some fairly subtle details of the pressure distribution 
can be important. We'll consider this issue in some detail in connection with the maximum lift of airfoils in Sec- 
tions 7.4.3 and 7.4.4. 

Separation in 3D can form complicated flow patterns such as the necklace vortex separation that can occur in 
a wing-body junction, as shown in Figure 4.5.1a,b. A fillet or strakelet can eliminate the separation at the lead- 
ing edge itself by easing the adverse pressure gradient and increasing the flow divergence ahead of the junction, 
as shown in Figure 4.5.1c. The change in the surface shear-stress pattern between (b) and (c) is dramatic. We 
should remember, however, that the surface shear-stress pattern tends to give an exaggerated impression of the 
change to the overall flowfield, as we pointed out in Section 4.1.2 and saw in Figure 4.1.11. Separations like the 
one in Figure 4.5.1b tend to be confined close to the body surface and to have very minor effects on the rest of 
the flow. 


Figure 4.5.1 3D separation in a wing-body junction. (a) Sketch of a necklace vortex separation in a junction 
without a fillet. (b) Oil-flow photo of necklace vortex separation in a horizontal-tail junction. (c) Oil-flow 
photo of same junction as in (b) with strakelet installed 
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4.5.2 Vortex Generators 


Generating streamwise vortices in the boundary layer upstream of where separation would otherwise occur can 
be an effective preventive measure. A vortex generator (VG) is usually a low-aspect-ratio vane with a planform 
ranging from delta to rectangular, attached perpendicular to the wall and placed at an angle of attack of about 
15° to the local flow, so as to produce a strong vortex. The basic types are illustrated in Figure 4.5.2. A conven- 
tional VG is usually sized about as high as the boundary layer is thick (h ~ 5), while a subboundary-layer vortex 
generator (SBVG) is much smaller (/ = 5). VGs are generally used in arrays and characterized by whether 
the vortices are all of the same sign (co-rotating) or of alternating signs (counter-rotating). In addition to vanes, 
various sorts of wedges, bumps, and scoops have been used to generate counter-rotating vortex pairs, but these 
devices are less common than vanes. 


Figure 4.5.2 Types of vortex generators used for delaying or preventing separation. (a) Scaling of VG plan- 
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A common explanation for the effectiveness of VGs is that they “enhance mixing.” We've seen that a turbu- 
lent boundary layer is more effective than a laminar one at resisting separation, and it seems logical that intro- 
ducing more mixing would make it even more effective. There is an element of truth to this idea, but it is an 
oversimplification, especially in the case of conventional large VGs. What a large VG does to the flow can be 
looked at either in terms of a large side force “pushing” the flow strongly to the side, or in terms of “induction” 
by a strong vortex. Either way, the result is a major rearrangement of the mean-velocity field, as illustrated in 
Figure 4.5.3 in terms of streamwise vorticity and velocity contours in a cross-plane downstream of the VGs. 


Figure 4.5.3 Rearrangement of the boundary-layer mean-velocity field by conventional VGs, shown in terms 
of streamwise vorticity contours and the u/ue = 0.99 streamwise velocity contour at several streamwise sta- 


tions. From Pauley and Eaton, 1988. Used with permission. (a) With counter rotating VGs. (b) With co-rotat- 
ing VGs 
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Counter-rotating VGs (Figure 4.5.2a, left side) produce bands of strong flow divergence (Figure 4.5.3a), 
where the boundary layer is thinned considerably, which increases its resistance to separation, alternating with 


bands of flow convergence, in which the boundary layer is thickened, and resistance to separation 1s presumably 
reduced. As the vortices “migrate” laterally, the bands of divergence widen, and the bands of convergence nar- 
row, until pairs of vortices converge on each other and begin to rise from the surface, as can be seen in Figure 
4.5.3a. As the vortices rise, they lift some of the low-energy air away from the wall, so that it contributes less to 
the tendency to separate. If an array of counter-rotating VGs is placed insufficiently far upstream of where sep- 
aration wants to occur, separation will occur in the convergence bands. If the array 1s placed too far upstream, 
the converged vortex pairs will rise too far and lose their effectiveness. The placement of an array, relative to its 
spacing, is thus a balancing act. And note that in counter-rotating arrays, the vanes are usually grouped in pairs, 
as shown in Figure 4.5.2c, such that vortex convergence and liftoff are delayed farther downstream than they 
would be with uniform spacing. 

Co-rotating VGs (Figure 4.5.2b, right side) also produce bands of divergence (thinning) and convergence 
(thickening), as in Figure 4.5.3b, but not nearly as pronounced as those produced by counter-rotation. 

Which type of array is more effective at preventing separation depends on several factors. Here is a brief 
summary of the pros and cons of the two types: 


¢ Counter-rotating: A counter-rotating array can be more effective over short distances because of the 
strong divergence it can produce, but vortex merging limits the distance over which it remains effective. 
The effectiveness of a counter-rotating array can be seriously degraded if the vortices in a pair are not 
of equal strength (Pearcey, 1961), and counter-rotating arrays are thus not very tolerant of flow direction 
changes that unbalance the angles of attack of the vanes. And in 3D flows with strong cross-flow in the 
boundary layer, the two vortices in a pair behave differently, as explained below, which tends to favor a 
co-rotating array. 

¢ Co-rotating: A co-rotating array will usually have lower nearfield effectiveness, but its effectiveness 
will persist farther downstream. And a co-rotating array 1s usually favored in cases with strong boundary- 
layer cross-flow. 


Strong boundary-layer cross-flow has two main effects (Shizawa and Eaton, 1992): The vortices from VGs 
decay two to three times as fast as they would in a comparable 2D flow, and vortices of opposite rotation behave 
differently, as we mentioned above. When the main vortex from the VG has the same rotation as the background 
cross-flow vorticity in the outer part of the boundary layer, the vortex tends to diffuse more rapidly, as shown 
in Figure 4.5.4a, than 1t does when the vortex rotates in the opposite direction, as in Figure 4.5.4b. So flow situ- 
ations with strong boundary-layer cross-flow favor co-rotating arrays with rotation as in Figure 4.5.4b. An easy 
rule to remember is that the force exerted by the VG should push the flow in the direction opposite to the dir- 
ection of the cross-flow. This is important in the application of VGs to swept wings. We'll discuss swept-wing 
boundary layers in some detail in Section 8.6.2. 


Figure 4.5.4 I[lustration of the effects of the direction of rotation of a vortex in a 3D boundary layer with 
strong cross flow, shown in terms of streamwise velocity contours. From Shizawa and Eaton, 1992. Used with 
permission. (a) Vortex rotation from a VG that pushed the flow in the same direction as cross flow near wall. 
(b) Vortex rotation from a VG that pushed the flow in the direction opposing the cross flow near wall 
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One of the most common applications for large VGs is to delay shock-induced separation on wings with 
transonic flow. The physical issues involved were discussed by Pearcey (1961) and ESDU (1993) provides in- 
stallation design guidelines. Large VGs have also been used on fuselage closures, engine-inlet and wind-tunnel 
diffusers, and windmill blades (McMasters, Crowder, and Robertson, 1985). In general, counter-rotating VGs 
have been favored in flows that are without strong cross flow and in which the flow direction can be depended 
on to remain the same, and co-rotating VGs have been favored on swept wings. 

For modeling the effects of VGs in CFD calculations, it's tempting to treat the flow only 1n a spanwise-av- 
eraged sense and to modify the turbulence model so as to simulate the effects of enhanced mixing. But we 
saw in Figure 4.5.3 that a boundary layer downstream of conventional large VGs has a pronounced large-scale 
spanwise-periodic structure, the effects of which are not likely amenable to averaging. The most successful 
modeling of large VGs so far has been based on 3D RANS calculations that resolve the individual vortices (see 
Yu, Kao, and Bogue, 2000). 

SBVGs, on the other hand, are much smaller than conventional VGs and have spacings that are not so large 
relative to 6, so that it is much more reasonable to think of their effects in spanwise-averaged terms. The method 
developed by researchers at QinetiQ, Ltd, based on experimental measurements by Ashill, Fulker, and Hackett 
(2001, 2002), follows this general strategy in simulating the effects of SBVGs in a 2D integral boundary-layer 
code. 


4.5.3 Steady Tangential Blowing through a Slot 


Blowing air through a tangential slot in the wall can be a very effective way to delay separation. The air can 
be introduced passively through an open slot in an airfoil, as in Figure 4.5.5a, or actively from a pressurized air 
source, as in Figure 4.5.5b. In either case, the result downstream of the slot is a velocity profile of the wall-jet 
type, with a velocity peak near the bottom of the layer. Note that in the passive case the maximum total pres- 
sure in the jet is the freestream total pressure, so that the maximum velocity at the jet exit 1s about the same as 
the inviscid-flow velocity at the edge of the boundary layer. In the active case, on the other hand, total pres- 
sures substantially higher than freestream are typically used, which results in higher jet velocities. Because of 
the typically higher momentum flux, active blowing tends to be implemented with a narrower slot than passive 
blowing. 


Figure 4.5.5 Tangential blowing slots and associated wall-jet velocity profiles. (a) Passive open slot. (b) Act- 
ive slot with pressurized air supply 
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Much of what is known about the performance of wall jets in preventing separation applies to the active case 
with a relatively narrow slot and a high jet velocity. In this situation, the slot height and jet volume flux required 
to control downstream flow separation tend to be relatively small, so that the details of the jet velocity profile 
don't matter much, and performance depends primarily on the integrated jet momentum flux. On the other hand, 
a passive wall jet, with its larger slot height and lower jet velocity, is not so simple and does not seem to per- 
form in a way that correlates neatly with simple flow parameters. Understanding passive slot blowing requires 
taking a different view, looking at the problem in terms of the boundary layers and pressure distributions on the 
individual surfaces rather than in terms of a single jet. We'll discuss this approach to understanding the passive 
slot effect in Section 7.4.4. 

Steady slot blowing is a “brute force” method of separation control in that it works by providing the fluid at 
the bottom of the boundary layer with the additional momentum it needs to overcome a longer run of adverse 
pressure gradient without separating. 

The usual correlation parameter for the effectiveness of high-velocity wall jets in preventing separation on 
an airfoil of chord c is the momentum-flux coefficient: 
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The level of Cy required depends on how far separation needs to be delayed and on other details of the particular 
application. To bring about a major change in separation behavior, for example, to achieve fully attached flow 


on a plain (unslotted) trailing-edge flap at a large deflection angle, might require Cy ~ 0.02—0.06, depending on 
the details. Performance of slot blowing correlates reasonably well with Cy in the sense that for a given airfoil 
shape and flow condition, performance is insensitive to slot height and jet velocity, provided Cy is kept constant. 
Lachman (1961) and Korbacher (1974) survey an extensive assortment of early investigations of the overall 
performance of steady blowing in delaying separation, and McGahan (1965) made detailed flowfield measure- 
ments in a series of separating flows with steady blowing. 

Steady blowing can be highly effective in delaying or preventing separation, but it tends to be costly in an 
energy sense. For most of the practical applications where blowing might be considered, providing a Cy level 
above 0.01 requires a large amount of compressor power. 

Providing slot blowing can entail other practical problems as well. An effectively 2D slot interrupts the skin 
of the body, and an alternative path for possible structural loads must be provided. And if the required slot height 
is small, maintaining the height accurately can be a problem, especially if the jet supply pressure is high. In 
some cases, an attractive alternative is to replace the slot with a spanwise array of discrete nozzles discharging 
parallel to the surface. Only discrete holes through the skin are then required, and precise nozzle geometry is 
easy to maintain. Discrete-jet blowing has been found to be just as effective as slot blowing, provided that the 
nozzle array is placed far enough upstream so that the discrete jets have a chance to merge sufficiently to pre- 
vent separation where it needs to be prevented (McLean and Herring, 1973). 


4.5.4 Active Unsteady Blowing 


Active unsteady blowing is a much more recent development than steady blowing, and it involves entirely 
different physics. Instead of preventing separation by brute force addition of momentum to the bottom of the 
boundary layer, unsteady blowing controls the effects of separation by manipulation of the separated shear layer. 
Unsteady excitation provided by a pulsed blowing jet or by a small mechanical flapper (like a miniature version 
of the spoilers used on wing upper surfaces) excites instabilities in the separated shear layer, causing it to oscil- 
late with a large amplitude. The time-averaged flowfield that results is intermediate between the separated flow 
that would occur without control and the fully attached flow that could be achieved with sufficiently intense 
steady blowing. On an airfoil or trailing-edge flap, for example, the increment in lift relative to the separated 
case 1s less than what could be achieved with steady blowing but can still be substantial. 

So unsteady excitation can't achieve the performance levels that steady blowing can, but there are indications 
that unsteady excitation can achieve useful levels with much lower energy inputs than required for steady blow- 
ing. For example, Seifert et al. (1993) reported substantial increases in Cymax for an airfoil with a plain flap de- 
flected 30° with a time-averaged Cy, of 0.0006, roughly 2 orders of magnitude lower than steady blowing might 
have required. More recently, experiments and CFD simulations (the calculations by Shmilovich and Yadlin 
(2007), for example) have required much higher excitation levels, however. The excitation level that would ac- 
tually be required to provide attractive performance for a real aircraft high-lift system is still highly uncertain. 

There are two distinct approaches regarding the frequency of excitation. Earlier experiments on unsteady 
blowing (Seifert et a/., 1993) found a relatively low reduced frequency of excitation fc/Uo ~ 1 to be most ef- 
fective. More recently, a high-frequency approach using frequencies an order of magnitude higher has been pro- 
posed (see Glezer, Amitay, and Honohan, 2003, for example). 


4.5.5 Suction 


Suction through slots or porous surfaces can be highly effective at delaying separation, essentially by removing 
the air with the lowest momentum at the bottom of the boundary layer and thus delaying the reversal of the ve- 


locity profile. Schlichting (1979) shows flow visualization images of slot suction delaying separation in a diver- 
ging channel and in the external flow around a circular cylinder. Thwaites (1958) shows experimental data for 
a circular cylinder on which porous-surface suction eliminated separation so that the theoretical potential-flow 
pressure distribution was nearly achieved. In Section 6.1.10, we'll look at another way of applying suction, in 
which the body shape is designed to localize all of the body's pressure recovery over a single suction slot, where 
a large fraction of the boundary layer arriving from upstream is removed. In practical applications, suction, like 
steady blowing, tends to be costly in terms of energy. 


4.6 Heat Transfer and Compressibility 


In our theoretical discussions so far in this chapter, we've generally assumed constant-property flow. Though 
the general momentum-transport processes and the development of the velocity field we've discussed so far 
wouldn't likely be changed qualitatively by variations in properties, variable fluid properties can have important 
quantitative effects on the flowfield, and there are situations in which the heat transfer and the wall temperature 
are of practical importance. Such effects can come into play through heating or cooling at the wall, or through 
the effects of compressibility, or combinations of these. 


4.6.1 Heat Transfer, Compressibility, and the 
Boundary-Layer Temperature Field 


When either heat transfer or compressibility causes a variation in temperature in the neighborhood of a wall, we 
have, by definition, a temperature profile and a thermal boundary layer. The development of the thermal bound- 
ary layer depends both on what goes on in the flowfield and on the thermal boundary conditions at the wall, 
where either the temperature, the heat flux, or a relationship between the two may be imposed by the thermal 
properties of the body and other aspects of the thermal environment. 

As an illustration of the kinds of effects that can arise, consider a wing skin that is also part of a fuel tank. 
Fuel was pumped into the tank at a temperature close to the ambient temperature on the ground, and now the 
airplane is flying at high altitude, where the air is much colder. The fuel is being cooled by heat transfer that 
is driven by the temperature difference and regulated by the thermal conductivity of the fuel and the skin, and 
by the effective conductivity of the thermal boundary layer. The surface temperature adjusts itself to balance 
the transfer of heat from the tank with the transfer to the flow. The temperature difference that drives the heat 
transfer to the flow is the difference between the actual surface temperature and the adiabatic wall temperature, 
which is just the temperature the wall would reach in the absence of heat transfer, as, for example, if the wall 
were a perfect thermal insulator. At very low Mach numbers, the adiabatic wall temperature is just the static 
temperature of the flow, while at higher Mach numbers it reaches an elevated adiabatic recovery temperature 
because the kinetic energy of the flow is significant, and some of it is turned into heat in the boundary layer 
through viscous dissipation. We'll shortly consider a simplified theoretical estimate of the adiabatic wall tem- 
perature. 

Other sources of heat flux (electromagnetic radiation, for example) can also affect surface temperature. The 
surface of a vehicle at rest in the sun can become quite hot to the touch, but the temperature will drop rapidly 
as flow speed increases. The net electromagnetic heat flux in this case is the difference between the absorbed 
part of the incoming solar flux (the part that isn't reflected), which is roughly fixed, and radiation from the body, 
mostly in the infrared. As the body is cooled by the flow, the net radiation into it increases somewhat because 


the outgoing infrared radiation is reduced, but the effective conductivity of the thermal boundary layer increases 
greatly with flow speed, so that the radiation overall becomes less and less of a player in the heat balance that 
determines the surface temperature. 

Thermal boundary layers encompass wide ranges in the relative importance of the various effects. For ex- 
ample, if the Mach number is low, and heat transfer is driven by small temperature differences relative to the 
absolute temperature, the density and viscosity will be almost constant, and the flowfield will be affected very 
little by the heat transfer. In such cases, the heat is conducted and convected by the flow much as a passive con- 
taminant would be. With larger temperature differences and/or Mach numbers, on the other hand, the changes 
in density and viscosity can have significant effects on the flowfield. In some flows, heat transfer dominates 
the thermal effects, while in others, heat transfer from the wall is insignificant, and compressibility is the only 
source of temperature differences. And many flows land between these two extremes. 


4.6.2 The Thermal Energy Equation and the Prandtl 
Number 


The thermal energy equation governs the temperature field and its interaction with the velocity and pressure 
fields (see Schlichting, 1979, for a detailed derivation). The equation expresses the balance between conduction 
and convection of thermal energy, and its conversion to and from kinetic energy by compression, expansion, 
and viscous dissipation. The equation gives rise to an important dimensionless material property, the Prandtl 
number: 
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where Cp is the specific heat at constant pressure, and k is the thermal conductivity. The Prandtl number appears 
in both the thermal conductivity and viscous dissipation terms in the energy equation (see Table 3.9.2) and meas- 
ures the relative importance of viscous diffusivity and thermal diffusivity in the development of the velocity and 
temperature fields. Schlichting (1979) uses an order-of-magnitude analysis to arrive at a rough estimate of how 
the Prandtl number affects the relative thicknesses of the velocity and thermal boundary layers in laminar flow: 


rik re lf 
4.6.2 ay j é id E ] * 


In gases, the Prandtl number is not far from 1, and the thermal boundary layer tends to be similar in thickness 
to the velocity boundary layer, as illustrated in Figure 4.6.1la. In liquids, the Prandtl number is typically orders 
of magnitude larger, and the thermal boundary layer tends to be much thinner than the velocity boundary lay- 
er, as illustrated in Figure 4.6.1b. Of course, in real situations the thickness of the thermal boundary layer also 
depends on the upstream history of the thermal boundary condition. For example, there can be situations in 
which a pronounced temperature difference is introduced at the wall well downstream of the start of the velocity 
boundary layer. The thermal boundary layer arising from such a change in boundary condition will be thinner 
than the velocity boundary layer until the temperature difference has had a chance to diffuse out into the flow, 
as illustrated in Figure 4.6.1c. 


Figure 4.6.1 Schematic illustration of the thermal boundary layer in relation to the velocity boundary layer. (a) 


Gases (Pr about 0.7). (b) Liquids (Pr = 1). (c) Gases (Pr near 1) after sudden change in wall emiezalt 
6 temperature . 
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4.6.3 The Wall Temperature and Other Relations for an 
Adiabatic Wall 


In many high-speed flow applications the wall boundary condition is very nearly adiabatic, and it is useful to be 
able to predict the wall temperature. Because the Prandtl number in gases is close to 1, we'll take as a starting 
point an energy integral derived independently by Busemann and Crocco (see White, 1991) for the special case 
of Pr = 1. If we further assume a perfect gas and an adiabatic wall, the boundary-layer thermal energy equation 
can be integrated, with the result that the total enthalpy is constant through the boundary layer: 


4.6.3 H=h-+ u" /2 = constant. 


Thus for Pr = 1, the adiabatic wall temperature, or the flow temperature at the wall, where u = 0, is the same as 
the stagnation temperature of the flow at the edge of the boundary layer, which in most external-flow situations 
will be the same as the far-field stagnation temperature. 

But in gases the Prandtl number 1s a bit less than 1 (0.71 for air), and the adiabatic wall temperature 1s a bit 
less than the stagnation temperature at the boundary-layer edge. We define an adiabatic recovery factor: 


4.6.4) = (Taw Te) /(Tte = dhe F 


We see that the Pr = | result of Equation 4.6.3 corresponds to r= 1. White (1991) presents approximate analyses 
indicating first that r should be very closely a function of Prandtl number only, and further that for laminar flow 


465° ™ Pr!/* = 0.84 for air. 


and for turbulent flow 
4660 % Pr’? = 0.89 for air. 


So we see that the adiabatic recovery temperature in air is much closer to the stagnation temperature than to 
the static temperature of the local outer flow, especially if the boundary layer is turbulent. These relationships 


should be good over the range of Prandtl number from 0.1 to 3.0, and provide a very convenient way to make 
rough estimates of the adiabatic wall temperature. 


Consistent with the analyses that led to Equations 4.6.3-4.6.6, a reasonable approximation to the density and 


temperature profile for laminar or turbulent flow and an adiabatic wall is 
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where r is given by Equation 4.6.5 for laminar flow or Equation 4.6.6 for turbulent flow. A typical temperature 
profile for turbulent adiabatic-wall conditions according to this equation 1s illustrated in Figure 4.6.2. 


Figure 4.6.2 Velocity and temperature profiles for an adiabatic turbulent boundary layer in air at M = 1, ac- 
cording to Equation 4.6.7 
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The increase in temperature through the boundary layer is accompanied by a reduction in density, which in- 
creases 6 and reduces 9, so that the shape factor H 1s increased. Whitfield (1978) proposed a correlation for this 


effect for both laminar and turbulent flows up to Me = 3, in terms of the kinematic shape factor Hx (H computed 
with the density ratio omitted from the integrals): 


46 H = Hq(1 +0.113Mf) + 0.290M. 


A plot of this relationship in Figure 4.6.3 illustrates how a given velocity profile with a given Hx results in 
significantly higher values of H at high Mach numbers. The increase in temperature also tends to reduce Cy. An 
approximation for this effect in the case of a flat plate was seen in Equation 4.3.7 and Figure 4.3.4. 


Figure 4.6.3 Effect of Mach number on the shape factor H for adiabatic-wall boundary layers, according to 
Equation 4.6.8 





4.7 Effects of Surface Roughness 


Distributed surface roughness can influence an aerodynamic flow in different ways depending on the situation: 
It can affect the location of laminar-to-turbulent transition, and it can alter the skin friction of a turbulent bound- 
ary layer and through combinations of these, it can alter the development of the boundary layer and the location 
of subsequent separation. In Section 4.4.1, we briefly discussed the effects of Reynolds number and roughness 
on transition, and in Section 6.1.6, we'll see how these can in turn affect the separation location and the pres- 
sure drag of spheres and cylinders. The maximum lift of an airfoil or wing also depends on separation, as we'll 
see in Section 7.4.3, so that roughness can affect maximum lift as well as drag. In Section 6.1.8, we'll look in 
some detail at how surface roughness can increase turbulent skin friction, which can influence total drag even 
when there is no change in transition or separation. This type of roughness effect 1s quite common in applica- 
tions, caused, for example, by rough materials or paint, or by dirt or other contamination. Then in Section 6.2.2, 
we'll look at methods for using empirical data on roughness effects measured in simple flow situations for drag 
estimation in more-general flows. Riblets are an example of another sort of surface texture, one that would be 
applied purposely to reduce turbulent skin friction, and we'll discuss them in Section 6.3.2. 


Chapter 5 


General Features of Flows around Bodies 


In Chapter 4, we looked in considerable detail at the physics of the flow closest to the surface of a body, in the 
viscous boundary layer. In this chapter, we explore the generic characteristics of the entire flowfield in extern- 
al flows around bodies, in preparation for more detailed discussion of specific aspects such as drag and lift in 
following chapters. While the basic physical laws (the NS equations: Navier-Stokes) govern all of the flow phe- 
nomena we'll look at, the equations by themselves won't help us much in our quest for intuitive understanding. As 
we saw in Chapters 2 and 3, there is just too much of a gap between the simplicity of the local physical laws and 
the potential complexity of the flow phenomena that they govern for us to be able to get much from the equations 
by mental exercise alone. Our discussion will therefore rely heavily on knowledge of phenomenology, that is, 
what actually happens in real flows. But we'll try to get beyond just describing what happens and understand why 
things happen the way they do. Some of the flow features we'll consider will seem at first to be trivial or obvious, 
but I'll ask you to bear with me. We'll find that explaining the “why” of even the most obvious flow features can 
be surprisingly tricky. 

For purposes of this discussion, we'll put ourselves in the body-centered reference frame and assume the flow 
can be treated as steady. The unsteadiness in turbulent boundary layers and wakes will usually not be of direct 
interest to us because of the small length scales and short time scales involved. For most of the flow phenomena 
we'll look at, we'll be concerned with only the time-averaged effects of turbulence, which, as we saw in Section 
3.7, are essentially the same as if there were local increases in the coefficients of viscosity and thermal conduct- 
ivity. In terms of time-averaged effects, turbulence will have a qualitative effect on a flow only when it affects 
the pattern of boundary-layer separation, an issue we'll return to several times in the remainder of the book. 

In Chapter 3, we discussed a general flow structure that we could deduce fairly directly from the local physics, 
that is, that a viscous boundary layer will develop close to the surface of a body, and the flow outside the bound- 
ary layer will tend to remain irrotational. Now, we'll try to understand features of global flowfields that are not 
easily inferred from the local equations. 


5.1 The Obstacle Effect 


The main thing that characterizes all flows around bodies is that the body acts as an obstacle. It is obvious that 
flow cannot pass through a solid body unobstructed and must therefore move aside to flow around the obstacle. 
This general response is something we all take for granted, but the detailed mechanisms behind it are not so ob- 
vious. 

Primitive intuition tends to imagine fluid particles flying toward the body without interacting with each other 
and hitting the front of the body unimpeded, like a hail of bullets, as illustrated in Figure 5.1.1. Even Newton 
developed theories of drag and lift based on such a model. Two major issues are not satisfactorily resolved by the 
“bullet model’: (1) what happens to the particles after they collide with the body and (11) what happens behind 


the body. With the bullet model, we might expect particles to just pile up on the front surface of the body or to 
bounce away from the body in all directions, never to interact with the body again. We would also expect to see 
a vacuum behind the body. Neither of these things happens in common aerodynamic flows. Of course, the root 
of the problem with the bullet model is 1n its assumption that fluid particles don't have random motion relative 
to the general flow direction and don't interact with each other. In the real world, the continuum behavior that 
results from frequent molecular collisions leads to a flow structure that begins to deviate from the bullet model 
well ahead of the nose of the body and differs drastically from it behind the body. 


Figure 5.1.1 “Bullet model” for flow approaching a solid body 
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The real behavior of a continuum flow encountering an obstacle can be seen 1n nature, as shown in the sketch 
of flowing water by Leonardo da Vinci in Figure 5.1.2. What Leonardo's sketch shows, of course, is not the 
streamlines of the flow, but the pattern of waves on the water surface, but it is still clear that the flow ahead of 
the obstacle is reacting to the obstacle's presence, and the flow 1s filling the space behind the obstacle. 


Figure 5.1.2 Water flowing around an obstacle as visualized by Leonardo da Vinci. Detail from a drawing 
circa 1510-1513, A seated old man, and studies and notes on the movement of water. 





To maintain its integrity as a continuum and to obey the continuity equation, the flow must react ahead of the 
body and move aside. After it passes the body, in most cases, fluid from the main flow fills the space behind the 
body, though the flow path by which it gets there may be complicated, as we'll see later. In high-speed liquid 


flow, cavitation or ventilation may take place, leaving behind the body a cavity that is empty of liquid and filled 
only with gas, but this is a special case we'll not pursue further. 

A simple idealized case that illustrates the key features of the usual basic flow pattern is the 2D incompress- 
ible potential flow around a circular cylinder shown in Figure 5.1.3a—d. 


Figure 5.1.3 2D incompressible potential flow around a circular cylinder. (a) Streamlines. (b) Contours of con- 
stant pressure (Cp interval = 0.5) and constant velocity magnitude. (c) Timelines. (d) Surface pressure distribu- 
tion 
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First examine the streamline pattern in Figure 5.1.3a. The flow reacts ahead of the body by diverging away 
from the centerline to go around it, as we can imagine it must also be doing in Leonardo's sketch in Figure 5.1.2. 
After it passes the widest part of the body, the flow converges toward the centerline to close in again behind 
the body. The inviscid streamline pattern in Figure 5.1.3a exaggerates the suddenness of the reconvergence, or 
“closure” of the flow, but all flows eventually close behind a body, and we'll defer the details to later in the 
discussion. 

Now examine the distributions of velocity magnitude and pressure coefficient Cp in Figure 5.1.3b, which 
can be shown as a single set of contours because the flow is assumed inviscid, Bernoulli's equation applies, and 
each value of the pressure corresponds to a single value of the velocity magnitude. Positive Cp corresponds to 
low velocity magnitude, and negative Cp corresponds to high velocity magnitude. 

Keep in mind that the cause-and-effect relationship between the velocity and the pressure is circular, as I've 
stressed several times in the preceding chapters and will repeat several more. The region of high pressure at the 


front of the body coincides with a slowing of the flow and the start of the divergence of the flow around the 
body. The flow along the central streamline reaches stagnation pressure and zero velocity where it hits the nose 
of the body at the stagnation point. 

The areas of low pressure at the “flanks” of the body speed the flow up again, actually overspeeding it by a 
factor of 2 in this example. The lateral pressure gradients in this region reverse the divergence away from the 
centerline and cause the flow to begin to converge toward the centerline so as to fill in behind the body. Note 
that the high velocity at the flanks coincides with pinching together of the streamlines in Figure 5.1.3a, in keep- 
ing with the 1D subsonic-flow relationship between velocity and streamtube area. Finally, the high pressure at 
the back of the body coincides with a slowing of the flow and the stopping of the convergence. 

Although the body imposes no hard constraint on the flow area, the wide part of the body acts as a sort of 
constriction, and the flow responds by speeding up to squeeze past. This streamtube-pinching effect is not a kin- 
ematic necessity, but a dynamic one. There are kinematically possible flow patterns in which there is no stream- 
line pinching around the wide part of the body, like the pattern sketched in Figure 5.1.4. (In 2D, the pushing 
out of the streamlines around the body would have to extend unattenuated to infinite distances from the body.) 
Such a flow pattern 1s not forbidden by kinematic considerations or by conservation of mass, but it does not sat- 
isfy the momentum equation. One way to see this is to look at what happens in the flow crossing a vertical line 
above the crest of the body. All of the streamtubes crossing this line have the same area and therefore the same 
velocity and pressure. So the pressure is constant in the cross-stream direction, which is not consistent with the 
flow curvature. 


Figure 5.1.4 A kinematically possible flow pattern in which the pushing out of the streamlines around a 2D 
body extends unattenuated to infinite distances from the body, and there is no streamline pinching around the 
wide part of the body. Such a flow pattern is not forbidden by kinematic considerations or conservation of 
mass, but it does not satisfy the momentum equation 





We'll consider these issues again in connection with the streamtube-pinching explanation for the lift on an 
airfoil in Section 7.3.1.5. 

Adding timelines to the picture, as shown in Figure 5.1.3c, also provides interesting insights. The delay of 
the flow that passes through the low-velocity region around the front stagnation point is clearly seen. The high 
velocity around the flanks offsets the delay somewhat, but there is further delay at the rear of the body, and the 
final result downstream of the body is a net delay of the flow that passed near the body, and that delay 1s never 
recovered, even in this inviscid example. 


The nonuniform delay means that parcels that pass close to the body experience a permanent displacement 
relative to their former neighbors and that there is thus a permanent deformation of the fluid. In a potential flow 
like this one, the deformation is a result solely of the passage of the flow through the pressure field surrounding 
the body, and as we saw in Section 3.8.2, such an interaction cannot introduce vorticity, or rotation. Thus each 
fluid parcel, step-by-step in its passage through the field, experiences purely irrotational deformations like that 
illustrated in Figure 3.3.5b. A global deformation pattern like that in Figure 5.1.3c is a result of an accumulation 
of such elemental local deformations. In a viscous flow, viscous forces 1n the boundary layer and wake contrib- 
ute additional local deformations of the shearing type illustrated in Figure 3.3.5c. 


A striking feature of the timeline pattern is the blank space behind the body, which has not yet been reached 
by any of the timelines. This void in the timeline pattern reflects the fact that the closer a fluid parcel is to the 
stagnation streamline, the longer the delay it experiences. In fact, on the stagnation streamline itself, the delay 
just to reach, or to leave, a stagnation point is logarithmically infinite. Thus timelines that start a finite distance 
upstream of the body never reach the front stagnation point, and never touch the body or any point on the aft 
stagnation streamline. 

The corresponding pattern in the viscous flow at R ~ 140 was shown in Figure 3.3.2c. The pattern is similar 
in front of the body, but separation from the flanks of the body, and the resulting viscous wake, prevent the 
timelines from closing in as much in the nearfield behind the body. Looking downstream in the real flow, it 
appears that the flow that was delayed passing close to the body is swallowed up by the unsteady vortex street 
in the wake, and the flow outside the wake doesn't appear to have been delayed on average. Denker (1996, web 
site) shows a nice illustration of the delay effect in connection with the flow around an airfoil. 


The pressure distribution along the surface for the potential-flow case is shown in Figure 5.1.3d and is seen 
to be symmetrical front to back. This is consistent with the fact that this flow has zero pressure drag, as do all 
2D potential flows, something we'll discuss further in Section 5.4. In flows with viscosity, the pressure rise at 
the back, often referred to as the aft “pressure recovery,” is usually significantly reduced or completely absent, 
and the pressure drag is generally nonzero. Pressure drag in the viscous case is something we'll discuss further 
in Section 6.1.6. 

Note that in examining flowfields like this 1t can be instructive to think of the interaction between the pres- 
sure and the velocity in two different ways. In what I would call the “integral” view, based on Bernoulli's equa- 
tion, one makes use of the negative relationship between the pressure and the velocity magnitude. The “dif- 
ferential” view, based on the momentum equation, is more complicated but also more powerful because it can 
explain changes in flow direction as well as velocity magnitude. In this view, one looks at the pressure gradient 
and relates it to the flow acceleration in a vector sense. It is worthwhile to try taking both of these views in 
looking at the flowfield depicted in Figure 5.1.3. 


Now we've seen the simplest general flow pattern associated with flow around an obstacle: high pressure, 
slowing down, and divergence as the flow approaches the front of the body; low pressure, speeding up, and 
pinching together of the streamlines as the flow squeezes past the widest part of the body; and slowing down 
again as the flow closes around the back. Around more-complicated body shapes, the pattern will become more 
complicated, but overall it will still have some of this general character. 

Note the contrast between the continuum flow and the flow according to the “bullet model” with regard to 
how “much” of the flow comes into contact with the body. In the bullet model, all the particles in a wide swath 
hit the front of the body, while in the 2D continuum flow only the central streamline touches the body. This 
brings us to the next major aspect of continuum flows: how the flow attaches to the surface of the body and 
leaves it. 


5.2 Basic Topology of Flow Attachment 
and Separation 


An important aspect of the topology of flows around bodies has to do with what part of the flow contacts the 
body, what it does while in contact, and how it leaves the body. The part that touches the body is, of course, the 
flow at the bottom of the boundary layer, the physics of which we discussed in some detail in Chapter 4. The 
flow that occupies the bottom of the boundary layer must obviously have originated in some part of flowfield 
upstream and, after leaving the body, will end up in some part of the field downstream. We've already seen that 
the Newtonian, or “bullet,” flow model is totally unrealistic in this regard. In this model, the entire streamtube 
“swept” by the cross section of the body collides with the forward-facing surfaces of the body, after which we 
have no realistic option for what happens to it. In a continuum flow, on the other hand, only an infinitely thin 
filament or slice of the flow coming from upstream can touch the body, something the mathematicians would 
call a “subset of zero measure.” 

We'll look at how this infinitesimal portion of the flow attaches to the body and separates from it, first in 
2D, and then in 3D. Then we'll consider the topology of the flow patterns associated with it, both on the body 
surface and out in the field. 


5.2.1 Attachment and Separation in 2D 


In general, if the flow is to come into contact with the body, a first attachment of some sort and a final separation 
of some sort are both required. The simplest possible example of attachment and separation in 2D was shown 
in the flow illustrated in Figure 5.1.3, where the two streamtubes straddling the central streamline split apart 
and just graze the body. These streamtubes attach to the body at the front stagnation line and separate from it 
at the rear stagnation line. Once two streamtubes attach in this way in a 2D flow, it is impossible for another 
streamtube from upstream to touch the body because to do so it would have to cross one or the other of the 
streamtubes that are already attached. 

The first attachment on a smooth surface will generally follow the simple pattern we just looked at in Figure 
5.1.3. Separation, however, even in 2D flow, has a wider range of possibilities, determined by the physics of the 
viscous boundary layer on the surface upstream and by features of the body shape. 





The separation pattern we see in Figure 5.2.1 is the simplest possible, in which the two attached streamtubes 
from upstream rejoin and separate from a single line. In part (a), the single separation line occurs on a smooth 
surface, a pattern seen only at very low Reynolds numbers, while in part (b), the separation line is anchored by 
a sharp edge at the rear of the body, as is typical of airfoil flows. 


Figure 5.2.1 Pattern of the simplest separation from the rear of a 2D body. (a) Smooth surface. (b) Sharp edge 


The other common pattern involves two separation lines that bound a “bubble” of recirculating flow that is 
isolated from the main flow and contains an additional line of attachment, as shown Figure 5.2.2. Topological 
consistency requires that the recirculating bubble must actually consist of two counter-rotating cells, as shown. 
In part (a), both separations are from the smooth surface; in part (b), one of the separations is anchored by a 
sharp edge; and in part (c), both are anchored by edges. Note that when a separation is not anchored by an edge, 
its location is sensitive to the flow environment upstream, especially the pressure distribution, as discussed in 
Section 4.1.4. 


Figure 5.2.2 Patterns with two lines of separation from the rear of a 2D body. (a) Smooth surface. (b) One 
sharp edge. (c) Two sharp edges 
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The flow patterns of Figures 5.2.1 and 5.2.2 illustrate the distinction between so-called “streamlined bodies” 
and “bluff bodies,” as shown in Figure 5.2.3. When a body has a sharp or small-radius tail and is shaped so 
as to limit the rear separation to a single line as in Figure 5.2.1b, or a practical approximation to it, it is of- 
ten referred to as “streamlined.” A streamlined body at an unfavorable angle of attack may have a separation 
bubble of significant size and thus fail to achieve “streamline” flow, as in Figure 5.2.2b. When the body closes 
too abruptly to achieve streamline flow under any high-Reynolds-number condition, as in Figure 5.2.2a,c, the 
flow is often referred to as “bluff-body flow.” We'll discuss the pressure drag associated with bluff-body flow in 
Section 6.1.6. 


Figure 5.2.3 Streamlined and bluff bodies in 2D flow. (a) Streamlined. (b) Bluff 


The above discussion raises an issue of terminology that can be confusing. In common parlance, the term 
“separation” is often reserved for situations where the flow separates “prematurely,” before reaching the aft 
closure point of the body, and the flow patterns of Figure 5.2.1, because they have no “separation” in this sense, 
would be referred to as “all attached.” But this use of the terminology ignores the fact that even if the flow 
doesn't separate prematurely, it must leave the body somewhere, and it seems logical to refer to that as “sep- 
aration” as well. Topologically speaking, the only distinction between the simple separation patterns of Figure 
5.2.1 and patterns like Figure 5.2.2a or b with “premature” separation is that the latter tend to divide the body 
surface into multiple regions and require an additional attachment line, as well as a more complicated flow pat- 
tern behind the body. 

In our discussion so far, we've really dealt with attachment and separation only in a topological or geometric 
sense. We've identified attachment/separation streamlines that lie in contact with the body over part of their 
length and are out in the field upstream and downstream of the body. But it is also intuitively natural to want 
to think of attachment and separation in terms of the Lagrangian time histories of fluid parcels that arrive at the 
body, come into contact with it, flow along the surface, and then leave. Here we have to be more careful. When 
attachment is at a stagnation point, as it is in the flow in Figure 5.1.3a, infinitesimal fluid parcels on the attach- 
ment streamline itself spend infinite time in the neighborhood of the stagnation point without actually arriving. 
Furthermore, because of the no-slip condition, fluid parcels on the surface itself can't really be said to be flow- 
ing along, and their “transit time” is also infinite. So to see a parcel arrive at the body and then leave, we must 
follow a parcel that passes at a finite distance to either side of the attachment streamline and doesn't actually 
touch the body, but only comes very close to it. This point will come up again when we discuss explanations of 
lift in Section 7.3.1. 


5.2.2 Attachment and Separation in 3D 


We've seen the simple patterns that attachment and separation can take on in 2D flows. If we look at cross sec- 
tions of 3D flows, projecting the velocity vector into the plane of the cross section, the general 2D flow patterns 
we've already discussed will often appear, but the additional dimension makes possible other patterns as well. 
The velocity field in a cross-sectional cut is a 2D vector field, like a 2D flow, but it doesn't have to satisfy the 
2D continuity equation. The flow has the additional dimension in which to move and in which it can vary, which 
opens up more possibilities for topological structure, even within a single cross section. And a single cross-sec- 
tional view will often fail to convey everything that's going on, so that to really understand a 3D flow we must 
look at the entire 3D field. 

In 2D planar flows, a streamline should be thought of as a stream surface, and a streamtube should be thought 
of as a layer of flow extending to infinity in both directions spanwise. To understand 3D flows, on the other 
hand, we must keep track of discrete streamlines, and we should think of streamtubes as discrete tubes, not lay- 
ers. With this in mind, let's look at flow attachment in 3D. 


While in 2D planar flows an infinite stream surface attaches to the body at a stagnation line, in 3D only dis- 
crete streamlines can typically reach the body surface. Each of these attaches to the body at a singular point of 
attachment, which is most often a nodal point of attachment from which the flow spreads out in all directions 
along the surface. To “attract” an attachment streamline to attach at a nodal point, something in the local body 
shape must act, for flow purposes, like a locally pronounced “nose.” A less common type of attachment point 
is a saddle point of attachment, of which we'll consider an example in our discussion of Figure 5.2.6b. In any 
case, common body shapes typically have only small numbers of attachment points, and often only one. All of 
the flow along the body surface can be thought of as emanating from these singular points of attachment, and 


the flow on any streamline other than an attachment streamline cannot contact the body, though it may come 
very close. 


Although true flow attachment in 3D is nearly always limited to isolated singular points, there 1s often anoth- 
er kind of area on the body surface, generally an elongated band called an attachment line, where flow attach- 
ment also takes place, not in the strict sense, but for all practical purposes. 

As an example of a body with attachment lines, consider the fuselage/swept-wing combination in Figure 
5.2.4. For now, we'll think of the flow as inviscid and illustrate it schematically in terms the streamlines. Only 
one outer-flow streamline attaches to this body, at the blunt nose of the fuselage, in a nearly axisymmetric 
stagnation-point flow. Along the leading edge of each wing is an attachment line, where the flow is lined up 
along the leading edge. To either side of this narrow band of spanwise flow, the flow diverges rapidly to go over 
the upper and lower surfaces of the wing. 





Figure 5.2.4 A fuselage/swept-wing combination with a singular point of attachment at the nose. The “attach- 
ment line” at the wing leading edge has the practical attributes of flow attachment but is not an attachment in 
the strict sense 





An attachment line is generally characterized by strong divergence to both sides, and by flow that approaches 
the surface as if it were headed for a true attachment. But the flow approaching the surface along an attachment 
line of finite length never reaches the surface; though it often approaches it very closely. So an attachment line 
of finite length is an area of “effective attachment,” not true attachment in the strict sense. An attachment line 
can be a location of true attachment only if it is infinitely long, as on the leading edge of the idealized swept 
wing of infinite span, which we'll discuss in Sections 8.6.1 and 8.6.2. 

Let's look more closely at a leading-edge attachment-line flow. If we take a cross section of this flow perpen- 
dicular to the leading edge anywhere along the span of the wing, we see a flow pattern that is qualitatively the 
same as the flow around a 2D airfoil with attachment at a stagnation point near the leading edge, as illustrated in 
Figure 5.2.5. In terms of the velocity component in the plane of the cross section, the boundary layer develop- 
ment looks just like the stagnation-point boundary-layer flow we discussed in Section 4.3.2, and the flow at the 
edge of the boundary layer is being entrained into the boundary layer as if it were headed for a real attachment 
to the surface. 


Figure 5.2.5 Cross-sectional view of flow at the leading edge of a swept wing 





Figure 5.2.6 Further examples of attachment lines on leading edges of isolated wings, seen in plan view. (a) 
Zero leading-edge sweep. (b) Near-zero, slowly varying leading edge sweep 
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But this cross-sectional view doesn't tell the whole story. In a 3D flow, this is not a stagnation point, and there 
is spanwise flow along the leading edge at velocities that depend on the sweep and other details of the wing 
and fuselage shapes. And in the cross-sectional view shown in Figure 5.2.5, the transit time for a fluid parcel 
on the stagnation streamline to reach the surface at the stagnation point is infinite. So given the finite spanwise 
velocity in the 3D case, it would take an infinite distance along the span for a parcel entering the boundary layer 
to reach the surface. Thus if we look close enough to the surface in the boundary layer along the leading edge 
in the finite-span case in Figure 5.2.4, even at the tip, we'll find fluid that arrived there from the outer flow in 
the vicinity of the nodal point of attachment at the nose of the fuselage, not from anywhere along the wing. So 
in an ideal sense, the only flow attachment is at the fuselage nose. Of course, at any station along the span of 
the leading edge, we would have to look extremely close to the surface to find fluid that arrived there from the 
nose of the fuselage. A very short distance farther from the surface we would find fluid that was entrained into 
the boundary layer on the leading edge just a short distance upstream (inboard). 

We used the term “attachment line” frequently in our discussion of 3D boundary-layer flows in Chapter 4, 
and we'll use it some more in connection with swept-wing leading-edge boundary layers in Section 8.6.2. This 
usage is justified because for all practical purposes flow attachment is taking place there, and it is only in the 
ideal sense discussed above that it is not a real flow attachment. And we use the term even though it 1s impre- 
cise in another way: On a 3D wing, there is generally no single streamline that qualifies as uniquely defining 
the attachment line. Defining it as the line above which the flow diverges toward the upper surface and below 


which the flow diverges toward the lower surface doesn't pin it down exactly. On a wing of finite span, there 
will generally be a narrow but finite band of flow that satisfies this condition. So 1t would be more precise to 
call it an “attachment zone” or “attachment band.” 


Let's look at two further examples illustrating attachment in 3D. First, an isolated wing of finite span, with 
zero sweep at the leading edge is shown in Figure 5.2.6a. Like the case of the swept wing in Figure 5.2.4, this 
flow has an attachment line near the leading edge, with a nodal point of attachment where it intersects the sym- 
metry plane. The nodal point on the symmetry plane is the only true attachment point, but the spanwise velocity 
elsewhere on the attachment line is so small that the flow in the neighborhood of a point anywhere on the at- 
tachment line is practically indistinguishable from the flow near a 2D stagnation point of attachment. For all 
practical purposes, any point on the attachment line qualifies as an attachment point like the nodal point at the 
symmetry plane. 

Now consider another isolated wing, but with the sweep of the leading edge varying slightly from zero, as 
in Figure 5.2.6b. In this case, there are two nodal points of attachment outboard with spanwise flow diverging 
very slowly away from them. Between the nodal points, the spanwise flow along the attachment line converges 
toward another type of singular point, a saddle point of attachment. In this situation, we have three singular 
points of attachment, but every other point on the attachment line 1s practically indistinguishable from the sin- 
gular points. 

A further issue regarding attachment lines comes to light if we take any one of the bodies in Figures 5.2.4 and 
5.2.6 and imagine morphing it gradually into a sphere. As the body becomes more sphere-like, the attachment 
point remains (or the multiple attachment points merge into one), and the attachment lines cease to exist as dis- 
tinct entities. But there is no distinct threshold at which an attachment line disappears. We have no definite cri- 
terion for deciding when diverging flow constitutes an attachment line and when it doesn't. We can cite obvious 
qualitative differences between the flow around a sphere and the wing attachment-line flow in Figure 5.2.4: On 
the sphere, the flow divergence is axisymmetrically distributed (independent of the azimuthal angle), while the 
wing leading edge has a pronounced local maximum in the divergence. Another possibility would be to reverse 
the region-of-origin definition of 3D separation that we developed in Section 4.1.4 and define an attachment 
line as a near-discontinuity in the region of destination of the streamlines. Note that at typically very small dis- 
tances to either side of a leading-edge attachment line the flow is headed to the upper and lower surfaces, which 
are widely separated destinations, practically speaking. But this is also a qualitative, not quantitative, criterion. 

Let's summarize what we've learned about the concept of an attachment line. There is often a distinct elong- 
ated band on a body surface where flow attachment is taking place for all practical purposes, and the term “at- 
tachment line” is a useful shorthand for what is happening. But we've seen that the term is imprecise in three 
ways: 

1. An attachment line of finite length is not an attachment in the strict sense, but only in a practical sense, 
2. There is generally no single streamline that defines an attachment line, and 
3. There is no quantitative criterion for deciding when diverging flow constitutes an attachment line. 

We use the term “attachment line” routinely, though the theoretical ground underlying it isn't entirely firm. 

So flow attachment to a body takes place through a combination of singular points of true attachment, and 
lines of near attachment called “attachment lines.” Flow separation from a body is similar in that there are sin- 
gular points of separation and separation lines, which are locations of separation for all practical purposes. But 
the similarity between attachment and separation extends only to these basic topological entities. The flow that 
separates from a body comes from the boundary layer and carries with it vorticity and a deficit in total-pressure, 
things not generally present in flow attachment. As a result, the flow downstream of separation is typically a 
very different animal from the flow upstream of attachment. 


Let's look at some of the basic flow structures that can result from separations in 3D. In Figure 5.2.7a—c, we 
have cross sections of flow patterns that can occur on round-cross section bodies at angle of attack, based on 
observations reported by Han and Patel (1977) and Su, Tao, and Xu (1993). These start as free-vortex sheets 
emanating from open separation lines S; along the flanks of the body. Whether or not the sheets roll up signi- 
ficantly at their free edges depends on the strength of the vorticity in the sheets, which depends on the angle of 
attack, and on how far the sheets have evolved along the length of the body. Once rollup of the primary vortex 
sheets has occurred, secondary separations S2 can appear, as in (b) and (c). Open separations from a smooth 
surface, as in these examples, can be unstable and move about with time, causing unsteady loads on the part of 
the body from which the vortex is shed or on structures downstream that may be impacted by the vortex, such 
as airplane tail surfaces. 


Figure 5.2.7 Flow structures associated with separation in 3D. (a) Free vortex sheets from open separation 
lines S1 along the flanks of a round cross-section body, either well forward on the body or at low angle of at- 
tack, where little or no rollup of the sheet has occurred. (b) Free vortex sheets as in (a), but farther along the 
body or at higher angle of attack, with some rollup of the sheets, and secondary separations S2. (c) Free vortex 
sheets as in (b), but where the primary and secondary sheets join to roll up into a single core. (d) Vortex sheet 
emanating from an open separation at the side and trailing edges of a wing. (e) Vortex sheet enclosing a closed 
separation bubble on the aft upper surface of a wing. (f) Vortex sheet enclosing a closed separation bubble at 
the tail of a body with abrupt closure (From Han and Patel, (1977)) 
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The free vortex sheet shed from the side and trailing edges of a lifting wing generally begins to roll up im- 
mediately, as in Figure 5.2.7d. The further development of the vortex sheet from a lifting wing is a topic we'll 
consider in detail in Section 8.1.2, and we'll look at what happens to the vorticity far downstream in Section 
8.5.5. 

A closed separation is generally a closed curve on the surface, and the vortex sheet emanating from it en- 
closes a region of recirculating flow with low total-pressure and high levels of turbulence and lower-frequency 
unsteadiness. A closed separation line can form as separation moves forward from a wing trailing edge, as in 
Figure 5.2.7e, or can form at the tail of a body with abrupt closure, as in Figure 5.2.7f. The latter is the 3D 
counterpart to the 2D bluff-body separation of Figure 5.2.3b. 


In many cases, as body angle of attack changes, for example, an open separation can change into a closed 
separation, and in other cases the two types of separation coexist. Thus the distinction between streamline be- 
havior and bluff-body behavior isn't as simple in 3D as it was in 2D. 

The flow that first attaches to a body is “clean” irrotational flow arriving from the farfield upstream. The 
flow that separates from the body, on the other hand, forms a vortical, usually turbulent shear layer. The body 
surface upstream of any separation can be thought of as being “wetted” by the outer flow, with nothing but a 
relatively thin attached boundary layer between the surface and effectively inviscid flow. The surface on both 
sides of an open separation line is wetted by outer flow in this way. A closed separation line, on the other hand, 
divides a surface upstream that is wetted by outer flow from a surface downstream that is wetted by the flow in 
the “separated region.” 

The patterns on the surface formed by the skin-friction lines, with their various isolated singular points and 
their “lines” of attachment and separation, obey the topological rules of critical-point theory, which is our next 
topic. 


5.2.3 Streamline Topology on Surfaces and in Cross 
Sections 


Much of our thinking about 3D flows 1s in terms of 2D views of the flowfield or of what goes on at the surface. 
Though we've seen that 2D views are limited in the information they can convey, they play a prominent role 
in our thinking because our graphics capabilities and our imaginations are better at dealing with things in 2D. 
We'll look at graphical visualization issues further in Section 10.7. 

Two of the 2D views that we often take are of the skin-friction distribution on the surface of a body and 
of velocity vectors projected into cuts (imaginary surfaces) out in the flowfield. These are both examples of 
2D vector fields defined on 2D surfaces. For purposes of studying the topology of these fields, we construct 
“streamlines” parallel to the vectors' directions, and we assign flow directionality, usually indicated graphically 
by directional arrows, but we can generally ignore the magnitudes of the vectors. For flowfields that are solu- 
tions to the NS equations, these “streamline fields” (really direction fields) are continuous except at isolated sin- 
gularities, or critical points, at which the direction is undefined because the vector magnitude is zero. Continuity 
means that streamlines don't collide with adjacent streamlines. There are no singular lines of attachment or sep- 
aration unless they happen to line up with a sharp edge of the body shape. Attachment and separation lines on 
smooth surfaces are not singular as in Figure 4.2.11b, but are indistinct as in Figure 4.2. 11a. 

Many interesting things can happen in a streamline field, but topologically speaking, the only things that dis- 
tinguish a given field are the arrangement of its critical points and the character of its behavior on the boundary 
of the domain. The overlying topological structure of a field doesn't tell us everything about it, but it can be 
helpful to our understanding nonetheless. Being able to spot the critical points provides a useful way of think- 
ing about a flow, and analyzing the critical points provides one way of testing the realizability of flow models. 
Knowledge of critical points can also be helpful in analyzing the oil-flow-visualization images often obtained in 
wind-tunnel testing, as we discussed in Section 4.1.2. Some of the most interesting things in a flowfield happen 
near the critical points, but because the surface shear stress is low there (remember that the shear stress 1s zero 
at the critical point itself), the oil tends to pile up and not resolve the details well. Understanding the character- 
istics of the various types of critical points can enable an observer to “fill in” details that are not always clearly 
visible. 

Mathematical topology and critical-point theory (see Kaplan, 1958; Flegg, 1974) define certain rules that 
patterns of critical points must obey. Using these rules to analyze patterns in fluid flows is a relatively recent 


development (Perry and Fairlie, 1974; Hunt et al., 1978). In the rest of this section, we'll look at a variety of 
situations that are of interest in studies of fluid flows, and the rules that apply. Much of the discussion follows 
that in Brune (1983). 

The types of critical points are illustrated in Figure 5.2.8, from Brune. Of all of these types, we will usually 
have to consider only nodes, at which the flow either converges from all directions or diverges in all directions, 
and saddles, at which flow converges in one set of opposing directions and diverges in the other (pattern to 
the lower left of the origin in Figure 5.2.8). Critical lines, or degenerate critical points, tend to appear only in 
2D situations (including axisymmetric). On 3D bodies, critical lines are all but impossible. Even in cases like 
those shown in Figure 5.2.6, there is no true degenerate line even though much of the attachment line is close to 
degenerate. Now let's look at some of the situations frequently encountered in 3D flows and at the topological 
rules that apply. 


Figure 5.2.8 Types of critical points in a 2D direction field, where p and q are parameters that are used to cat- 
egorize the form of the direction field around the critical point and are related to the “velocity” field in a co- 
ordinate system centered on the critical point, according to the diagram and equations at the top of the figure. 
(From Brune, 1983) 
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5.2.3.1 Skin-Friction Lines on a Complete Body Bounded by a Closed 
Surface 


We'll start with the simplest case. In Figure 5.2.9, we have a simple wing-body combination and an isolated 
wing, each with its combination of nodes and saddles shown. These are both simply connected solid bodies, and 
their bounding surfaces are simply-connected closed surfaces, for which the topological rule is 


where )>'N is the number of nodes and )'S is the number of saddles. We can see that the rule is satisfied in both 
cases in Figure 5.2.9. 





Figure 5.2.9 Examples of critical points on simply connected bodies bounded by closed surfaces. (From 


Brune, 1983) 
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Now consider a body that has holes through it. We're not talking here about holes just through the skin, but 
holes that go through the body, like ducts. The skin of the combined outer surface and ducts is still a single 
closed surface, just no longer simply connected. When holes are added, the flow patterns necessarily become 
more complicated, and the topological rule becomes 


59) UN = D9 = 2 = 22 





where g is the topological genus of the body. The genus is defined as the largest number of nonintersecting 
closed curves that can be laid out on the surface without dividing the surface into separate regions. The simple 
way to think of the genus is that it is the number of holes through the body. For example, the simple bodies of 
Figure 5.2.9 have g = 0, so that Equation 5.2.2 reduces to Equation 5.2.1. A torus, or donut, has g = 1. 

According to Equation 5.2.2, adding a hole through the body must add two more saddles than it does nodes. 
This is illustrated in Figure 5.2.10, which shows that the simplest possible effect of adding a duct through a 
fuselage-like body is to add two saddle points, one of attachment and one of separation. Other examples of 
opening holes in a body, such as deploying a slotted flap, which can open multiple holes between the main wing, 
the flap, and the support brackets, often add even more complexity. 


Figure 5.2.10 Simple example of adding a hole through a body and how it must add at least two saddle points. 
(a) Fuselage-like body with no hole, g = 0. (b) Fuselage-like body with duct, g = 1 
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5.2.3.2 Skin-Friction Lines on a Portion of a Body Surface 


With more-complicated body shapes and flowfields, dealing with the entire surface as we did above can be dif- 
ficult, and it is often more convenient to consider only a portion of the surface. The simplest option is to isolate 
a portion of the surface by drawing a closed curve around it in such a way that the curve doesn't pass through 
any critical point of the flow pattern. An example of this would be to take just a portion of the upper surface of 
a wing, as in Figure 5.2.11. Such a portion is not a closed surface, so that before we can apply topological rules, 
we must determine the index of the surface. 


Figure 5.2.11 Isolating a portion of a surface, in this case part of the upper surface of the wing of a simple 


wing-body configuration (g = 0). (From Brune, 1983) 
Closed curve 
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The index of a surface is determined by what goes on around the closed contour at the boundary. It's a little 
like circulation, but instead of being the line integral of the “velocity” vector, it just counts the integral number 
of rotations the “velocity” vector goes through as we take one loop around the contour. This is illustrated in Fig- 
ure 5.2.12. When the boundary 1s circuited in the counterclockwise direction as shown, the index I is the number 
of counterclockwise rotations of the “velocity” vector. Examples of simple flow patterns and their indexes are 


shown in Figure 5.2.12b. 


Figure 5.2.12 Illustrations of the index of a flow pattern on a bounded surface. (From Brune, 1983.) (a) 
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Once we know the index of a surface, based on the flow pattern at its boundary, the topological rule is 

523 2N — BS = I. 

The simplest example illustrating this rule would be to imagine that the surface portion in Figure 5.2.11 is 
covered by unidirectional flow, in which there would be no critical points, and for which we would have I = 0, 
trivially satisfying Equation 5.2.3. 

Let's see how this works for more-complicated flow patterns. In Figure 5.2.13 are schematic sketches of skin- 
friction lines on a swept wing, showing a progression of features that have been observed in actual swept-wing 
flows. In each case, we've drawn the boundary near the leading edge, the tip, the wing-fuselage junction, and 
the sharp trailing edge. 


Figure 5.2.13 Examples of patterns of skin-friction lines on a portion of one surface of a swept wing. The 
boundary of the region has been drawn at the trailing edge, near the junction with a fuselage, the leading edge, 
and the tip. Schematic representation of progression that has been observed to occur with increasing angle of 
attack. (a) With no separation forward of the trailing edge. (b) Closed separation bubble appears along out- 
board trailing edge. (c) Outboard separation jumps forward to the leading edge, and a nodal point surrounded 
by spiral streamlines appears. (d) Pattern that can occur on the main part of a wing with high-lift devices de- 
ployed 
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In Figure 5.2.13a, we have a pattern typical of a low angle of attack, with no separation ahead of the trailing 
edge. This pattern is topologically the same as uniform flow, so it satisfies Equation 5.2.3 trivially. In (b), we've 
increased the angle of attack, and a closed separation line has moved forward from the outboard trailing edge, 
but there are no critical points, as before, and Equation 5.2.3 is still trivially satisfied. 


The separated region in Figure 5.2.13b is bounded by a separation line, drawn in this sketch so as to be con- 
sistent with the region-of-origin definition of 3D separation from Section 4.1.4. This pattern illuminates some 
interesting features of separation on swept wings. Note that there are now two points on the trailing edge where 
the skin-friction lines run parallel to the trailing edge, and the skin friction perpendicular to the trailing edge is 
zero. Between these points, the skin friction perpendicular to the trailing edge is negative (denoted in the figure 
as “reverse | flow at TE”), and the skin-friction lines cross the trailing edge in the reverse-flow direction. But 
also note that this region of reversed perpendicular skin friction at the trailing edge doesn't encompass the whole 
spanwise extent of the closed separation bubble. With a bubble of finite extent as sketched here, some outboard 
portion of the bubble will have positive perpendicular skin friction at the trailing edge. Zero or negative perpen- 
dicular skin friction at the trailing edge is an indicator of separation only over an inboard portion of the bubble. 
At the onset of separation, when the bubble first starts to move forward from the trailing edge, perpendicular 
skin friction would be zero at a single point on the trailing edge and positive everywhere else. Such a point of 
zero perpendicular skin friction would thus be an indicator of incipient separation. We'll discuss this further in 
Section 8.6.2. 

In Figure 5.2.13c, we've increased the angle of attack further, and part of the closed separation line has moved 
up near the leading edge. Along the separation line, we now have a saddle point and a nodal separation with a 
spiral streamline pattern around it, and Equation 5.2.3 1s still satisfied. 

In Figure 5.2.13d, we show the main element of a wing that has part-span high-lift devices deployed, a 
leading-edge slat and a trailing-edge flap. The devices themselves are not shown in the drawing. These are slot- 
ted devices that enhance maximum lift, as we'll discuss in Section 7.4.4. Here the slat covers more of the span 
than the flap, which 1s typically the case. One of the patterns that can develop is shown here, a separation pattern 
with flow reversals at both the front and rear portions of our boundary, for which we have I = — 1. The pattern 
satisfies Equation 5.2.3 with one node and two saddles. 


5.2.3.3 Streamline Patterns Based on Velocities Projected into 2D Cuts of 
the Flowfield 


Now let's look at examples of flow patterns in 2D cuts out in the field. First, we'll take the simplest case, a cut 
with no boundary (a cut that doesn't cut the body, and goes to infinity in two directions). This is not a closed 
surface, and even though it doesn't have a closed outer boundary, we must determine its index in order to estab- 
lish the topological rule. Most external flows are close to uniform in the farfield, and if we follow any closed 
contour far from the body, we'll find I = 0, so that the rule is 


5242N —ES = O. 
The example in Figure 5.2.14, with one node and one saddle, satisfies this rule. 


Figure 5.2.14 Combined node and saddle in an unbounded field. (From Brune, 1983) 





When a cut goes through a cross section of the body, we will still generally have I = 0 for the farfield flow 
behavior, but we can have nodes and saddles on the boundary at the body surface. The nodes and saddles at 
the solid surface are called half-nodes and half-saddles, which we'll designate by N’ and S’. Examples of half- 
saddles can be seen in Figures 5.2.15 and 5.2.16. The number of critical points on the solid surface depends on 
how many separate pieces of the body are cut, and the rule is 


TN 4 1/OeN! — vs _ 1/ore! — 1 — 
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where n is the connectivity of the 2D domain of the cut, which is just the number of separate pieces of the body 
that are cut, plus one. Thus in the common situation in which the body is cut only once, we have n = 2, and the 
RHS of Equation 5.2.5 is — 1. Figure 5.2.15, from Brune (1983), shows the separation bubble that arises in a 
computational fluid dynamics (CFD) solution for the flow around an ellipse and how the whole pattern satisfies 
Equation 5.2.5 for n = 2. The separation points near the tail of the body are half-saddles (S'), and there are two 
recirculating cells (N) within the separation bubble. 





Figure 5.2.15 2D flow pattern from a Navier-Stokes solution for flow past an ellipse. (From Brune, 1983) 
n=e2 
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Figure 5.2.16 Postulated flow patterns in cuts through flows around blunt and sharp cones at angle of attack in 
supersonic flow, based on pitot-tube measurements. From Peake and Tobak, (1980), published by NASA. (a) 
Blunt cone (b) Sharp cone 

Note: For both cones, a half-saddle point S also exists on windward ray. Sum of singular points satisfies 


(Equation 5.2.5) 
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In a 2D flow such as the one in Figure 5.2.15, a node will have closed streamlines, like node (e) in Figure 
5.2.8, and is called a center node. A center node typically exists only in a 2D flow. It is an example of what 1s 
called a transitional flow pattern because it is on the boundary between two other types of patterns, the conver- 
ging and diverging spiral nodes to the left and right of the vertical axis in Figure 5.2.8. It generally takes only a 
small change in the flow conditions, for example, the introduction of a small amount of three-dimensionality, to 
change a transitional flow pattern into one of the neighboring forms. 3D examples in which nodes take on the 
spiral form are shown in Figure 5.2.16, cuts through flows around blunt and sharp cones at angle of attack in 
supersonic flow. These examples also satisfy Equation 5.2.5 with n= 2. 

The streamline patterns of Figure 5.2.16 were drawn based on actual flows, and they satisfy the topological 
rule. But there is a probable physical inconsistency in one of the details of the patterns, as drawn. Figure 5.2.17a 
shows a detail from one side of Figure 5.2.16b, with a circle superimposed to indicate the problem area: a nar- 
row “neck” between two streamlines going in opposite directions. In effect, the secondary separation vortex 
with clockwise rotation has pinched a larger region of counter-clockwise rotation nearly into two parts: the 
primary separation vortex on the upper left, and a smaller region to the right, with the narrow “neck” in between. 
The “neck,” with opposing velocities on opposite sides, must contain a region of very low velocity magnitude. 
Whether or not there is an actual point of zero velocity, the “neck” region is at least very nearly singular, and 
it would be clearer to show it as an actual saddle point, as indicated in the proposed modified pattern in Figure 
5.2.17b. Topological consistency then requires adding a node, which would seem to fit best in the small circu- 
lating region on the right, as shown. This is an example in which physical intuition combined with knowledge 
of the topological rule helped clarify the interpretation of a flow pattern. 


Figure 5.2.17 Detail from Figure 5.2.16 and a proposed modification to make it more physically consistent. (a) 
Circle added to the original pattern indicates “neck”’ flanked by flow in opposite directions. Pattern does not 
violate topological rules, but is physically questionable. The “neck” region is likely to contain a point of zero 
velocity. (b) Modified pattern with an additional saddle in the “neck” region and additional node to satisfy the 
topological rule 
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5.3 Wakes 


In Section 5.2.2, we noted that the flow separating from a body carries vorticity and a deficit in total-pressure, 
and that downstream of the tail of the body this vortical flow becomes the wake. Wakes take on a wide variety 
of forms. The wake behind a lifting body in 3D carries strong streamwise vorticity and 1s called a vortex wake, 
something we'll discuss its own right 1n some detail in Sections 8.1.2 and 8.5.5. Wake flow not associated with 
lift in 3D 1s referred to as a viscous wake, not because the rest of the flow is not viscous, but because the wake 1s 
the part that has felt significant direct effects of viscosity. Vortex and viscous wakes are overlapping categories, 
and there is no rigorous way to assign a given wake to one or the other. 

Wakes are practically always turbulent, and through turbulent diffusion they tend to spread out downstream 
of the body more rapidly than a laminar wake would. A viscous wake without significant streamwise vorticity 
spreads rapidly and becomes very diffuse, and the mean-velocity deficit decays to very low levels, as illustrated 
schematically in Figure 5.3.1. The cross-sectional outlines of wakes of different types evolve in distinctive 
ways. In 2D planar flows, wakes spread perpendicular to the general plane of the body, as shown in Figure 
5.3.2a. In axisymmetric flows, wakes remain round as they spread, as in Figure 5.3.2b. A 3D wake with no 
streamwise vorticity would be expected to lose its distinctive shape and become round as it spreads, as in Figure 
5.3.2c. But most 3D wakes probably have enough streamwise vorticity that they become more distorted, not 
less, as they spread, as in Figure 5.3.2d. 


Figure 5.3.1 Schematic illustration of the spreading of a viscous wake and the associated decay of the mean- 
velocity deficit 


Figure 5.3.2 Schematic illustration of how the cross-sectional shapes of wakes evolve. (a) 2D planar wake. (b) 
Axisymmetric wake. (c) Initially 3D wake with no streamwise vorticity. (d) Initially 3D wake with streamwise 
vorticity and associated distortion 
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Strong streamwise vorticity leads to complex changes in the cross-sectional shape of a wake as it flows 
downstream. The streamwise vorticity shed from the trailing edge of a lifting wing, for example, soon rolls up 
into distinct vortex cores. The vortex motion then inhibits turbulent mixing and slows the further spreading of 


the wake, so that the distinct vortex cores and the large velocity disturbances associated with them persist much 
farther downstream than disturbances do in a purely viscous wake. We'll look at this type of flow in detail in 
Sections 8.1.2 and 8.5.5. 

The form a wake takes in the nearfield depends on whether the flow is of the bluff-body type or the stream- 
lined type. Closed separation in the bluff-body case usually leads to a time-averaged (mean) flow characterized 
by multiple recirculating cells that persist into the near wake, as in Figures 3.3.1 and 5.2.15. At high Reynolds 
numbers, however, the wake flowfield 1s generally very unsteady. A bluff-body wake that is significantly thicker 
than the boundary layers that feed it 1s extremely unstable, which usually leads to periodic shedding of vortices 
that span nearly the whole thickness of the wake. As these large-scale structures form and move downstream, 
instantaneous flowfields look very different from the time average. 

The classic example of this kind of flow is the wake behind a circular cylinder in 2D, in which a train of vor- 
tices of alternating sign (a von Karman vortex street) 1s shed into the wake. The vortex street is very effective 
at mixing, and the wake grows rapidly in thickness within the first few diameters downstream of the cylinder. 
The separation points on the flanks of the cylinder oscillate fore and aft with opposite phases, locked with the 
vortex shedding, but this “nodding” of the separations isn't essential to the shedding. Even if the separations 
were anchored in place, the separated wake would be unstable, and shedding would occur. 

In bluff-body flows, patterns of vortex shedding tend to remain qualitatively similar over wide ranges of 
Reynolds number, at least in the nearfield. We looked at several views of the circular-cylinder flow at Ra = 140 
in Figure 3.3.2, and Figure 5.3.3 shows views of it at Ra = 50 000, from a DES calculation. At both of these 
Reynolds numbers, the separating boundary layers are laminar, but at the higher Reynolds number there is more 
small-scale turbulence superimposed on the shed vortex structures, as can be seen in Figure 5.3.3a, in terms 
of the instantaneous isovorticity surfaces. At the higher Reynolds number the small-scale mixing seems to be 
quickly smearing out the shed-vortex pattern, while at the lower Reynolds number the vortices look like they 
will remain coherent for many cycles. 


Figure 5.3.3 Wake behind a circular cylinder at R = 50000 from a Detached Eddy Simulation (DES) calcula- 
tion. In spite of the appearance of rapid spreading of the unsteady wake, the mean-flow recirculating bubble 1s 
relatively compact. Rapid closure of the mean-flow bubble can be thought of as a result of strong mixing. 
From Travin et al. (2000). Used with permission of Springer Science + Business Media. (a) Unsteady isovorti- 
city surfaces. (b) Mean-flow streamlines 





(a) (b) 


The time-averaged streamline pattern in Figure 5.3.3b provides an interesting contrast. There is no hint of 
the periodic shedding, just a pair of recirculating cells that look qualitatively like the laminar flow in the range 
20 < Ra < 60, in which there is separation, but the flow is below the threshold of unsteadiness. 


The pattern behind a body with true streamline flow is very different from the bluff-body pattern. The mean 
flow can actually leave a sharp trailing edge smoothly and without recirculation, as illustrated in Figure 4.1.2a. 
Still, there is turbulence of the usual boundary-layer variety upstream of the trailing edge. When the two bound- 
ary layers join and leave the trailing edge, the turbulence is no longer constrained by the solid surfaces, and 
the turbulence intensity and eddy viscosity increase. The resulting large shear stresses cause the mean-velocity 
profile to “fill in” rapidly, as shown in Figure 5.3.4, and with the reduction of the velocity defect comes a reduc- 
tion and “necking down” of the displacement thickness, as indicated by the dashed lines. There can be vortex 
shedding in such a wake, but in cases with turbulent boundary layers and no significant separated wake, the 
shedding is typically very weak. 


Figure 5.3.4 The filling in of the mean-velocity profile and the necking down of the displacement thickness in 
the near wake of a body in streamline flow 





Except in a tiny region around the trailing edge itself, the mean velocity field in a 2D planar wake in a 
streamline flow obeys the boundary-layer equations, with the boundary conditions changed to reflect the fact 
that there is no solid surface with a no-slip condition. In viscous-inviscid interaction calculations for airfoils and 
wings, it is common practice to solve the boundary-layer equations for some distance downstream in the wake, 
to account for the effect on the pressure distribution of the necking down of the displacement thickness and to 
provide a better basis for calculating the viscous drag, as we'll see in Section 6.1.4. Because momentum trans- 
port in the wake is almost entirely turbulent, the development of the wake has relatively weak direct dependence 
on Reynolds number. 


5.4 Integrated Forces: Lift and Drag 


One of our main motivations for developing the discipline of aerodynamics is to know how to predict and con- 
trol the integrated forces on bodies, and we resolve the forces in ways dictated by what is important 1n applic- 
ations. For most purposes it is convenient to resolve the component perpendicular to the farfield flow, which 
we usually call /ift even when it isn't oriented vertically upward. Lift is generally important for countering grav- 
ity and providing maneuvering forces for flight vehicles, and as part of the interaction that provides propulsion 
in sailing. The force component parallel to the farfield flow is generally a combination of drag (resistance) 
and thrust (propulsion). We'll look further at this distinction and the difficulties it entails in the introduction to 
Chapter 6. 

We get the total aerodynamic force by integrating the distributed stresses exerted by the fluid over the surface 
of the body. It is generally convenient to resolve the surface stress into a normal component, which in flows that 


obey the NS equations is just the local hydrostatic pressure, and a shear component, which is just the viscous 
shear stress. Then the lift (in the k direction) is given by 


L= | (keTen—pken) dS. 


5.4.1 5 
We'll look at the corresponding integral for the drag in Section 6.1.3. 

In still air, the surface shear stresses are zero, and the normal stress is just the local atmospheric pressure. 
Under the influence of gravity, atmospheric pressure exerts a vertical hydrostatic buoyancy force on any body 
with volume. In airplane applications, this is small but not insignificant, and it is often accounted for in careful 
bookkeeping of airplane weight. But gravity-induced buoyancy is usually ignored in dealing with aerodynamic 
forces, either from theory or from wind-tunnel measurements, which is tantamount to assuming that the contri- 
bution from buoyancy doesn't change between the static condition and a condition with flow. 


In the integrated force on a closed body, it is only the variations in pressure that matter, not the background 
level. Under flow conditions, the pressure variations are typically of order one in terms of the usual coefficients 
normalized by free-stream dynamic pressure, much larger than the shear stresses, which tend to be of order 10>. 
As a result, the lift is dominated by pressure differences, with only a negligible contribution from shear. For 
drag, the relative contributions vary widely, depending on whether the body 1s bluff or streamlined, a distinction 
illustrated in Figure 5.2.3. The drag of a bluff body is dominated by pressure differences. A streamlined body 
also can cause substantial pressure disturbances, but because there are no large separated regions, the pressure 
distribution is close to what it would be in inviscid flow, and the pressure contribution to drag is small enough 
that the shear contribution dominates. But for bodies with significant thickness, even if they are streamlined, 
the pressure contribution to drag is never negligible like the shear contribution to lift. Even for well-designed 
streamlined bodies, the pressure contribution is rarely less than about 10% of the total drag. For airfoils, the 
pressure contribution to drag is generally larger than this, as we'll see in Section 7.4.2. 

The pressure contribution to the force can be clearly interpreted as lift (Equation 5.4.1) or drag ("Equation 
6.1.1) only after it has been integrated over the entire surface of a closed body. If the pressure contribution is 
integrated over only a portion of a body, interpreting the result as “lift” or “drag” is problematic. We'll see the 
reasons for this when we consider the issue in detail as it applies to drag in Section 6.1.3. 

Newton's third law requires that the force exerted by the flow on the body is reciprocated by an equal-and- 
opposite force exerted by the body on the flow. And the equations of motion require that this force must have 
manifestations in the flowfield. Generating lift requires setting some of the fluid in motion, as we'll discuss in 
great detail in Chapters 7 and 8. Drag requires either generating lift in 3D or the presence of the dissipative 
processes associated with viscosity, as we'll see in Chapter 6. 


Chapter 6 


Drag and Propulsion 


It is common to define an aerodynamic retarding force on a body moving through a fluid as drag and a propulsive 
force as thrust. This definition is unambiguous if we apply it to the flight-direction component of the total aero- 
dynamic force on a complete body. But things are not so simple when we try to apply it to separate contributions 
to the total, as we often do. In practice we are often interested in vehicles made up of multiple parts and having 
active propulsion, with drag on some parts and thrust on others. Propulsive effort can be applied in a variety of 
forms: rotating propellers, flapping wings, complicated processes inside engines, and so on. The practical object- 
ive is generally to control the total flight-direction force on the vehicle, for example, to make it zero for steady, 
level flight or to make it a net thrust for climbing or accelerating flight. However, in assessing the aerodynamic 
performance of such a vehicle we have a natural inclination and some practical incentives to assess the drag-pro- 
ducing parts and thrust-producing parts separately. Thus we often divide the total flight-direction force into a drag 
on one portion of the vehicle's surface and a thrust on the other portion. This seems like a reasonable thing to do, 
and we've all seen the ubiquitous elementary force diagrams showing drag and thrust acting simultaneously on 
an airplane. But resolving partial contributions to the total drag or thrust in this way raises two potentially major 
difficulties. First, when a complete configuration is divided into portions, the integrated pressure force on each 
will often reflect spurious contributions that are not rightly viewed as either “thrust” or “drag,” as we'll see in 
Section 6.1.3. And then even if we ignore this first difficulty, dividing the force in this way is ambiguous because 
choosing how to divide up the surface of the vehicle is always arbitrary to some extent. 


There are some obvious tricks that one might think would allow us to determine separate drag and thrust for 
a vehicle designed to operate with active aerodynamic propulsion, but they don't work in general. For example, 
if we simply turned the engine off, wouldn't the total flight-direction force then be the drag? Well, it would be 
“drag” by definition, but its relevance to the situation with propulsion would be questionable. Parts of the propul- 
sion system (e.g., stopped propeller blades) would now have drag that they wouldn't ordinarily have, and the flow 
around the whole configuration would be different. 

Separating the effects of thrust and drag in the flowfield is problematic as well. In some parts of the field, we 
have dissipation, or losses, taking place, and in other parts, we have energy being added to the flow as a result 
of propulsive effort being expended. Dissipative effects tend to contribute to drag, and propulsive effort tends to 
contribute to thrust, but when both kinds of effects are present in the same flow, as they often are, these connec- 
tions are not clean and simple. And if we look at the drag or thrust on just part of a body, the connections are often 
nonexistent. This is especially true of the pressure forces, and in Section 6.1.6 we'll address some misunderstand- 
ings that can arise in this regard. In most of our discussion of the basic physics of drag, we'll limit our attention 
to the passive case, without propulsion. Even then, to avoid errors, we must look beyond simply resolving the 
forces and come to understand drag in terms of the flow mechanisms responsible for it. Then in Section 6.1.10 
we'll take a brief look at the basic physics issues associated with propulsion. 


Drag, being parallel to the direction of a vehicle's motion through the local air mass, requires work to be done 
in the reference frame of the air mass. The energy to do this work can come from combinations of active propuls- 


ive effort, gravity, motions of the atmosphere, and the vehicle's kinetic energy. And of course when work is 
done, it has thermodynamic implications for the flowfield. These work and energy considerations can be tricky 
to deal with because they look different in different reference frames. We'll touch on this issue again in Section 
6.1.3. 


Our understanding of drag has a complicated history. Most early work in theoretical fluid mechanics assumed 
an inviscid fluid. For bodies without vortex shedding or shocks, inviscid theory predicts zero drag, in contra- 
diction to all experiments, a result known as d'Alembert's paradox. Fluid dynamicists knew what the source of 
the problem was, but devising computationally manageable theories that could predict drag was more easily 
said than done. For drag in external flows, analytic solutions to the Navier-Stokes (NS) equations were feasible 
only for simple 2D bodies (cylinders and spheres) in the limit of low Reynolds number. Attempts to treat high- 
Reynolds-number bluff-body drag in an inviscid framework required empirical input and were not very suc- 
cessful. For streamlined bodies at high Reynolds numbers, a correct understanding finally arrived with Prandtl's 
boundary-layer theory in 1904. Accurate calculations of the total viscous drag, including the pressure contribu- 
tion, didn't come until much later. We'll discuss the theoretical issues involved in Section 6.1.6 and look at the 
particular case of airfoils in Section 7.4.2. 


6.1 Basic Physics and Flowfield 
Manifestations of Drag and Thrust 


In this section, we discuss the overall physical process of drag production, looking first at the basic dissipative 
process internal to the fluid and then at its manifestations in various types of flowfields. 

In flows of viscous fluids, when no active propulsive effort of any kind 1s exerted, a body must experience 
a net retarding force we call drag. It has several types of flowfield manifestations, but in general, an overall 
drag force is a thermodynamic necessity. The viscous shear stresses in the flow result in dissipation into heat, 
which, in keeping with the first law of thermodynamics, requires work to be done. To do work on the flow (in 
the reference frame fixed to the undisturbed air mass), the body must exert a net force on the air in the direction 
of the body's motion, and, in accordance with Newton's third law, the body experiences a force in the opposite 
direction. 

First, let's review some background regarding viscosity and turbulence from previous sections. 


6.1.1 Basic Physical Effects of Viscosity 


In Section 3.6, we considered viscosity in physical terms and saw that it produces internal stresses in the fluid 
that alter the flowfield by transferring momentum, dissipating work into heat, and producing and diffusing vor- 
ticity. In our understanding of drag, the dissipation of work into heat is crucial because it is the ultimate result 
of all forms of drag. In Section 3.6, we saw that viscous stresses arise in the fluid only if the rate of deformation 
is nonzero, and in the discussion in connection with Figure 3.6.1 we noted that in a parcel-centered reference 
frame the stresses always respond to the rate of deformation in such a way that the work done on a parcel of 
fluid is positive and is dissipated irreversibly into heat. The no-slip condition at solid surfaces ensures that the 
fluid will undergo shearing deformation in the boundary layer and thus that shear stress (skin friction) will be 
transmitted to the body surface and that dissipation into heat will take place in the field. 


6.1.2 The Role of Turbulence 


In Section 3.7, we considered turbulence of the kind that appears in boundary layers and wakes and saw that 
its effects on the flow are qualitatively similar to those of viscosity. The transport of momentum by turbulent 
motions has, by and large, the same effects on the time-averaged flow as additional viscous stresses would and 
we refer to these apparent additional stresses as turbulent stresses or Reynolds stresses. The work done locally 
against the Reynolds stresses is transferred directly into the kinetic energy of the turbulent motions. This tur- 
bulence production is nearly always positive, like molecular viscous dissipation. Then the turbulent motions 
themselves contain local, unsteady velocity gradients, which produce molecular viscous stresses, which in turn 
dissipate the turbulence kinetic energy directly into heat (turbulence dissipation). Over most parts of a flow, 
production and dissipation are roughly in equilibrium, and it is as if the energy dissipated by turbulent stresses 
were being dissipated directly into heat. Turbulent stresses thus result in dissipation into heat, much as the time- 
averaged viscous stresses do, and their contribution to the production of drag is qualitatively similar. 

Because the basic physical effects of turbulence and molecular viscosity are so similar, in all of the discussion 
that follows, terms such as “shear stress” and “dissipation” can be taken to refer to either the molecular or tur- 
bulent variety. Just remember that in the turbulent case the overall dissipation process involves the two steps of 
turbulence production and turbulence dissipation. 

In Section 3.7, we saw that there are also important differences between turbulent and laminar flows. Outside 
the sublayer in a turbulent boundary layer, or in a wake, the apparent turbulent stresses are much larger than vis- 
cous stresses, and thus the direct contribution of turbulence to viscous drag is typically large. In most practical 
applications, however, turbulent flow over at least part of the surface is actually beneficial because of its greater 
resistance to flow separation, as we saw in Section 4.1.4. 


6.1.3 Direct and Indirect Contributions to the Drag Force 
on the Body 


The total drag of a body moving through a fluid is simply the flight-direction component of the stress imposed 
by the fluid on the body, integrated over the entire wetted surface: 


D= | (ieTen—pien) dS 
6.1.1 =) 


Here we've expressed the stress acting on each local element of the surface in terms of a component parallel to 
the local surface (the shear stress) and the pressure acting inward perpendicular to the surface. Further, we've 
resolved these two components of the stress into their components in the drag direction (opposite to the flight 
direction) by contracting them with the unit normal vector n outward from the body surface and the unit vector i 
in the drag direction. If we integrate the two parts of Equation 6.1.1 separately, the resulting forces are generally 
referred to as the skin-friction drag and the pressure drag. The skin-friction drag is entirely a result of viscous 
effects (viscosity and turbulence) in the boundary layers on the body's surfaces. The pressure drag is a result of 
a more complicated combination of flow mechanisms, including viscous effects, shocks, and the global effects 
of lift (¢nduced drag). Given enough data defining the distribution of stresses on the surface, resolving the drag 
into a skin-friction part and a pressure part is straightforward, because it involves simply resolving a local stress 
vector into components. Dividing the drag into viscous drag, shock drag, and induced drag according to the 
mechanisms responsible isn't so simple. 


Apart from such questions as to how the drag originates, which we'll get into later, the drag integration itself 
raises interesting issues because the two terms inside the integral in Equation 6.1.1 are so different in character. 
The shear-stress term could be described as “wysiwyg” (what you see is what you get) in that when it is eval- 
uated on a local part of the surface it is correctly seen as a local drag contribution that either adds to the total 
skin-friction drag or subtracts from it in a straightforward way, depending on its sign. 
The pressure-drag term is fundamentally different and cannot be properly seen as a local drag contribution. 
There are reasons for this on two levels: 
1. The local pressure-drag term depends on the overall pressure level in the environment, something that 
should have no direct bearing on drag. The integrated pressure drag depends only on the variation in 
pressure over the surface, and not directly on the overall pressure level. A mathematical statement of this 
that's even more general is that adding a constant to the pressure has no effect on the integrated pressure 
drag of a closed body because a constant pressure integrates to zero in Equation 6.1.1. Thus we can re- 
place p in Equation 6.1.1 with p—pref, where pref 1s any constant reference pressure (it doesn't have to be 
the ambient pressure level), and the integrated pressure drag will be independent of the value of pref. The 
local pressure-drag term in its general p—pDref form cannot be properly seen as a local drag contribution 
because it depends on pref, which can be chosen arbitrarily. But couldn't we turn the pressure-drag term 
into a proper local drag contribution if we made the right particular choice of pref (the freestream pressure 
Poo, Say)? Well, no. There is no reason to expect pw to be the right reference, and besides: 


2. Much of the variation in pressure on the surface of a body is a largely inviscid response to the shape of 
the body and has nothing to do with drag. Thus there is no choice of pref that can turn the pressure-drag 
term into a proper local drag contribution. The extreme example of this is a body in inviscid flow with 
no vortex shedding and therefore zero drag (d'Alembert's paradox). No matter what pref we choose, the 
pressure-drag term will generally have nonzero values except at isolated points, but integrated drag for a 
closed body will always be zero. 


So we see that although the shear-stress term can be seen as a local drag contribution, the pressure-drag term 
makes sense as drag only after it's been integrated over the surface of a closed body. 

This nonlocal character of the pressure drag makes life difficult when we want to assign a separate drag or 
thrust value to a portion of a complete body, which is usually of interest only when we are dealing with active 
propulsion. In the introduction to this chapter, we mentioned the idea of dividing the surface of a vehicle into 
parts and assessing drag or thrust separately on each, and we noted that the integrated pressure drag on a portion 
of a body will often reflect “spurious” contributions that aren't properly seen as either drag or thrust. There are 
two kinds of “spurious” pressure-drag forces that can appear on a portion of a body: 

1. The pressure force on an “unopposed” segment of the surface of a body portion that is not closed. By 
“unopposed segment” I mean a forward-facing segment of surface for which there is no corresponding 
aft-facing segment directly behind it (in the flight direction), or an aft-facing surface segment without a 
corresponding forward-facing segment ahead of it (note that a closed body cannot have any such unop- 
posed segment). The integrated pressure drag on an unopposed segment of a surface depends on pref and 
is thus “spurious.” The only way to avoid this “spurious” force is to subdivide the body in such a way 
that no portion has any unopposed segment. An extreme example of how not to do it would be to divide a 
simple body into a forward portion that is completely forward facing and an aft portion that is completely 
aft facing. Each portion would be completely unopposed, and the integrated pressure drag on it would be 
completely “spurious.” 

2. The mutual buoyancy force that 1s generally present between bodies in proximity even if they're closed. 
Think of two spheres flying in formation, one behind the other, in potential flow. The forward one will 
experience a “thrust” force, and the rear one will experience an equal-and-opposite “drag,” so that the 


total drag of the pair is zero (d'Alembert's paradox again). The forces on the individual spheres are parts 
of a passive inviscid interference effect, one that 1s also present in more-general cases with viscosity. It 
makes no sense to call such forces on portions of a body either “drag” or “thrust” because they make no 
net contribution to the total drag or thrust of the complete body, and they involve none of the flow mech- 
anisms generally associated with drag (discussed below) or thrust (discussed in Section 6.1.10). 


So dividing the surface of a body into parts raises questions about the meaning of the pressure “drag” or 
“thrust” on each, even if care has been taken to not to introduce unopposed surface segments. Now for the re- 
mainder of our discussion of drag in this section, we'll avoid this difficulty and assume a closed body that has 
not been subdivided. 

Viscosity's contribution to the skin friction is obviously very direct, while its contribution to the pressure 
drag is more subtle. In 1910, Prandtl referred to this second contribution as the form resistance, attributing it 
to the presence of rotational flow in the portions of the field directly affected by viscosity (see Durand, 1967a, 
vol. I, p. 363). Even for a streamlined body with attached flow all the way to the back, Prandtl recognized that 
the rotational flow in the boundary layer could cause a pressure interaction contributing significantly to the total 
viscous drag. In Chapter 4, we interpreted this in terms of the displacement effect of the boundary layer, which 
can be relatively subtle if the boundary layer remains attached and can be much stronger if the boundary layer 
separates. 

Setting in motion large volumes of air while producing lift in 3D, and dissipating energy in waves (shock 
waves in air or gravity waves in water), can also contribute to the total pressure drag. While all forms of drag ul- 
timately end in viscous dissipation of work into heat, these two contributions are not conventionally considered 
to be part of the viscous drag. They both involve mechanisms by which energy is convected or propagated away 
from the body, and as a result, much of the dissipation associated with them takes place far from the body. Their 
nearfield manifestations are little influenced by viscosity. The involvement of viscosity in induced drag and 
wave drag is therefore even less direct than in the case of Prandtl's form resistance. 

So three types of flow mechanisms, one that involves viscous effects in the nearfield and two that do not, 
contribute to the total pressure drag. But to what extent can we assign specific portions of the pressure drag felt 
by the body to the specific mechanisms in the flowfield? It turns out that we can do so only approximately, and 
only by appealing to theoretical idealizations. There is nothing about the distribution of the forces exerted on the 
surface that will tell us how much of the drag was caused by which flow mechanism. And looking at the flow- 
field doesn't yield a rigorous definition either. Because the different flow mechanisms overlap and interact, their 
effects do not add in a simple linear way to the total pressure drag, and an exact decomposition of the pressure 
drag into component parts is not possible. However, for practical purposes, it is possible to make an approxim- 
ate decomposition, based on idealized theories regarding what goes on in the flowfield. For example, if the flow 
in the neighborhood of a shock is known, the shock's contribution to the drag can be estimated based on the 
Oswatitsch formula, as described in Section 6.1.4.8. Likewise, if the spanwise distribution of lift is known on 
the lifting surfaces, the induced drag can be estimated using Trefftz-plane theory, which is based on an idealized 
model of the flowfield associated with the given spanloading, valid in the limit of high aspect ratio, as we'll 
see in Section 8.3.4. Later in this section, we'll see a couple of different ways that the total viscous drag can 
be estimated, and if we subtract the skin-friction drag from that, we have an estimate of the viscous “part” of 
the pressure drag. We'll look at examples of how these ideas apply to simple 2D flows. In all of this, we must 
keep in mind that decomposing pressure drag into different “components,” according to the flow mechanisms 
responsible, is an idealization. It 1s a useful one, however, and in practice, predictions of drag increments based 
on these idealized models have proved to be reasonably accurate. 


6.1.4 Determining Drag from the Flowfield: Application of 
Conservation Laws 


In this section, we look at the general problem of inferring the drag from information about the flowfield around 
the body. We consider some of the basic integral conservation laws and some relationships involving enstrophy 
(the local rate of dissipation into heat by the viscous stresses) and vorticity, with particular reference to what 
each one can contribute to a determination of the drag. Some of this discussion follows that found in Yates and 
Donaldson (1986). 


6.1.4.1 The Body-Centered Control Volume 


In looking at the local dissipation of work into heat in Section 3.6, we found it instructive to take the point of 
view of a fluid parcel moving through the field. Now, to consider the global problem of finding the force on 
a body, we take the reference frame in which the body 1s stationary and the fluid streams past it. For steady 
motion, the flow is then steady (turbulence has been time averaged), and we can drop the unsteady terms in all 
our conservation laws. To apply the integral conservation laws, we construct a control volume completely en- 
closing the body, as shown in Figure 6.1.1. For quantities that are conserved, such as momentum or total energy 
(thermal + kinetic + pressure work), we need only consider forces and fluxes at the bounding surfaces of the 
control volume, including the body surface. For other quantities, such as entropy, vorticity, or individual thermal 
or kinetic energy components, production or loss within the volume must also be considered. 





Figure 6.1.1 Control volume for application of conservation laws in body-centered reference frame 





6.1.4.2 Conservation of Energy 


If no mass or heat is transferred through the body surface, and the body is mechanically passive with no moving 
parts, so that no work is exchanged between the body and the fluid in our body-centered reference frame, then 
the conservation of total energy reduces to the statement that total enthalpy is conserved at the outer boundary 
of the control volume (flux in equals flux out). In fact, for air, where the Prandtl number is close to unity, the 


total enthalpy is nearly uniform throughout the flowfield. Note also that turbulence kinetic energy plays no sig- 
nificant role in the total energy balance (see Cebeci and Bradshaw, 1984). Thus, because the body does no work 
on the fluid in this reference frame, the conservation of energy, by itself, tells us nothing about the drag. 


6.1.4.3 Conservation of Momentum (General) 


For a control volume enclosing our flying body as shown in Figure 6.1.1, integral momentum conservation 
gives 


p= | (—pien—puVen) dS 


6.1.2 C+T+T x 
Assumptions: 
1. Conservation of momentum. 
2. Steady flow. 
3. No mass flux through body surface. 
4. Viscous stress negligible on outer control surface. 


Thus if we know the velocity, pressure, and density on a control surface enclosing the body at a moderate 
distance, we can, in principle, calculate the drag. The difficulty with this in practice is that the integrals over the 
outer control surface involve small differences between large contributions of opposite sign, both in the pressure 
and momentum flux terms. Whether we are dealing with measured or computed flowfield data, the momentum 
theorem by itself is not usually very useful without further assumptions. 


6.1.4.4 Conservation of Momentum (Trefftz-Plane Representation) 


One way out of this difficulty is to move the outer control surface so far away from the body that a linearized 
flow model is valid except in the vicinity of the viscous wake. When this is done, the velocity and pressure dis- 
turbances due to the body can be represented in terms of elementary singularities located at the body and along 
the wake. The flowfield around a body producing viscous drag generally has an apparent positive net source 
strength associated with it, and for our purposes, this is the only contribution that matters. Over the upstream 
and lateral portions of the outer control surface, the integrated effects of any higher-order singularities such as 
doublets or vortices can be made to vanish by taking the control surface far enough away. The farfield pressure 
and momentum-flux contributions of the source can be expressed analytically in terms of the effective source 
strength, which can be deduced from conservation of mass. The final result is that the drag can be expressed 
as an integral over the downstream face of the control volume (the Trefftz plane of induced-drag theory, which 
we'll cover in Section 8.3.4). The final result is 
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6.1.3 
Or, introducing the definition of total head (the same as the total pressure only in incompressible flow): 
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we have the alternative form 
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Assumptions: 
1. Conservation of momentum. 
2. Steady flow. 
3. No mass flux through body surface. 
4. Viscous stress negligible on outer control surface. 
5. Subsonic flow in the farfield. 
6. Farfield disturbances due only to simple singularities except near viscous wake. 
7. Conservation of mass. 


In that we now have only to integrate over a single downstream plane, these forms represent a distinct 1m- 
provement over Equation 6.1.2. However, for a general, 3D, lifting body that sheds a vortex wake, the required 
integration is still over a very wide area, and in practice, these forms are of little use when we are dealing with 
numerical or experimental flowfield data. For our purposes, another shortcoming of these forms is that they do 
not help to distinguish the viscous drag from other drag components. Even the appearance of the cross-flow 
kinetic energy in the second term of Equation 6.1.5 does not help much. As we'll see in Section 8.3.4, in Trefftz- 
plane theories of induced drag, the cross-flow kinetic-energy integral is assumed to give the induced drag, and 
we might be led to assume that the remainder of Equation 6.1.5 would give the viscous drag. However, Equa- 
tion 6.1.5 is valid only far downstream of the body, where in the real viscous world turbulent dissipation has 
already substantially reduced the cross-flow kinetic energy. In practice, even if the integral could be computed 
accurately, it would lead us to exaggerate the viscous component of the drag. To distinguish viscous drag from 
induced drag, we will need formulas applicable closer to the body, preferably with the integration limited to the 
viscous wake. This will require a different set of simplifying assumptions, as we will see next. But first we look 
at some special situations in which forms derivable from Equation 6.1.3 are useful. 


6.1.4.5 Conservation of Momentum in Two-Dimensional and Axisymmetric 
Flows 


A planar 2D or axisymmetric flow does not have the trailing vortices associated with lift in three dimensions. 
In this case, the pressure term in Equation 6.1.3 dies out much more rapidly with distance downstream of the 
body and can be treated approximately. If we put our downstream boundary sufficiently far from the body, the 
velocity outside the wake can be approximated by the sum of a uniform freestream and a potential-flow source 
and vortex located at the body, and the pressure can be approximated by the corresponding linearized expres- 
sion. The integrated pressure and momentum-flux contributions from the source and vortex can then be shown 
to cancel. All that remains is the second term in Equation 6.1.3, integrated only over the viscous wake as shown 


in Figure 6.1.2. 


Figure 6.1.2 Airfoil profile drag determined by wake integration far downstream (Equation 6.1.7) 
4 ¥ 





We then have the drag in terms of the momentum area of the wake: 


D= | pt(tis, — tt) dS = po un Aa. 


6.1.6 wake 
In planar 2D flow, the momentum area per unit span is the familiar momentum thickness, and the profile drag 
coefficient is given by 
26 
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where 0 is defined by Equation 4.2.11, with the integration including the entire wake. 





Assumptions: 
1. Conservation of momentum 
2. Steady flow 
3. No mass flux through body surface 
4. Viscous stress negligible on outer control surface 
5. Subsonic flow in the farfield 
6. Except in the region within the viscous wake, farfield disturbances can be represented by a source loc- 
ated on or near the body 
7. Conservation of mass 
8. 2D or axisymmetric flow. 
In this situation, it is much easier to distinguish the viscous drag from other components than in the general 


3D case. Under the above assumptions, the only other drag component that can be present is transonic shock 
drag associated with a bubble of supersonic flow in the near field, which we'll look at further in Section 7.4.8. 
If we wish to infer the drag from a wake profile taken closer to the airfoil than Equation 6.1.7 requires, then 
further assumptions are required. So far, we have not had to make any assumptions about the flow in the near- 
field, such as how the flow separates from the body. Now, we must limit our attention to flow that remains at- 
tached all the way to the back and only separates from a sharp trailing edge (in planar 2D flow) or a sharp tail (in 


axisymmetric flow). In the analysis put forward by Squire and Young (1938), the total (sum of upper and lower 
surfaces) momentum and displacement thicknesses of the boundary layers at the trailing edge are assumed to be 
known, and the momentum integral equation for the streamwise development of the wake is integrated approx- 
imately to give the wake momentum thickness 0. far downstream, as illustrated in Figure 6.1.3. 


Figure 6.1.3 Airfoil profile drag determined by wake integration in nearfield using the Squire- Young formula 
(Equation 6.1.8) 





Assuming the drag to be given as a function of 00 by Equation 6.1.7, we now have a formula relating the 
drag to conditions in the wake close to the airfoil. The formula is ordinarily thought of as requiring trailing-edge 
data, but it should be equally valid for wake data taken at any station xcut between the trailing edge and the 
farfield. As originally derived for incompressible flow, the formula is 
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where 9 and H are defined the same way they were for a single boundary layer in Equations 4.2.10—4.2.12, just 
applied by integration across both the upper- and lower-surface boundary layers. 





Assumptions: 
1. Validity of Equation 6.1.7 far downstream 
2. Incompressible flow 
3. Integration of momentum-integral equation for wake, assuming a particular empirical relationship for 
the variation of the shape factor with edge velocity (linear In(H) versus In(ue. For this to be reasonable, 
there should be no reverse flow downstream of Xcut. 


Corresponding formulas have also been derived for compressible flow (Cook, 1971), infinite-span swept 
wings (Young and Booth, 1950), and axisymmetric bodies (Young, 1939). 


6.1.4.6 Formulas Relating Drag to Dissipation (Enstrophy) 


Starting with the general Trefftz-plane drag formula, Equation 6.1.5, assuming incompressible flow, and invok- 
ing an expression for the generation of entropy by viscosity, Yates and Donaldson (1986) derive the following 
formula for the drag: 





Assumptions: 

. Conservation of momentum 

. Steady flow 

. No mass flux through body surface 

. Viscous stress negligible on outer control surface 

. Incompressible flow 

. Farfield disturbances due only to simple singularities except near viscous wake 


. Conservation of mass 
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. Fourier heat conduction law 
9. Conservation of entropy. 

Note that the integrand in the volume integral (the enstrophy) makes a contribution only over the vortical 
part of the flowfield (boundary layer and viscous wake). Note also that all of the terms in the second integral 
(over the Trefftz plane) are of second order in disturbance quantities, so that the integral vanishes if we take 
the Trefftz plane far enough downstream. (For a lifting body with a trailing vortex system, the distance required 
could be very large.) In this limit, the volume integral approaches the total drag asymptotically, and we have 
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1. Same as for Equation 6.1.9, plus 
2. integral includes all significant dissipation. 


In the next section, we apply this formula to two simple flowfields and find that it provides interesting insight 
into where in the flowfields the drag “originates.” 


6.1.4.7 Inferring Drag from Vorticity and Total Pressure at a Single 
Downstream Plane 


A theoretical framework originally developed by Maskell (1972) allows both the induced drag and the viscous 
drag to be inferred from the velocity (vorticity) and total-pressure fields sampled only in the viscous wake at a 
single cross-plane behind the body. The approach was later extended by Brune and Hallstaff (1985) and applied 
to experimental data specially obtained for the purpose. This type of analysis could also be applied to computed 
flowfields, but in that case flowfield data are more easily available, and there would be less incentive to limit 
the input data to a single cross-plane. The physical assumptions involved in using a single cross-plane almost 
certainly limit the configurations for which accurate results can be obtained, but these limits have not been fully 
determined. For example, given data only in a single cross-plane, the x-component of the vorticity is the only 
one determined, and it is assumed that the induced drag can be computed based only on the x-component. This 
assumption is likely to be better in the farfield than in the nearfield, which implies that the data plane should 
be taken far from the body. However, to minimize the effects of dissipation, which tend to reduce the apparent 


induced drag and inflate the viscous drag, the inclination has been to place the data plane close to the configur- 
ation, where the assumptions of the method appear to be quite limiting. In spite of these questions, results for 
configurations with high-aspect-ratio wings have been good. 


6.1.4.8 Shock Drag and the Oswatitsch Formula 


In Section 3.11, we discussed the major physical properties of shocks. Whether a shock is normal to the flow 
or oblique, it doesn't generate any significant shear stresses, but passage of air through a shock 1s a dissipative 
process that reduces total pressure and increases entropy. Thus when a shock appears in the flowfield around a 
body, the dissipation in the shock itself contributes to the drag through the pressure term in Equation 6.1.1. A 
shock can also interact with the boundary layer on the body surface, causing a thickening of the boundary layer 
and an increase in the viscous form drag, an effect we'll consider in more detail in Section 7.4.8 in connection 
with transonic airfoils. Here we'll concentrate on the more direct contribution to drag of the shock dissipation 
itself. 

By invoking conservation of momentum for steady flow through a control volume surrounding a body, 
Oswatitsch (1956) derived the following formula for the drag: 


D-= — || (S—S.)oV ends. 
6.1.11 Vee fw Shock 
Assumptions: 


1. Conservation of momentum. 

2. Steady flow. 

3. No mass or energy flux through body surface. 

4. Viscous stress negligible on outer control surface. 

5. Farfield disturbances accounted for only to first order. 


Tognaccini (2003) derived a higher order version of this formula with terms to second order in (S — So). 
Note that Equation 6.1.11 represents the shock drag only if the shock is the only source of entropy increase in 
the flow. This requirement is not satisfied in real viscous flows because of entropy generation in the boundary 
layers and viscous wake. Still, this equation is often used for estimates of the contribution of the shock to the 
drag. In calculations of this kind, the control volume is taken to surround only the shock itself, so that only 
the entropy increase through the shock itself 1s integrated. This 1s somewhat justified, given that the integrand 
represents the flux of entropy excess, and in upstream and downstream regions where no entropy is being gen- 
erated, the flux is constant. But it ignores the fact that in real flows, the streamtube that passes through the shock 
will eventually be entrained into the boundary layer and/or the wake, where further entropy will be generated. 
Use of the formula in this way ignores nonlinear interactions between the entropy-generation mechanisms in 
the shock and the viscous layers. 

In Section 3.11, we also discussed how a shock can be represented in a computational fluid dynamics (CFD) 
solution in potential flow, in which the flow is treated as isentropic and irrotational. In such representations of 
shocks, there can be no entropy change, so Equation 6.1.11 cannot be used directly. The fictitious momentum 
jump that is generated at a shock in such flow solutions is sometimes used as an estimate of the shock drag, but 
it is significantly in error, as discussed by Steger and Baldwin (1971). The best alternative is to interrogate the 
calculated flow conditions upstream of the shock (the perpendicular Mach number), to use the normal-shock 
relations to calculate what the entropy jump would be, and then to use that as the input to Equation 6.1.11. 


6.1.5 Examples of Flowfield Manifestations of Drag in 
Simple 2D Flows 


Here we apply the principles developed above to a thin flat plate and a 2D airfoil to illustrate the manifestations 
of drag in flowfields with and without pressure gradients. For these two examples, we have carried out CFD 
calculations and plotted several quantities relevant to the drag to show how they vary with distance along the 
flow. The calculations are for a chord Reynolds number of 10° and are based on boundary-layer equations (spe- 
cifically the computer code developed by McLean, (1977)), which should be valid except for some flow details 
near the trailing edges. The quantities plotted are 
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The quantities 20/c and IENs are both valid farfield expressions for the drag, according to Equations 6.1.7 and 
6.1.10, respectively, so both are expected to approach the total drag asymptotically at large distances down- 
stream. 


6.1.5.1 Thin Flat Plate 


For discussing viscous drag in external flows, this is the simplest case. As illustrated in Figure 6.1.4, boundary 
layers form on both sides of the plate, laminar at first, followed by transition to turbulence. At the trailing edge, 
the boundary layers merge to form a wake that continues downstream. Because the plate surface has no pro- 
jection in the x direction, the pressure contribution to the drag in Equation 6.1.1 is zero, and the drag on the 
plate is entirely skin friction. Thus IcF at the trailing edge is equal to the total drag coefficient. Also, because 
we have assumed no pressure disturbance in our calculation, IcF and 2 @/c are equal all along the plate and into 
the wake. So for this very special case, the Trefftz-plane drag Equation 6.1.3 is applicable at any station in the 
flow, not just in the farfield. The enstrophy integral IENs shows that dissipation has accounted for about 92% of 
the drag by the time the trailing edge is reached and that the drag is essentially completely accounted for about 
50% chord downstream of the trailing edge. 


Figure 6.1.4 Computed flow-field quantities for turbulent flat-plate flow, R = 10’. Calculations by P.P. Sulli- 
van using method of McLean (1977) 
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6.1.5.2 Two-Dimensional Airfoil 


The assumed pressure distribution and computed flowfield results for a hypothetical 2D lifting airfoil are shown 
in Figure 6.1.5. 


Figure 6.1.5 Computed flow-field quantities for turbulent flow on a hypothetical lifting airfoil, R = 10’. Cal- 
culations by P.P. Sullivan using method of McLean (1977) 
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The differences between this and the flat plate, resulting from the strong pressure disturbance of the airfoil, 
are immediately apparent. The sum of IcF at the trailing edge is only 70% of the total drag; the remainder is 
the form drag. (Remember that the form drag is a viscous effect, since 2D, incompressible potential flow would 
have zero drag.) 

IENS at the trailing edge accounts for about 90% of the drag, slightly less than the corresponding 92% for the 
flat plate. Again, we need go ealy 50% chord downstream of the trailing edge before dissipation has accounted 
for essentially all of the drag.! 

For a symmetrical, nonlifting airfoil or a slender axisymmetric body, the pressure gradient effects would gen- 
erally be weaker, and we would expect the results to be intermediate between those of the lifting airfoil and the 
flat plate. 


6.1.6 Pressure Drag of Streamlined and Bluff Bodies 


We've discussed three flowfield mechanisms that contribute to pressure drag: 


1. Viscous effects in the nearfield, either the relatively subtle displacement effect of an attached boundary 
layer or the much stronger effect of a separated boundary layer; 


2. The global effects of lift in 3D (induced drag); and 
3. Energy dissipation in waves, either shocks or gravity waves. 


The first mechanism involves immediate dissipation into heat in the viscous layers near the body surface and 
just downstream. The last two are largely “inviscid” in their nearfield manifestations, but they too eventually 
produce dissipation into heat. 


There is a common tendency to think of pressure drag in terms of the local surface pressures on parts of the 
surface and to think that the pressures can be manipulated by design of the surface shapes, in an essentially 
inviscid way, so as to influence the pressure drag. In this way of thinking, there is a simple criterion for as- 
sessing design changes that affect the pressure distribution: Low pressures on forward-facing surfaces and high 
pressures on aft-facing surfaces lead to low drag, and vice versa. This could lead us to think, for example, that 
it would be favorable to place an engine inlet so that it imposes higher-than-ambient pressures on an adjacent 
aft-facing surface. Another example might be to think that it would be favorable to place a blister fairing in an 
area on the surface where the pressure gradient 1s positive (increasing pressure in the flow direction) so that the 
volume of the fairing experiences a favorable “buoyancy” effect. 

But this kind of thinking is mistaken more often than not. A simple counterexample shows why it is unreli- 
able in general. Consider a body in an inviscid potential flow with no shocks and no shedding of vorticity. The 
pressure drag in this case must be zero, the result known as d'Alembert's paradox, as we saw in Section 5.4. The 
erroneous line of thinking says we ought to be able to put a blister fairing on a part of this body where the pres- 
sure gradient 1s positive, and thereby reduce the drag. But because we're starting from zero drag, the drag would 
have to become negative, which is impossible. So in global terms, we see that the addition of a blister fairing in 
this simple case cannot reduce the pressure drag, nor increase it, for that matter. Looking at it locally, in terms 
of the local pressures on the surface, the blister fairing may indeed produce a favorable incremental buoyancy 
effect on its own surface, but the pressures elsewhere on the body must change in response to the addition of 
the fairing so as to cancel that effect and make the total pressure drag zero again. Such pressure changes on the 
rest of a body will often be subtle and difficult to see on a pressure plot, but they can act over large areas of the 
surface and cannot be ignored. 


So in a simple inviscid case with no shocks or vortex shedding, changes to the pressure distribution cannot 
affect the pressure drag. In more general situations in which the pressure drag is nonzero, the same mechan- 
ism can still come into play, in which a change to the pressure distribution on one part of the body is offset 
by changes diffusely distributed over the rest of the body, with the result that there is little or no change to the 
total pressure drag. The key point to remember is that a change to the pressure distribution on the surface can- 
not change the pressure drag unless one or more of the pressure-drag-producing mechanisms in the flowfield 
is affected. So for purposes of understanding how a change in the surface pressures might affect the pressure 
drag, it can be very misleading to think just about the surface pressures themselves. One must consider how the 
change might affect either the viscous dissipation in the field, the lift distribution, or the strength of any shocks. 
It is good to be in the habit of asking “where in the field is the drag being ‘produced,’ and how is this change 
likely to affect that process?” Even if the result of asking that question is uncertainty, that answer is preferable 
to being misled by oversimplified reasoning. 

In Sections 5.2 and 5.3, we discussed the distinction between streamlined bodies with attached flow all the 
way to a reasonably sharp “tail,” and bluff bodies with large regions of separated flow at the back. For stream- 
lined bodies, skin-friction drag dominates. In the 2D lifting-airfoil example above, we saw that skin-friction 
drag accounted for roughly 70% of the total, with pressure drag due to boundary-layer displacement accounting 
for roughly 30%, a breakdown that can vary depending on the thickness ratio, the lift, and the detailed pres- 


sure distribution. Pressure drag tends to be a smaller fraction in axisymmetric flow than in 2D planar flow. The 
pressure-drag fraction for a commercial-airplane fuselage, for example, is typically less than 10%. Because the 
skin friction is such a dominant part of the drag for a streamlined body, for purposes of drag estimation it 1s 
often convenient to relate the total drag to an idealized skin-friction drag through a form factor, which we'll 
discuss in Sections 6.2.1 and 7.4.2. 

As we saw in Section 5.2, there is a range of possible nonstreamline flow patterns in which separation occurs 
other than at a sharp tail, either because an otherwise streamlined body is at an angle of attack or because the 
body shape isn't amenable to streamline flow. Separations of the 3D open type (Sections 4.1.4 and 5.2.2) don't 
by themselves form recirculating “deadwater’” wakes, but they do result in the shedding of vortex sheets that 
tend to roll up into streamwise vortex cores. Examples of this type of separation are seen on the highly upswept 
aft fuselages of military transport airplanes and along the sides of the sloping roofs of some hatchback cars. 
The pressure drag that results from such separations can be quite high and 1s similar to the lift-induced drag of 
wings, which we'll discuss in detail in Sections 8.1 and 8.3. 

Classic bluff-body flow is characterized by separation of the closed type that produces a substantial recircu- 
lating wake. For a body with separation of this type, skin friction tends to be only a small fraction of the total 
drag. Pressure drag dominates because of the strong displacement effect of the large separated region, and the 
frontal area of the body becomes a more relevant reference quantity than the wetted area or the skin friction. 
The pressure drag of a bluff body depends on the thickness of the separated wake, which depends on both the 
cross-sectional area of the body and the location of separation. When separation takes place at sharp edges in 
the pattern we discussed in connection with Figure 5.2.2c, the separation location is independent of Reynolds 
number and surface condition, and for all but very low Reynolds numbers, the drag coefficient will tend to be 
nearly independent of Reynolds number. 

On bodies with smooth contours such that separation is not fixed by sharp edges, the flow pattern and the 
drag vary more strongly with Reynolds number and can be influenced by surface roughness. For example, Fig- 
ure 6.1.6a shows the drag coefficient of a smooth sphere over a wide range of Reynolds number. At very low 
Reynolds numbers, it follows the Stokes-flow solution (creeping flow, in which inertial effects are negligible) 


‘ . | 
for which Ca ~ Re . Above Rq = 1, inertial effects become significant, and the drag diverges upward from 
the Stokes solution. At Ra ~ 50 there is a small separated region behind the sphere, and at Ra ~ 130 the separ- 
ated region is longer and becomes unsteady. The variation of Cq with Rq is surprisingly smooth in spite of these 
changes in flow pattern. 


Figure 6.1.6 Drag behavior of a smooth sphere over a wide range of Reynolds number in incompressible flow. 
(a) Drag versus R curve defined by data from numerous experiments plotted by Hoerner (1965). (b) Surface 
pressure distributions for high Reynolds numbers with laminar and turbulent separation, plotted versus angle 
from the front stagnation point 
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At higher Reynolds numbers (up to about 4 x 10° ), the boundary layer remains laminar to the separation 
point. Laminar separation occurs well forward, only about 80° from the front stagnation point, leading to a large 
separated wake with considerable Ee gra and a roughly oe drag coefficient of about 0.5 over the 
range of Reynolds number from 1 x 10° to 4 x 10°. Above 4 x 10° , the boundary layer transitions to turbulent 
upstream of separation, the separation point moves back to about 140°, and the drag coefficient drops abruptly 
to about 0.1. Figure 6.1.6b shows the surface pressure distributions for these cases with laminar and turbulent 
separation, compared with potential-flow theory. In both cases, separation reduces the level of the suction peak 
ahead of separation and produces a leveling off of the pressure downstream of separation, but the effects are 
much more dramatic in the laminar case, due to the earlier separation: Not only is the separated region wider, 
the pressure downstream of separation is lower. 


For bodies that exhibit this kind of behavior, the Reynolds number at which the separation switches from 
laminar to turbulent is often called the critical Reynolds number, and the abrupt drop in the drag coefficient 1s 
called the drag crisis. A golf ball is one such body, and its dimpled surface is a purposeful adaptation to the 
Reynolds-number range in which it flies. Over the range of speed achievable by even the strongest golfer, the 
Reynolds number is below the critical value for a smooth sphere. The purpose of the dimpled surface is to pro- 
voke early transition and thereby move the drag coefficient toward a lower post-crisis level. 


6.1.7 Questionable Drag Categories: Parasite Drag, Base 
Drag, and Slot Drag 


The term “parasite drag” is often used to refer to the entire portion of an airplane's drag other than induced drag, 
and has even found its way into the dictionary (Websters, 1976). “Parasite drag” thus has a workable defini- 
tion, as long as we're willing to have it depend on a theoretical value for the induced drag. (Recall from Section 
6.1.3 that induced drag doesn't have a rigorous definition apart from the idealized theory.) But there is a semant- 
ic problem in that the term imputes a particular character to the drag that isn't logically justifiable. In general, 
“parasite” refers to something that is supported by something else and gives nothing in return (paraphrased from 
Websters, 1976). The viscous drag of the wings and fuselage of an airplane is a necessary byproduct of produ- 
cing lift, and housing structure and payload. These functions can't be said to “give nothing in return,” so calling 
their byproduct a “parasite” makes no sense. 

If a bluff body or 2D airfoil-like shape is “squared off’ in back so that separation takes place at two corners 
as illustrated in Figure 5.2.2c, the “additional” drag due to the squared-off base is often called “base drag.” The 
problem with the term “base drag,” however, is that it has no defensible definition. Here are the three logically 
possible candidates I know of, along with the reasons why they don't work: 

1. Define base drag as the portion of the actual drag that is somehow caused by the presence of the blunt 
base. 


Problem: While it's true that a body with a squared-off base tends to have higher drag than one that 
doesn't, there is no good way to isolate the portion of the drag that can be “blamed” on the base. Whether 
we assess the drag in terms of the stresses on the body surface (Equation 6.1.1) or the dissipation in the 
field (Equation 6.1.9), there is no rigorous way to divide either the surface or the volume to decompose 
the drag. 

2. Define base drag relative to the drag of a comparable sharp-tailed body. 

Problem: There is no unique way to define the comparable body. 


3. Define base drag as the deficit in the static pressure on the base (which does tend to be nearly constant) 
relative to ambient pressure, times the area of the base. 


Problem: There is no reason why ambient pressure is the relevant reference, as we noted in Section 
6.1.3. In Section 6.1.6, we discussed how misleading it can be to try to infer differences in drag from local 
static pressures, and this situation is no exception. That this definition doesn't work is evident from two 
counterexamples. We'll see in Section 7.4.1 that a streamlined airfoil typically has a region of positive 
Cp (pressure higher than ambient) near the trailing edge. We could truncate the airfoil within that region 
and have positive Cp on the base, and the current definition would then assess negative base drag, which 
makes no sense. The example of a sphere is also instructive. A sphere doesn't have a squared-off back 
end, but it does have a large separated region in the back. Note that in the case of turbulent separation in 
Figure 6.1.6b, the surface Cp in the separated region is positive. But it certainly doesn't make sense to say 
that a sphere has negative base drag. 


So it seems that defining base drag 1n a rigorous quantitative way is hopeless. And without a workable defin- 
ition, “base drag” seems questionable, even just as a concept. 

Hoerner (1965) in his section III.8 uses the term “base drag” but doesn't provide an explicit definition. He 
presents experimental values of “base drag” in his Figures 37 and 39 but doesn't specify whether they were de- 
termined by comparison with sharp-tailed baseline bodies (candidate definition 2 above) or by measurements 
of base pressure (candidate definition 3 above). In his Figure 38, he plots total drag for a series of bodies that 
includes a sharp-tailed baseline, which suggests that he is thinking in terms of candidate definition 2. Whatever 
the definition, he proposes that “base drag” is determined by a “jet-pump mechanism,” which is supposed to 
work as follows (paraphrased): 


The “jet” is the outer flow, “placed like a tube around the space behind the base,” and it “mixes with the 
dead air and tries to pump it away,” thus reducing the base pressure. The separated boundary layer has an 
“insulating” effect that reduces the effectiveness of the jet pump. The longer the forebody, the thicker will 
be the boundary layer that separates from the edge of the base and the less effective the jet-pump mechan- 
ism will be in reducing the base pressure. The “base drag” thus correlates with a parameter related to the 
fineness ratio of the forebody and decreases with increasing fineness ratio. 


The data presented by Hoerner support the correlation reasonably well, but I propose that it 1s not for the 
reason that he supposes. The problem with Hoerner's explanation 1s that it ignores the dominant role of the outer 
inviscid flow in determining the pressures on the body. The separated shear layer that forms the outer boundary 
of the near wake can support very little pressure difference across it, so the base pressure must be essentially the 
same as the static pressure in the inviscid flow outside the boundary layer where it separates from the edge of 
the base. This static pressure 1s a result of the external inviscid flow, as influenced by the effective displacement 
shape of the forebody and the separated wake downstream. The thickness of the boundary layer at separation 
influences the effective shape of the wake to some extent, but the static pressure at separation, and thus on the 
base, is much more strongly driven by the overall shape and fineness ratio of the body, through the inviscid 
flow. So the correlation of the “base drag” with forebody fineness ratio is much more of an inviscid-flow effect 
than a result of any “jet-pump mechanism.” The data appear to be just as consistent with this explanation as 
with Hoerner's. 

We've seen why base drag is a questionable concept. In a similar vein, airfoils with high-lift slots, which 
we'll discuss in Section 7.4.4, tend to have higher drag than airfoils without slots, and the excess is sometimes 
referred to as “slot drag.” This terminology has the same problem: There is no good way to isolate the portion 
of the drag that can be “blamed” on the slot. The slot doesn't represent a mechanism that can produce pressure 
drag that is separate from the viscous drag. In a 2D inviscid, shock-free flow, a slotted airfoil has zero drag, just 
like any other airfoil. In the real world, a slot imposes strong pressure gradients and curvature effects on the 
boundary layers, as well as separation and reattachment if the slot has a “cove” (see Section 7.4.4, especially 
Figure 7.4.18). These are things that tend to increase the total viscous dissipation in the field, but they do not 
make slot flows in any way unique, and they do not define a lower limit on the amount of “additional” drag a 
slot must cause. 


6.1.8 Effects of Distributed Surface Roughness on 
Turbulent Skin Friction 


In Section 4.7, we briefly listed the various ways surface roughness can affect an aerodynamic flow. In this sec- 
tion, we look at how surface roughness can increase turbulent skin friction, which can influence total drag even 
when there is no change in transition or separation. We'll look at basic physical explanations of how the skin- 


friction increase comes about, the associated scaling issues, and empirical data for different types of surfaces. 
Then, in Section 6.2.2, we'll look at methods for using this kind of information in drag estimation. 

In turbulent flow, surface roughness elements that are sufficiently large are known to cause significant 1n- 
creases in area-averaged skin friction. It seems intuitively obvious that they should, but explaining how this 
happens is trickier than one might expect. The roughness elements are like small bluff bodies immersed in 
the highly nonuniform and unsteady flow at the bottom of the boundary layer, and they are often so closely 
spaced that the flow around each element is strongly affected by neighboring elements. The physical explan- 
ations offered in most of the classical sources tend to be incomplete at best and sometimes contradictory (see 
Bradshaw, 2000). But before we address explanations, let's look at how these skin-friction increases have actu- 
ally been observed to behave, first in the idealized situation of uniform roughness along the entire length of a 
flow and then locally in more-general flow situations. 

The effects of two different types of roughness on the average skin friction coefficient C ! of a turbulent 
flat-plate boundary layer are plotted in Figure 6.1.7. The solid roughness curves are for standard sand-grain 
roughness, which consists of a single layer of densely packed grains of uniform diameter ks and was the focus 
of many early investigations of roughness effects both in pipes and boundary layers. Curves are shown for three 
different grain sizes relative the length of the plate. These are idealized curves based on the pipe-flow measure- 


ments of Nikuradse (1933), see Figure 20.21 in Schlichting (1979), for example), converted to flat-plate C f 
by the “equivalent-flat-plate method” that we'll discuss in Section 6.2.2. Starting at a low Reynolds number, 
each curve follows the smooth-surface curve (nominally the same as the turbulent curve in Figure 4.3.1) up to 


a critical Reynolds number and then diverges from it and eventually levels off. Before C ! levels off, it dips 
to a minimum about 5% below the final value. This general progression has led to the definition of three re- 
gimes: a hydraulically smooth regime before the divergence, a fully rough regime in which C ! is independent 
of Reynolds number, and a transitionally rough regime in between. As pointed out by Bradshaw (2000), it is 
unfortunate that this “transition” terminology came to be applied to roughness in turbulent flow, as it risks con- 
fusion with laminar-to-turbulent transition, which is unrelated. But it is frequently used in the literature, and for 
lack of anything better, we'll continue to use it. 


Figure 6.1.7 Average turbulent skin friction coefficient for a flat plate of length Z in a smooth condition and 
uniformly covered with roughness of two different types: standard sand-grain roughness (densely packed sand 
grains of uniform diameter k), shown by the solid curves, and roughness typical of commercially rough pipes, 
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Standard sand-grain roughness 1s a highly artificial kind of surface, and surfaces more representative of those 
usually found in real-world applications don't generally show the same kind of behavior in the transitional re- 
gime. A well-known example of a real-world surface 1s documented in the measurements by Colebrook (1939) 
in commercially rough pipes, which served as part of the basis for the famous Moody chart for resistance in 
pipe flow (Moody, 1944). The dashed curves in Figure 6.1.7 are flat-plate roughness curves derived in the same 
way as the sand-grain curves (solid), but based on Colebrook's pipe data instead of Nikuradse's. In Colebrook's 
curve-fits, the drag of a rough surface approaches the smooth-surface curve only asymptotically with decreasing 
Reynolds number, and there is no critical Reynolds number below which the effect of roughness is taken to be 
zero, so that there is no hydraulically smooth regime, strictly speaking. However, there 1s a fully rough 1 regime 


in which C } is independent of Reynolds number, just as there is for standard sand-grain roughness. Cy in the 
transitional regime does not dip below the fully-rough value, as it did in the standard-sand-grain case. 

For a real-world surface, there is generally no unique physical scale length corresponding to the grain dia- 
meter that we used for uniform sand grains. As a substitute for defining a length scale based on roughness geo- 
metry, Schlichting proposed the concept of equivalent sand-grain roughness (see Bradshaw, 2000), in which the 
surface is characterized by the roughness height ks of the standard sand-grain roughness that would produce the 
same skin-friction coefficient in the fully-rough regime. Obviously, knowing the equivalent sand-grain-rough- 
ness height for a surface provides a reliable indication of its drag behavior only in the fully rough regime, be- 
cause different kinds of surfaces have different behaviors in the transitional regime. 

The relationship between the equivalent sand-grain height ks and the actual physical height h of the rough- 
ness varies greatly, depending on the nature of the surface. Even if the height of the roughness elements is uni- 
form, the drag depends strongly on the shape of the elements and on the density of the surface coverage. Waigh 
and Kind (1998) correlated data for uniform roughness elements of various shapes (e.g., spheres, cylinders, rect- 


angular blocks) and with a wide range of coverage densities and found that the ratio ks/h varied from 0.1 to 
almost 30 depending on shape and area coverage. 

When the height of the roughness elements is not uniform, there is no single height h to characterize the 
surface, so an average measure such as Rag (the average of the absolute value of the deviation from the mean sur- 
face) or krms (the rms deviation) is often used, usually measured in practice by a stylus profilometer. We should 
expect the ratio ks/Ra or Ks/Krms to vary depending on the nature of the surface, but not over as wide a range 
as ks/h does for roughness of uniform height. Much of the variation in the ks/h values correlated by Waigh and 
Kind was due to the wide range of surface coverage. When coverage is sparse, the drag penalty is reduced, but 
so are Ra and krms, so that using Ra or krms instead of h partly compensates for this effect. But the ratio ks/Ra or 
ks/Krms still varies depending on the nature of the surface. 

Basic roughness geometry is another factor that must be kept in mind when using Ra or krms. These average 
measures are typically considerably smaller than the maximum element size (diameter). This is because max- 
imum deviations are, of course, larger than average deviations, and diameter is larger by another factor of 2 
because it is a peak-to-peak measure rather than a deviation from the mean. A careless error that sometimes 
arises 1s to forget this and to take Ra or krms to be interchangeable with ks. This can lead to a serious underestim- 
ation of the roughness effect, because Ra and krms are typically almost an order of magnitude smaller than ks. A 
simple geometric argument shows that this is true even for standard sand-grain roughness. If we ran a stylus pro- 
filometer over a standard sand-grain surface, the stylus would rarely penetrate into the crevices between grains 
to more than about a half grain diameter below the peaks, because of the relatively blunt shape of the stylus, and 
the resulting Ra or krms would be a small fraction of the grain diameter. 

In Figure 6.1.7, we looked at the global effect of roughness in the idealized case of a flat plate with roughness 
that is uniform over the whole surface. In flow situations that are more general, with roughness distributions 
that may be nonuniform, we must deal with the effects of roughness locally. Unless the characteristics of the 
roughness change too suddenly as we move along the surface, the local effect is closely determined by the rela- 
tionship between the roughness geometry and the mean velocity profile in the wall region of the boundary layer, 
which we discussed in Section 4.4.2. As we saw there, for a smooth surface, this wall-region velocity profile 
is very close to being a function only of the viscosity and the shear stress at the surface. The velocity profile 
expressed in wall variables is then universal, and the functional relationship ul = fen(y") is known as the Jaw 
of the wall, which applies to the viscous sublayer and the logarithmic region, as illustrated in Figure 4.4.6. The 
effect of roughness on this local velocity profile is to shift the logarithmic portion of the velocity profile down- 
ward by a constant Au’ as shown in Figure 6.1.8. 


Figure 6.1.8 Shift, due to roughness, in the logarithmic portion of a turbulent wall-region velocity profile 
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The size of the shift depends on the roughness Reynolds number k", with a functional relationship Au'(k’) 
that depends on the particular roughness geometry. The logarithmic velocity profile is then given by: 


uT = Ing? + B— Aut (k™), 

6.1.16 K 
where « and B are the same as they were for a smooth surface. Roughness thus adds one parameter, k", and one 
function, Au'(k’), to the problem. 

The functional relationship Au'(k’) is a fixed characteristic of the particular roughness geometry, and the 
value of Au" ina particular situation is the dominant factor determining the local increase in skin friction coef- 
ficient, ACf. In general, the larger the shift Au’, the larger AC¢ will be. But because Cf is generally normalized 
by a velocity outside the wall region (Ue or Uo usually), ACf also depends to some extent on the global flow 
conditions. To estimate the rough-surface Cf in a given flow situation, the first thing we need to know is the 
function Au’ (k’) for the surface in question. Then we need to use that information to estimate ACf. The function 
Au’(k') can be incorporated into a turbulence model for use in detailed CFD calculations, or the effect of the 
roughness can be estimated for a flat-plate flow that is assumed to be similar to the target application. In Section 
6.2.2, we'll discuss both of these approaches in more detail. 





The shift functions Au'(k’) for three types of roughness are shown in Figure 6.1.9a, including standard- 
sand-grain and Colebrook's commercially rough pipe surfaces, whose global flat-plate behavior as a function 
of Reynolds number was shown in Figure 6.1.7. What these shifts mean in terms of AC¢/Cf as a function of k* 
depends on the global flow situation. AC¢/C¢ for the particular case of a flat-plate boundary layer at a length 
Reynolds number of 107, is shown in Figure 6.1.9b. Looking first at Au"(k’) for the standard sand-grain rough- 
ness, we see that the divergence from smooth-surface behavior takes place at about k" = 5. The fully rough 
regime starts at about k" = 70, after which Au'(k’) increases in a straight line. The length scales & for the other 
roughness types are the equivalent sand-grain roughness heights, defined so that the curves match the standard- 
sand-grain curve in the fully rough regime. 


Figure 6.1.9 Local behavior of three types of distributed roughness. (a) Log-law shifts. (b) Relative change in 
local Cr 
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The question of whether the drag penalty for standard-sand-grain roughness is actually zero up to k" = 5 has 
not been settled. The experimental data on which the curve-fits are based show considerable scatter. Bradshaw 
(2000) has pointed out that a curve-fit in which there is no finite hydraulically smooth regime, for example, Au" 
= 0.02(k*)* for k" <7, would fit the data just as well as the curve in Figure 6.1.9a does. The C¢ increase at kr 
= 5 might be a couple of percentage instead of zero, but is still small compared with about 20% for commercial 


pipes. 


The Au” and ACf curves of Figure 6.1.9 make it even more clear what a wide range of behavior different 
types of surfaces can display in the transitionally rough regime and why the concept of equivalent sand-grain 
roughness doesn't apply there. To show that the standard-sand-grain and commercially rough pipe curves don't 
bracket the range of possible behavior, I included Hama's (1954) curve for glass beads with a Gaussian size 
distribution, which 1s seen to lie far outside the range defined by the other two types. The nonuniform element 
size might lead one to expect behavior intermediate between uniform sand and commercial pipe roughness, but 
instead this surface shows behavior that is roughly twice as far from commercial pipes as the standard-sand- 
grain surfaces are, and Hama's curve-fit defines an even wider hydraulically smooth regime. I have seen no 
convincing explanation for this in the literature. The message we should take from Figure 6.1.9 is one of caution 
regarding roughness in the transitional regime. In the neighborhood of k” = 10, the Cr increase can be anywhere 
from near 0 to about 30%. This high level of uncertainty is unfortunate, as the transitional regime is the range 
of the greatest practical importance in aeronautical applications. There is much less uncertainty in the fully 
rough regime, but roughness in that range is very rarely tolerated in applications because Cf is nearly double its 
smooth-surface value at the start of the regime and goes up from there. 

Now that we've seen what the behavior looks like, how do we explain it physically? The specifics, such as 
the details of the different roughness curves in the transitional regime, seem to be beyond the reach of simple 
qualitative explanations. But we should at least be able to explain the general trend: Whether or not there is a 
finite hydraulically smooth regime in which the effect is zero, the drag increment eventually “takes off’ and 
increases monotonically for larger roughness Reynolds numbers. Explanations for this that have appeared in the 
classical sources are often fuzzy and in some cases obviously incorrect. 

In one category of explanation, a major distinction is drawn between the sublayer and the rest of the boundary 
layer outside it. It is assumed to be obvious that roughness elements that are submerged in the sublayer should 
have little or no effect and that the farther the elements protrude outside the sublayer, the more drag they should 
cause. But a detailed argument as to why this should be so is not offered. Some versions are clearly under the 
influence of an erroneous interpretation of a “laminar sublayer,” while in others it is ambiguous as to how the 
sublayer is being interpreted. Here is a sampling: 


¢ Hoerner (1965): “there is always a comparatively thin sublayer within which stable laminar flow is pre- 
vailing. As long as the protuberances of a rough surface are deeply enough submerged within the sublay- 
er, the surface is, therefore, hydrodynamically equal to a smooth one.” 

¢ Schlichting (1979): “It is clear that roughness will cause no increase in resistance in cases where the 
protrusions are so small (or the boundary layer is so thick) that they are all contained within the laminar 
sublayer.” 

¢ White (1991): “even a small roughness will break up the viscous sublayer and greatly increase the wall 
friction.” 


Hoerner seems to have been misled by the terminology into thinking the flow in the sublayer is actually 
“laminar” and presumably steady (“stable’’). We saw in Figure 2.1f that the flow in the sublayer is far from lam- 
inar, and we've noted that the only thing “laminar” about the sublayer is that the Reynolds stress is effectively 
zero due to the effect of the wall in constraining the turbulent motions that are large enough not to be damped 
much by viscosity (see Section 4.4.2 and Equation 4.4.2). Schlichting doesn't explicitly refer to “laminar flow” 
in the sublayer, but it is implied that there is something different about the sublayer. White uses the more mod- 
ern “viscous” to designate the sublayer, but he doesn't explain what he means by “break up.” If the sublayer 
isn't laminar, what is there about it that is prone to being broken up? In any case, whether these explanations are 
based on correct interpretations of the sublayer or not, they don't provide much of a physical explanation as to 
why the skin friction increases when the roughness elements protrude outside the sublayer. 


Another category of explanation hinges on the type of flow around the roughness elements and its dependen- 
ce on the local Reynolds number. The general line of argument is that the skin friction begins to increase when 
the flow around individual roughness elements leaves the very low Reynolds number regime and begins to dis- 
play a separated or unsteady wake. Smith and Kaups (1968) make an argument of this type. First, they draw an 
analogy between the roughness elements and isolated bluff bodies in free air. They show that in the part of the 
boundary layer where the velocity profile is linear, the local roughness Reynolds number is related to kr by Rx 
= kux/v = (k*), where uk = u at y =k. Thus the critical roughness height k' =5 corresponds to Rk = 25, “at 
which isolated bluff bodies begin to develop a substantial and perhaps unstable wake.” They also point out that 
25 is the minimum Rx “that first affects transition in laminar flows,” though this presumably has no bearing on 
the turbulent-skin-friction issue. So the onset of the shedding of unsteady wakes from the roughness elements 
seems to correspond to the onset of the skin-friction increase in a turbulent boundary layer. But then Smith and 
Kaups acknowledge that the analogy with bluff bodies in free air doesn't seem to apply to the behavior of the 
drag curves. In the range of Reynolds number where unsteady wake shedding begins, the drag coefficient of a 
sphere, for example, does nothing remarkable, and 1n fact, it is decreasing smoothly with Reynolds number, not 
increasing (see Figure 6.1.6). In view of this, Smith and Kaups conclude 


Hence the drag increment of roughness does not relate directly to the drag of bluff bodies at low Re in a 
free stream. Instead the drag begins to be significant when a substantial wake region develops or when this 
wake region becomes unstable. 


They leave it at that and don't explain how the onset of unsteady wake shedding is supposed to produce a 
change in the drag curve for a rough-wall boundary layer, while it doesn't for isolated bluff bodies. 

In addition to the fact that 1t doesn't really explain the skin-friction increase, this argument has an incon- 
sistency that Smith and Kaups overlooked. The argument for the correspondence between k" =5 and Rg = 25 
makes sense in the context of an isolated roughness element exposed to a linear velocity profile that has not 
been disturbed by nearby roughness elements. Yet the k" =5 threshold is derived from the skin-friction curve 
for standard-sand-grain roughness, in which the grains are tightly packed together on the surface. For this type 
of surface, most of each grain is shielded by adjacent grains, and the average deviation of the surface contour 
from the mean is much less than the grain diameter k. Thus the effective Reynolds number of the protuberances 
is much less than 25 at the k" = 5 threshold where the drag curve deviates from that of a smooth surface. In view 
of this, it seems likely that there is no unsteady wake shedding involved in the onset of the skin-friction increase 
for standard-sand-grain roughness. 

So we see that the two main explanations that have been offered don't explain the skin-friction increase 1n a 
satisfying way. Is there a better way to explain it? I think there is. But we should expect to be able to explain 
it only in loose, general terms. After all, we're dealing with complicated three-dimensional roughness shapes 
immersed in the complex turbulent flow at the bottom of a boundary layer. 

I propose that the general trend of increasing drag with increasing k" is related to the height of the roughness 
elements relative to the thickness of the viscous sublayer and that it has nothing to do with changes in the wakes 
of the roughness elements. Two key features of the sublayer play roles: the shape of the mean velocity profile, 
and the distribution of Reynolds shear stress. The explanation is conveniently divided in two parts. To summar- 
ize before we dive into the details: 

1. When roughness elements are submerged in the sublayer, where the mean-velocity profile is approxim- 
ately linear, their effects needn't increase the area-averaged skin friction, because the boundary layer can 
“adjust” to their presence with nothing more than a shift in its effective origin. This should be true regard- 
less of what flow regime the roughness elements find themselves in, for example, whether the wakes are 
separated or not, or unsteady or not. A shift in effective origin 1s consistent with the equations of motion, 


as we'll find below, and it can prevent an increase in skin friction by keeping the average velocity of the 
flow around the roughness elements low. 


2. When the roughness height is increased, and the elements are no longer sufficiently submerged 1n the 
sublayer, a shift in the effective origin can no longer prevent an increase in skin friction. Taller roughness 
elements would require a larger shift in effective origin, and a thicker layer of low-velocity fluid would 
have to be maintained to prevent an increase in skin friction. But turbulent mixing prevents the main- 
tenance of sufficiently low velocities over a thicker region. Recall from our discussion of Equation 4.4.2 
that effective turbulent mixing (as embodied in the Reynolds stress) is damped by viscosity near the wall, 
but that the damping 1s able to keep the Reynolds stress near zero only over a thin region (the sublayer). 
So when roughness elements reach a certain size, sufficiently low velocity cannot be maintained across 
their whole depth, and turbulent mixing will raise the average velocity around them, increasing the av- 
erage skin friction. This explanation is supported by the success of the “discrete element” approach to 
predicting roughness drag (see Taylor, Coleman, and Hodge, 1984). In this approach, the drag of rough- 
ness elements is assumed to be affected by the turbulence physics (the local Reynolds stress) in just the 
way I've proposed here, through the shape of the mean-velocity profile. 


The idea of a shift in effective origin for small roughness is supported by a physical argument that starts 
with an idealized Couette flow. Consider the parallel flow between two flat plates, the lower one stationary and 
the upper one moving to the right as illustrated in Figure 6.1.10. For the smooth-surface case in part (a), this is 
the same flow situation we considered in our discussion of the basic effects of viscosity in Section 3.6 (Figure 
3.6.2a). The shear stress is constant across the gap between the plates, and we'll assume the flow to be laminar, 
so that the velocity profile will be linear, and the Reynolds stress will be zero, as in the sublayer. In part (b), we 
have added roughness to the lower plate, and now the flow near the rough surface is nonuniform on the scale of 
the roughness elements. Some short distance above the tops of the elements, however, the flow becomes prac- 
tically uniform in both horizontal directions, and we can adjust the speed of the upper plate so that the velocity 
gradient Ou/Oy in the uniform part of the flow is the same as it was in the smooth-surface case in part (a). The 
shear stress across the part of the gap not occupied by roughness elements is now the same as in the smooth- 
surface case, and the shear forces per unit area on the two plates are the same as well. So if the velocity gradient 
a short distance above the tops of the roughness elements is the same as it was in the smooth case, the skin 
friction will also be the same, and there will be no drag penalty due to roughness. The only difference will be a 
shift in the apparent origin of the mean velocity profile, defined by extrapolation of the velocity profile to zero, 
as indicated by the dashed line in part (b). And note that we didn't have to assume anything about the type of 
flow that surrounds the roughness elements, other than that it didn't result in transition to turbulence across the 
entire gap in this idealized laminar example. 


Figure 6.1.10 Laminar Couette flow like that of Figure 3.6.2a, illustrating the probable effect of surface rough- 
ness 1n a flow with a linear mean-velocity profile. (a) Smooth wall. (b) Rough wall with the same velocity 
gradient far above the roughness elements as in (a), but with a shift in the apparent origin of the velocity pro- 
file 
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Now consider the same comparison between a smooth surface and a rough surface, but in a boundary layer, 
which can be either laminar or turbulent. The situation is illustrated figuratively in Figure 6.1.11, with the 
smooth case shown in part (a). In the rough case in part (b), we assume that the roughness elements are well 
submerged in the region where the velocity profile is nearly linear, and where the Reynolds stress is negligible 
(the viscous sublayer in the turbulent case). Then we know from the Couette-flow argument above that the re- 
lationship between the skin friction and the velocity gradient at some distance above the tops of the roughness 
elements is unchanged by the roughness. And then if the velocity profile above the tops of the roughness is 
a shifted version of the profile in the smooth case, the skin friction in the two cases will be the same. In the 
Couette-flow case, we could force the velocity profile to be a shifted version of the smooth wall profile by ad- 
justing the speed of the upper wall. But why should such a simple shift apply in the case of a boundary layer? 
For a plausible answer, consider the equations of motion. 





Figure 6.1.11 Boundary-layer velocity profiles illustrating the probable effect of roughness elements that are 
submerged within the region where the profile 1s linear. (a) Smooth wall. (b) Rough wall case in which the ve- 
locity profile well above the tops of the roughness elements is the same as it was 1n the smooth-wall case, with 
a small shift in the apparent origin. The size of the roughness elements is exaggerated for clarity 
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The displacement thickness of the boundary layer would be affected by such a shift, and the interaction with 
the outer flow could therefore change the pressure distribution imposed on the boundary layer, but in most ex- 
ternal flow situations this will be a very small effect. (In a pipe or duct flow, the effect wouldn't be so small 
because such flows are sensitive to small changes in the effective area of the duct.) What 1s more important 
for our purposes is that the integrand in the definition of the momentum thickness (Equation 4.2.11) vanishes 
when the velocity goes to zero, so that the shift in origin would not affect the momentum thickness to a signi- 


ficant degree. Thus a simple shift of the velocity profile, with no change in either the skin friction or the mo- 
mentum thickness, is consistent with the rough-wall flow satisfying the same integrated momentum balance as 
the smooth-wall flow, as expressed by Equation 4.2.10. Furthermore, in the region well above the tops of the 
roughness, the shifted velocity profile of Figure 6.1.11b should satisfy the differential equations of motion in the 
same way the smooth-wall flowfield of Figure 6.1.lla does, provided the roughness doesn't directly affect the 
Reynolds stress. The conclusion from this arm-waving line of argument is that roughness elements sufficiently 
submerged within the linear part of the velocity profile should cause no significant increase in skin friction, and 
only a small shift in the apparent origin of the boundary layer. 

But when roughness elements are not sufficiently submerged in the viscous sublayer, a shift in origin can no 
longer prevent an increase in skin friction, as we argued in item (2) above. We also noted that this explanation 
is supported by the “discrete element” approach to predicting roughness drag in Taylor, Coleman, and Hodge 
(1984). In this approach, the flow domain that has roughness elements protruding into it 1s divided into thin 
slices parallel to the nominal base of the rough surface, as illustrated in Figure 6.1.12. In each slice, the flow is 
treated as 2D in plan view, and the effects of the roughness elements protruding through the slice are accoun- 
ted for in terms of flow quantities averaged over the planform area of the slice. Based on the geometry of the 
protrusions, an average element diameter and the average flow area blocked by protrusions are calculated. The 
average flow velocity in the slice and the average mass flow are related through a reduced cross-sectional area 
that accounts for this blockage. In each slice, the average drag of the roughness elements is calculated from 
the local average dynamic pressure and a drag coefficient that depends on the Reynolds number based on the 
local average velocity and the average element “diameter” at that value of y. Nothing magic 1s assumed regard- 
ing how this drag coefficient varies with Reynolds number. The turbulent shear stress exerted across interfaces 
between slices is determined by a conventional mixing-length model for smooth-wall turbulent boundary lay- 
ers, which is not modified to account for roughness. 





Figure 6.1.12 In the “discrete-element” approach to predicting roughness drag by Taylor, Coleman, and Hodge 
(1984), the flow domain into which roughness elements protrude is divided into thin slices parallel to the nom- 
inal base of the rough surface 
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When all is said and done, the theory predicts that Cf increases when roughness elements protrude outside 
the sublayer, and agreement with experiments for various rough surfaces is fairly good. The “complicated” drag 
curve of a given rough surface is thus found to be predicted fairly well by simple local physics and geometry, 
and detailed bookkeeping of the effects of the nonlinear mean-velocity profile. 

So the success of the “discrete element’ approach seems to support the proposal that a shift in the effective 
origin of the boundary layer prevents a significant increase in skin friction when roughness is sufficiently small, 
but not when roughness is too large to be sufficiently submerged in the sublayer. And the fact that predicting 
even the gross features of the Cy-versus-k* curves requires detailed bookkeeping calculations offers an excuse 
as to why our “simple” qualitative argument can explain only the existence of a smooth regime and the sign of 
the trend above that, but not the shapes of the curves. 


6.1.9 Interference Drag 


Aerodynamic interference between aircraft components can alter the total drag compared with what would be 
estimated for the components in isolation. Drag increases are much more common than drag reductions (“favor- 
able interference’). Whatever the sign, such increments are often referred to as interference drag. This termin- 
ology incurs the same problem we encountered above in our discussion of “base drag” and “slot drag’: there 
is no rigorous way to define the baseline for comparison. Separate airplane components, by themselves, don't 
generally form closed bodies that can be realistically “flown” in isolation for a baseline drag evaluation. And 
even if they could, the flow around them in isolation wouldn't be representative of a reasonable baseline condi- 
tion, because the components were designed to fly in conjunction. Thus interference drag is not a “component” 
of drag with a physically rigorous definition. But it does make sense in the context of drag estimation, where 
we can accept a looser definition of the baseline in terms of “what would be estimated” (usually by handbook 
methods) for the drag contributions of the separate airplane components, before any accounting for interference. 

Taking this looser definition, interference drag can comprise contributions from induced drag, shock drag, 
and viscous drag. Changes to induced drag and shock drag are the only interference mechanisms that look ef- 
fectively “inviscid” in the near field. The mutual inviscid buoyancy effect that occurs when two bodies with 
volume are “flown” in proximity is not properly considered interference drag, as we concluded in Section 6.1.3. 
In a shock-free inviscid flow, such effects would always add up to zero for the total configuration. Again, as we 
saw above in connection with pressure drag, the total drag can change only if one or more of the drag-producing 
mechanisms in the field 1s affected. Now let's look specifically at the viscous contribution to interference drag. 

For a classical aircraft configured according to the Cayley ideal (see Kiichemann, 1978) with relatively high- 
aspect-ratio wings and distinct fuselage and tail surfaces, much of the viscous drag is generated in boundary 
layer and wake flows that are generally similar to the simple flows discussed in Section 6.1.5. Over large por- 
tions of the fuselage surface, the boundary layer behaves qualitatively like that on an axisymmetric body. Even 
if the wings are swept, the flow over the airfoil sections is nearly 2D in the sense that flow quantities change 
relatively slowly in the spanwise direction. (With sweep, the wing boundary layer is 3D in the sense that sub- 
stantial cross-flow generally develops, but 2D in the sense of slow spanwise change. We discussed this type of 
behavior in theoretical terms in Section 4.3.5, and we'll look at the behavior on wings in more detail in Section 
8.6.2.) The generation of viscous drag in these flows follows the qualitative pattern of the airfoil flow in Figure 
6.1.5, in that pressure drag is a relatively small fraction of the viscous drag, more of the dissipation takes place 
in the boundary layer than in the wake, and the dissipation in the wake is essentially complete within a relatively 
short distance. 

Aerodynamic interference between aircraft components generally increases the viscous drag over what the 
components would experience if the simple flow patterns described above prevailed. The viscous part of this in- 
terference results mainly from strong three-dimensionality and local 3D separations that occur at tips and junc- 
tions. Figure 4.5.la schematically illustrates the “necklace vortex” type of separation that 1s common in wing- 
root junctions. Even a junction with a strakelet or fillet that does not produce a distinct separation, as shown 
in Figure 4.5.1c, still induces strong three-dimensionality in the fuselage boundary layer, which probably still 
produces a drag increase. 

The drag penalties associated with such viscous interference effects are difficult to isolate and assess. Three- 
dimensional NS calculations can reproduce such flows as necklace vortices at least qualitatively, but the fidelity 
of turbulence models is not good enough that we should expect accurate quantitative assessments of the asso- 
ciated drag penalties. Assessment of these effects therefore often relies on testing or on empirical correlations, 
which are necessarily rather crude. The most extensive collection of correlations is in Hoerner (1965). 


Another source of viscous drag 1s surface excrescences and roughness. The total drag associated with such 
items is not limited to the forces on the items themselves, but is usually increased by the effects of aerodynamic 


interference with the rest of the configuration. The flowfields around these items produce changes in the skin- 
friction on the surrounding surface that can be either positive or negative, but the net change in the pressure 
drag of the rest of the configuration is generally positive. The resulting “magnification” effect on the drag due 
to an item is usually larger than what would be expected just because the item might be immersed in a part of 
the flowfield where the local velocity is greater than that of the freestream. 

One theoretical model for these effects is that of Nash and Bradshaw (1967), and the example from their 
paper is shown in Figure 6.1.13. The theory is 2D and is based on some very crude simplifying assumptions. In 
the general 3D case, estimates of the magnification effect based on this 2D theory should be regarded as very 
uncertain. 





Figure 6.1.13 Example from Nash and Bradshaw (1967) of an excrescence on an airfoil, showing how the 
excrescence-drag magnification factor m can differ from a simple dynamic-pressure correction. 
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Another model for this kind of effect, really more of a rule of thumb, is based on the idea of work done, 
and is even more crude than Nash-Bradshaw, but is not restricted to 2D. If the local flow field surrounding the 
excrescence has an identifiable local “freestream” velocity different from the global freestream, the rate of work 
done against the part of the total drag that we can blame on the excrescence should be close to the local drag 
force on the excrescence (in the local flow ty times the local flow velocity, not the freestream velocity. 
This suggests a magnification factor of (qlocal/qoo) for the total drag. The difference between the force on the 
excrescence (this time resolved in the global drag direction) and the magnified drag increment would presum- 
ably be mostly pressure drag on the larger body, due to the source effect of the viscous wake of the excrescence. 
An idealized example that supports this rule is shown in Figure 13 of Drela (2009). 

We've looked at two crude models for a magnification effect on the drag of an excrescence, Nash-Bradshaw 
and a local-q! model. Both of these models suggest that the total drag increment does not depend on the dir- 
ection of the local flow in which the excrescence is immersed, only the local velocity magnitude. For example, 
an excrescence located on an airfoil leading edge where the local flow direction is vertical should produce the 
same total drag increment as if the same excrescence were placed somewhere else on the airfoil where the local 
velocity magnitude is the same. Thus even if the local drag force on the excrescence is perpendicular to the 
global drag direction, the excrescence will still cause a drag penalty. 


6.1.10 Some Basic Physics of Propulsion 


As we noted in the introduction to this chapter, the purpose of propulsion is generally to control the total flight- 
direction force on a vehicle, for example, to make it zero for steady, level flight or to make it a net propulsive 
force for climbing or accelerating flight. Thus although propulsion systems often affect other components of the 
aerodynamic force to some extent, their effect on the flight-direction component is usually the largest one and 
the one we're most interested in. In this section, we'll touch on only the most basic physics of propulsion and its 
effects in the external flowfield. A more detailed discussion of the different types of engines and their internal 
flows 1s beyond the scope of this book. 

The total aerodynamic force in the flight-direction affects the momentum balance in the flowfield in the same 
way whether propulsion is present or not, and the analysis that we applied to drag at the beginning of Section 
6.1.4 also applies to situations with propulsion, with one caveat that is usually minor. The exhaust gasses of 
most propulsion systems include some mass of fuel that was not part of the flow from upstream, but this is usu- 
ally small compared to the mass of the flow affected by the propulsor. Thus Equations 6.1.2 and 6.1.3 apply to 
situations with propulsion, with only a small error due to the neglect of fuel mass addition. When we applied the 
theory to pure drag in Section 6.1.4, we saw that drag implies a wake with a net deficit in momentum flux and/ 
or pressure. In cases with propulsion, we'll see a combination of viscous wake and propulsion plume that will 
have a combined net deficit or excess in momentum-flux and/or pressure, depending on how much propulsion 
is being applied. For example, when the total flight-direction force is zero, as required for steady level flight, 
the net deficit would be zero. 

Whether the wake and propulsion plume are separately distinguishable depends on the how close to the 
vehicle we look and on the degree to which the propulsion system is integrated with the rest of the configur- 
ation. And there is a spectrum of possibilities for the degree of propulsion integration, with the two extremes 
illustrated in Figure 6.1.14. At one end of the spectrum is the Cayley concept of a classical aircraft (see Ki- 
chemann, 1978), which assumes that the propulsion system is distinct from the rest of the configuration. The 
aircraft wake and the propulsion plume are distinct in the near field, and this distinctness provides one basis 
for determining separate drag and thrust. At the other end of the spectrum we have the Ackeret ideal, in which 
propulsion is integrated with the configuration to such an extent that the propulsion system energizes only the 
air that has experienced losses in the boundary layer, and no viscous wake or propulsion plume remains to pro- 
duce further dissipation. Assigning separate thrust and drag in this case is problematic. The Ackeret ideal makes 
no provision for making up the induced losses associated with lift in 3D, which we'll discuss in Section 8.3. 


Figure 6.1.14 The Cayley and Ackeret ideals of propulsion 
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On the spectrum between these two ideals, real aircraft fall much closer to the Cayley than to the Ackeret. 
However, there is always some interference between the propulsion system and the rest of the aircraft, such as 
the interference effects of propeller slipstreams, the complicated mixture of propulsion effects and drag effects 
on the surfaces of bodies or nacelles near jet exhausts (the “boat-tail drag” associated with narrowing of a body 
ahead of a jet nozzle, for example), and the interaction of propulsion inlets with the rest of the flowfield. 

Much of the theory of propulsion deals with propulsors as isolated thrust producers, as they would be in the 
Cayley ideal. When an isolated propulsor produces thrust, according to Equation 6.1.3 its exhaust plume must 
have a net excess of momentum flux and/or pressure. In a simple propulsion plume without circumferential 
velocities, the pressure far downstream must decay to freestream pressure, as we assumed in our analyses of 
simple viscous drag in Section 6.1.4. In this case, only an excess of velocity remains, as in the Cayley “propul- 
sion plume” sketched in Figure 6.1.14. When a propulsion plume retains significant circumferential velocities, 
as in the case of a single-rotation propeller, the plume will have a pressure deficit consistent with centrifugal 
effects, and for a given thrust this must be accompanied by a correspondingly larger excess in streamwise velo- 
city. Note that this is not the same issue as the energy loss due to “swirl” that we'll discuss later in connection 
with induced efficiency. 

To produce a plume with an excess in momentum flux, the propulsor must add energy to the flow through 
some combination of mechanical work and heat addition. The possible combinations range from work only (a 
propeller with no exhaust gas added to the flow) to heat only (a ram jet). The most practical modern propulsors 
are open propellers and turbofans, both of which use rotating airfoil-shaped blades to transfer mechanical work 
to and from the flow. 


If the situation is nominally steady in the long term and we ignore or time-average fluctuations associated 
with the passage of propulsor blades, we can distinguish a propulsion streamtube in which the flow is acted on 
directly by the propulsor. For a propeller, it 1s the streamtube that passes through the disc swept by the propeller 
blades, and for a ducted engine such as a turbojet or turbofan, it is the streamtube that enters the inlet. Defining 
a propulsion streamtube identifies the part of the field that is most directly affected by the propulsor, and in 


which the most important changes 1n total-pressure and total-temperature take place. When work 1s done on the 
fluid in the propulsion streamtube, it increases both the total-pressure and the total-temperature of the stream. 
Heat addition further increases the total-temperature but reduces total-pressure (see Shapiro, 1953, Section 7.2). 
Unsteady fluctuations (due to blade passage) that extend outside the propulsion streamtube can affect total-pres- 
sure, but these effects are usually small and are ignored in the simpler theories. Downstream of the propulsor, 
there is generally turbulent mixing between the propulsion streamtube and the external flow, redistributing total- 
pressure and total temperature, but these effects are also usually neglected in the theories. 

To delve into the physics issues further, let's consider the open propeller, the simplest propulsor in terms of 
basic physics, with no duct and no heat or mass addition to deal with. 

The simplest and oldest theories for propellers ignore the details of the propeller itself and use control- 
volume analysis to predict average velocities in the propulsion streamtube and to make rough estimates of pro- 
peller performance trends, such as how efficiency varies with the thrust loading coefficient (defined shortly 
in Equation 6.1.18). The flowfield assumed in these sorts of theories is illustrated in Figure 6.1.15, with the 
propulsion streamtube running from left to right. The flow is assumed to be axisymmetric, steady, inviscid, and 
incompressible. The discrete effects of passing propeller blades are ignored, and nothing is assumed about the 
shape of the propeller required to produce the integrated effects. The theories can be thought of as representing 
either the time average of a real flow, or the flow in the limit as the number of blades goes to infinity and their 
chord goes to zero. They incur significant error by neglecting discrete blade effects, and of course they cannot 
predict differences due to different numbers of blades. 


Figure 6.1.15 Illustrations of the idealized propulsion streamtube and propeller actuator disc assumed in 
actuator-disc theories for propellers. The contraction of the propulsion streamtube and the excess axial velocity 
downstream are indicated approximately to their correct scales for the indicated thrust loadings. (a) Moderate 
thrust loading, t = 0.2, typical of cruising flight. (b) High thrust loading, t = 1.2, typical of climbing flight 
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The propeller itself is represented by an idealized actuator disc across which the pressure increases as a result 
of the work done by the propeller, and the tangential velocity can change to account for circumferential velocity 
(swirl) in the downstream flow. No change in axial velocity occurs across the disc, as required by mass conser- 
vation in incompressible flow. The pressure jump across the disc is assumed uniform for simplicity. The axial 
velocity just upstream and downstream of the disc is also assumed uniform, though we'll see later that this isn't 
realistic. Because the jumps in axial velocity and axial momentum flux are zero, momentum conservation re- 
quires that the integrated pressure jump equal the thrust. The assumption of axisymmetric flow means that there 
is no mechanism for pressure forces to produce changes in circumferential velocity and thus dictates that cir- 


cumferential velocities must be zero upstream of the actuator disc. The integrated flux of moment of momentum 
must therefore be zero upstream of the disc. Changes in circumferential velocity and in the flux of moment of 
momentum can take place only as jumps across the disc. The integrated flux of moment of momentum down- 
stream of the disc must equal the torque applied by the propeller, and continuing downstream it must remain 
constant. 

For purposes of physical understanding, actuator-disc theory is useful because it indicates the general time- 
averaged shape that the propulsion streamtube must have, to satisfy both mass and momentum conservation. 
For the idealized propeller flowfield we're considering, the axial momentum balance requires axial velocity 
higher than freestream at the propeller disc and even higher still in the propulsion streamtube downstream. The 
propulsion streamtube must therefore contract as illustrated in Figure 6.1.15. The theory indicates that half the 
change in axial velocity occurs upstream of the disc and half downstream, with the streamtube area decreasing 
inversely with the velocity. The degree of streamtube contraction and the level of excess velocity downstream 
both increase with increasing thrust loading and are shown approximately to scale in Figure 6.1.15. Part (a) 
corresponds to a moderate thrust loading typical of a single-engine light plane cruising, and part (b) to a high 
loading typical of climbing. 

The velocity field we've assumed requires the bounding surface of the propulsion streamtube downstream of 
the actuator disc to be a vortex sheet across which there is a jump in the velocity vector. Associated with the 
jump in longitudinal velocity is a circumferential vorticity component, and associated with the jump in circum- 
ferential velocity is a longitudinal vorticity component. The vortex lines in the bounding sheet are thus helical, 
and the vorticity in the sheet is uniformly distributed circumferentially. The real distribution of vorticity pro- 
duced by the propeller blades is of course complicated and unsteady. 

An early version of actuator-disc theory that doesn't account for swirl is credited to Rankine in 1865 and was 
motivated more by nautical than aeronautical applications. Later elaborations on the theory account crudely for 
swirl and for the integrated effects of the viscous drag of the propeller blades, as described by von Mises (1959). 

Discussions in the literature (including that in von Mises, 1959) generally fail to mention a problem with 
simple actuator-disc models, and it has to do with the axial velocity where the bounding vortex sheet abuts the 
edge of the actuator disc. We've assumed no jump in axial velocity across the disc, but the jump in total-pressure 
across the disc will generally dictate a jump in axial velocity across the vortex sheet that abuts the edge of the 
disc. Spalart (2003) has proposed that a spiral singularity in the shape of the vortex sheet at the edge of the disc 
might solve the problem. This is not a settled issue. 

Moving beyond the idealized actuator-disc models, a detailed discussion of the physics must include the ef- 
fects of the discrete, rotating blades. As the propeller moves forward and rotates about its axis, its blade tips 
follow helical paths as illustrated in Figure 6.1.16. The ratio between the distance traveled in one revolution 
and the diameter is called the advance ratio, J = V/ndp, where V is the forward velocity through the air mass, 
n is the rotations per unit time, and dp is the propeller diameter. Blade sections inboard of the tips move along 
helices with the same pitch but smaller radii, thus moving at steeper angles relative to the circumferential direc- 
tion. In the reference frame of a blade section, the flow approaches the section from a direction determined by a 
combination of the section's helical motion and the disturbance velocities produced by the propeller. Preferably, 
the blades are twisted to account for all of this, so as to keep each section at a favorable angle of attack to the 
local relative flow and to produce a favorable distribution of loading along the blade. We'll see later how one 
theoretical estimate of the “optimum” load distribution is determined. 


Figure 6.1.16 A propeller moving axially through the air mass at velocity V, rotating at m revolutions per unit 
time. The propeller tips trace helical paths as shown. The ratio between the distance traveled in one revolution 
and the diameter is known as the advance ratio, J = V/ndp 
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An intuitive starting point for understanding how a propeller works is to think of it as a screw. This is an 
image that has been with us for a long time: In the early days an aeronautical propeller was called an airscrew, 
and a nautical propeller is still sometimes referred to as a screw. Imagine the propeller as a short section of a 
machine screw being turned by some applied torque and thus moving itself along a threaded hole and applying 
thrust against something that resists its axial motion. (Or, instead of a threaded hole and a machine screw, 1ma- 
gine the helical tip path of Figure 6.1.16 as a rigid, stationary rail along which the propeller tip slides. And the 
rail analogy needn't be limited to the propeller tip. We can just as well imagine any section along the span of the 
blade sliding along its corresponding helical rail.) Further, imagine that there is no mechanical friction. Without 
friction losses, and with the propeller reacting against a stationary thread or rail, the input work done in a given 
time by the turning torque and the output thrust work done against the axial resistance would be equal, and the 
propeller efficiency (thrust work out/shaft work in) would be 100%. Of course in a fluid, especially a viscous 
fluid, propeller efficiency must be less than 100%, and for a given thrust, forward speed, and rotation speed, the 
torque required is always greater than it would be for the idealized screw. 

We can think of the required additional torque as comprising two parts, one to make up for viscous loss, and 
one to make up for induced loss. The viscous loss is due to the viscous drag of the blade sections, the same thing 
as the profile drag of an airfoil, which we'll discuss in Section 7.4.2. In the rail analogy, mechanical friction 
between the blade and the rail would be analogous to sectional viscous drag. The induced loss results from the 
propeller's setting air in motion, both axially and circumferentially, and thus leaving kinetic energy behind. To 
model the induced loss in our mechanical rail analogy, we would have to allow the rail to move (or “yield”’) in 
the directions of both the thrust force and the torque, and thus to absorb some of the work done by the turning 
torque. Holding the same rotation speed and forward speed would then require a steeper helix angle, and greater 
torque would be required to make up for the loss and deliver the same thrust. Aerodynamically speaking, the 
induced loss is analogous to the induced drag of a wing of finite span, which we'll discuss in Section 8.3. Note 
that this decomposition of the additional torque into viscous and induced parts is not exact and can only be es- 
timated with idealized models, the same situation we pointed out with regard to drag in Section 6.1.3. 

A propeller blade acts like a lifting wing operating in a non-uniform flowfield in which the relative local 
velocity vector varies along the span. A schematic vector diagram of the situation seen by a typical section of 
the blade at radius r from the propeller axis is shown in Figure 6.1.17. The upper portion of the diagram shows 


the components of the relative flow velocity seen by the section. First, the forward velocity V of the propeller 
and the circumferential velocity wr of the blade section combine to produce an effective local “undisturbed 
freestream” velocity vector Veffoo. Next we take into account the local inviscid disturbance velocity produced 
by the propeller at the location of the section, which we call Vind because it is often referred to as the “induced” 
velocity. When Vind is added to Veffoo, the resultant is the total effective local flow velocity vector Veg. The 
effect of Vind 1s generally to increase the axial velocity and decrease the circumferential velocity seen by the 
section. 


Figure 6.1.17 Schematic diagram of the relative flow velocities and forces seen by a propeller blade section at 
radius r from the propeller axis. The section is moving from right to left with flight velocity V and upward 
with circumferential velocity wr resulting from the angular velocity @ = 27n of the propeller. The view 1s radi- 
ally inward along the axis of the blade and is consistent with right-hand rotation of the propeller, as in Figure 
6.1.16. The upper portion of the diagram shows the components of the local relative flow velocity Veff seen by 
the section, and the lower portion shows three versions of the resulting force components on the section for a 
fixed net thrust contribution 
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The lower portion of Figure 6.1.17 illustrates three versions of the resulting force components on the section, 
with and without the effect of Vind, and with and without the effect of viscous drag d. First, note that the local 
sectional lift is defined as the component of the sectional force perpendicular to the local relative flow direction 
(in contrast with the global definition we introduced in Section 5.4, where total lift was the component of the 
total aerodynamic force perpendicular to the global freestream). Note further that the axial component of the 
local sectional lift contributes to the thrust, and the axial component of the sectional drag subtracts from thrust. 
For this comparison, the lift components are varied so as to hold the net sectional contribution to thrust constant, 
at the value indicated by the horizontal arrow at the bottom of the diagram. The circumferential components of 
both lift and drag contribute to the torque that retards the propeller's rotation. The net circumferential compon- 
ents contributing to the torque are indicated by the downward arrows at the lower left. The three versions of the 
force vector are: 





“Tdeal:” {In the idealized situation with no “induced” velocity (which would apply in the limit of zero load on the pro- 
eller), the lift vector /ideal 1s perpendicular to the effective local “undisturbed freestream” velocity vector 
effoo. This situation is analogous to having the section slide along a stationary frictionless rail, and with no 


mechanism for any loss, all of the work done against the resisting torque shows up as thrust work at an effi- 


local effective flow velocity vector Verf. Note that holding the sectional contribution to thrust constant requires 
he magnitude of /iny to be larger than that of /ideal. Compared to the “ideal” case, the resisting torque has in- 
creased, and even in this case with zero viscous drag the efficiency is less than 100%. 


hen both the inviscid “induced” velocity and a viscous-drag component d are included, the resultant is the 
otal sectional force vector f. Note that to maintain a constant net contribution to thrust, the lift magnitude has 
had to increase further, that is, the magnitude of /yis is larger than that of /iny. This would produce a larger Vina, 
and to be strictly correct, /yis and /iny should be shown as having slightly different directions. But in practical 
situations, in which d tends to be small, this is a small effect, so it is ignored in the diagram. Compared to the 
“inviscid” case, viscous drag has increased the resisting torque, further reducing efficiency. 





Note that holding the net sectional thrust constant in these comparisons simplified the vector diagram a bit, 
but was not essential. Instead, we could have held the magnitude of the lift vector constant, for example. In that 
case, including the effects of Vind and d would have increased the required torque less but decreased the thrust. 
The directions of the resulting force vectors would be the same, as would our conclusions regarding efficiency. 
With the diagram in its present form with net thrust held constant, the longer vertical arrows on the lower left 
for the “inviscid” and “viscous” cases represent the additional torque required for a given thrust, relative to that 
required to turn the “ideal” screw. But keep in mind the caveat above, that this decomposition of the additional 
torque into an “induced” part and a viscous part is an idealization. In the view illustrated in Figure 6.1.17, this 
would be reflected in the fact that in a real flow the “induced” velocity Vind cannot be defined exactly, but only 
through the use of an idealized inviscid model. 


The flow produced by the propeller is quite three-dimensional and unsteady. The flow around and down- 
stream of each blade has much in common with the flow around a 3D wing, which we'll discuss in Section 
8.1.1-3. Propeller blades and 3D wings both shed vortex sheets from their trailing edges. In both cases the sheets 
start out streaming back in the local flow direction, and as they progress downstream they deform and roll up 
at the outer edges to form “tip vortices.” In the case of a propeller, the vortex sheets and the tip vortices fol- 
low roughly helical paths, but relative to the helical paths of the propeller tips in Figure 6.1.16, the tip vortices 
generally migrate aft faster and migrate inward, consistent with the increased axial velocity of the flow and the 
general contraction of the propulsion streamtube illustrated in Figure 6.1.15. 

The torque exerted by the propeller introduces angular momentum into the downstream flow in the form of 
circumferential velocities that tend to be everywhere in the same direction as the propeller rotation. This cir- 
cumferential motion is loosely referred to as “swirl,” a term that can be misleading because it is easily confused 
with the axial vorticity. Although the swirl tends to have the same sign everywhere, it is easy to show that axial 
vorticity of both signs must always be present. If we ignore the radial velocity, which is usually small in pro- 
peller flows, the axial vorticity is given by 
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where Ug is the circumferential velocity. Moving from the axis outward, wp must first increase from zero and 
then return to zero outside the slipstream, and m, must have opposite signs in the regions of increase and de- 
crease. And this is consistent with an even stronger statement about the axial vorticity, which is that the in- 
tegrated axial vorticity over a any constant-x surface that cuts the entire propulsion streamtube must be zero. 
This can be established by applying Stokes's theorem to any closed contour that surrounds the entire propul- 
sion streamtube, combined with a capping surface bounded by the closed contour, where the capping surface 
bulges out in front (upstream) of the propeller and cuts none of the downstream propulsion streamtube. The flow 


everywhere on such a capping surface is irrotational, and the integrated vorticity flux through it must therefore 
be zero, so that the circulation around any closed contour outside the propulsion streamtube is also zero. This 
point is not always correctly understood. There have been depictions incorrectly showing propeller streamtubes 
as having net axial vortex strength that “induces” circumferential velocities at locations outside the propeller 
streamtube (Loth and Loth, 1984, for example). Actually, we've just shown that this can't happen. Zero net axi- 
al vorticity in the propeller streamtube means that there can be no such effect extending outside the propeller 
streamtube. Further, it means that in the idealized case of a time-averaged, axisymmetric propeller flow there 
can be nonzero swirl velocities only inside the propulsion streamtube, and only in the part downstream of the 
propeller. 

The fact that the propeller streamtube carries zero integrated axial vorticity means that the blade tip vortices 
must be offset by vorticity of opposite sign inside the streamtube. For efficient propeller designs, the vorticity 
feeding the tip vortices is shed close to the blade tips, and the offsetting vorticity of opposite sign is shed from 
the inboard portions of the blades. For an idealized isolated propeller with no fuselage or nacelle behind it, the 
vorticity shed inboard gathers downstream into a vortex core similar to the tip vortex behind a 3D wing that 
we'll discuss in Section 8.1.2. In real propeller installations, this inboard vorticity forms a kind of shell stream- 
ing back along the outer surface of the fuselage or nacelle. 

Between the inboard and outboard regions of shedding, there is usually a significant expanse of the blades' 
radius from which relatively little vorticity of either sign is shed. There is thus an annular region in the propeller 
streamtube in which there is little vorticity and the flow is similar to that around a potential vortex, with the 
circumferential velocity ug going roughly as I/r. 

We've seen that a propeller flowfield 1s complex, with contorted, moving vortex sheets and tip vortices, and 
complicated unsteady distributions of axial and circumferential velocity. But despite the complexity, we can use 
a simple dimensional argument to predict how propeller performance should scale, and idealized models to pre- 
dict the major performance trends. The dimensional argument is as follows. At low Mach numbers and high 
Reynolds numbers, the dependence on Mach number and Reynolds number should be weak. For a propeller of 
a given shape and operating at a given advance ratio, the flowfield in dimensionless terms is effectively determ- 
ined, regardless of the geometric scale or the velocity level, and the pressure disturbance anywhere on a blade 
element will scale closely with V~. Then the following dimensionless performance measures should depend al- 
most exclusively on propeller shape and advance ratio: 


Thrust loading: 7 = T/qs,,. 


6.1.18 
ee Torque coefficient: js = M/qs,, dp, 
Per Shaft power loading: a = P/qVs,, 


6.1.21 Efficiency: thrust power out/shaft power in = 7 = TV/P =7/o, 

where Sp is the propeller disc area. Actually predicting values for these coefficients requires calculating the de- 
tailed forces on the propeller blades, which can be done to various levels of fidelity under the general rubric of 
blade-element theory. 

The problem blade-element theory deals with is similar to that of predicting the spanwise distribution of lift 
(the “spanload’’) of a 3D wing, which we'll discuss in Section 8.2. Just as in the case of a wing, the load on 
a section of a propeller blade influences the flow direction seen everywhere along all the blades and thus in- 
fluences the loading at all blade sections. In the simplest blade-element theory, this influence is ignored, and 
blade section loadings are calculated based on undisturbed local conditions, an approximation valid in the lim- 
it of light loadings (see von Mises, 1959). Then there are more-accurate potential-flow theories for wings and 


propellers, in which this nonlocal influence is inferred from the vorticity distribution in the wake, that is, the 
velocities “induced” by the wake vorticity are calculated using the Biot-Savart law. In such theories, the vortex 
sheets are usually assumed not to distort and roll up at their edges, which for a propeller means that a helical 
screw shape is assumed for the vortex sheets. Prandtl (in Prandtl and Betz, 1927) derived approximate expres- 
sions for the velocities “induced” by such sheets, which have been used in both analysis methods (calculating 
the loading on blades of a given shape) and design methods (calculating the twist of the blades so as to produce 
a given loading). Betz (also in Prandtl and Betz, 1927) showed that the loading that produces the minimum in- 
duced loss for a given thrust, diameter, and advance ratio produces a velocity field that ideally should not distort 
the assumed helical-screw-shaped vortex sheets (1.e., a velocity field compatible with “rigid” but not stationary 
vortex sheets), and Goldstein (1929) derived exact potential-flow solutions for these cases, for propellers with 
two blades or four blades. Goldstein's solutions define “optimum” load distributions that have been used for the 
design of propellers for minimum induced loss (see Drela, 2006), which should come close to maximizing ef- 
ficiency overall. Although the potential-flow solutions on which these designs are based are exact, the physical 
model is not, because it assumes an idealized configuration of helical vortex sheets of fixed pitch, ignoring the 
overall contraction of the propulsion streamtube and the sheets' distortion and rollup into tip vortices. 


Now we'll use numerical calculations to explore how propeller efficiency varies with thrust loading and ad- 
vance ratio for propellers designed to meet Goldstein's minimum-induced-loss blade loadings. The QMIL code 
(the design version of the QPROP analysis code, see Drela, 2006) takes the number of blades, thrust loading, 
advance ratio, and propeller blade airfoil characteristics as inputs, and designs the propeller to meet an approx- 
imate version of the appropriate Goldstein blade loading. Design outputs are the distributions of blade chord and 
pitch angle. Flow solution outputs include distributions of forces on the blades, radial distributions of “induced” 
velocities, both at the blade locations and circumferentially averaged, total torque and power, and induced and 
total efficiencies. Blade section viscous drag is taken into account in the calculations or not, as desired. For the 
plots presented below, cases were calculated for several dense arrays of (t, J) pairs, and curves of constant ef- 
ficiency and maximum efficiency were interpolated from the resulting database. Keep in mind that every point 
on any of these curves represents a different minimum-induced-loss propeller operating at its design condition. 
These are not performance curves for fixed propellers. 

First, let's look at induced efficiency ni for two-bladed propellers designed and operating with no viscous 
drag on the blades. Curves of constant nj are shown on a J-versus-t plot in Figure 6.1.18. Two basic trends are 
evident. First, induced efficiency decreases with increasing thrust loading. Second, induced efficiency decreases 
with increasing advance ratio (and the associated lower rotation speed and higher torque). Both trends are con- 
sistent with a simple momentum-versus-energy interpretation. The “payoff” (thrust) is proportional to the ex- 
cess momentum imparted, while part of the “cost” (power required) is related to the kinetic energy left behind. 
A low thrust loading imparts a lower excess velocity and thus lower kinetic energy for a given momentum, and 
yields higher induced efficiency. Likewise, a low advance ratio corresponds to higher rotation speed for a given 
forward speed and diameter, which delivers the required power with lower torque, thus imparting less swirl, 
leaving behind less kinetic energy, and giving higher induced efficiency. In these inviscid calculations, maxim- 
um induced efficiency for a given induced efficiency. In t comes in the limit of zero advance ratio, as rotation 
speed goes to infinity, and torque goes to zero. Obviously this part of the trend won't hold up for blades with 
viscous drag, as we'll see next. 


Figure 6.1.18 Curves of constant induced efficiency n; in terms of J versus t for two-bladed propellers with 


zero viscous drag on the blades. Calculated by the QMIL code (Drela, 2006) 
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We saw in Figure 6.1.17 how viscous drag increases the resisting torque when thrust is held constant, re- 
ducing efficiency. Now, before we look at the results of detailed calculations, let's think about how we would 
expect the effect of viscous drag to vary with advance ratio J. Two limiting cases are informative: 


A typical blade element advances along a helix with a shallow angle relative to the circumferential direction. The sec- 
ional lift (/ in Figure 6.1.17) lines up closely with the thrust direction and contributes nearly 100% to thrust. This 
minimizes the blade lift forces required for a given thrust, and provided the blade chord is sized so that the section 
operates near its maximum lift/drag ratio, this also minimizes the viscous drag forces. But for a given distance of for- 
ard travel by the propeller, the distance traveled by the section is large, so that even the relatively small viscous 
drag forces produce a large energy loss and a large loss of efficiency. 


A typical blade element advances along a helix with a steep angle (nearly 90°) relative to the circumferential direc- 
ion. The sectional lift (/ in Figure 6.1.17) lines up close to the circumferential direction and contributes little to 
hrust. If the helix angle is too close to 90°, the viscous drag will more than cancel the small thrust contribution from 
he lift, and net positive thrust will be impossible. Even if we back off on the helix angle enough to get some net pos- 
itive thrust, the lift required for a given thrust will be large, and so will the viscous drag. Efficiency will be low even 
hough the distance traveled by the section is only slightly longer than the forward travel of the propeller. 





So with the inclusion of viscous drag we expect efficiency to drop off strongly for either extremely low or ex- 
tremely high J. Between these two extremes there should be a favorable range where we avoid the high loss due 
to either excessive distance traveled or excessive lift required. It's not that the losses due to viscous drag will be 
negligible, just that we'll avoid excessive losses. 


Now let's look at what detailed calculations predict for the effects of viscous drag. Recall that for each cal- 
culated operating point QMIL designs a propeller to a particular radial distribution of lift along the blades, that 
is, the load distribution given by the approximated Goldstein solution for minimum induced loss for that oper- 
ating condition. When viscous drag is to be included, an arbitrary radial distribution of sectional viscous drag 
versus sectional lift can be specified. On a well designed propeller, the chord distribution would be tailored so 
that each section operates near its maximum sectional lift/drag ratio. Thus for generic calculations a constant 
sectional lift/drag ratio is not a bad assumption, and for these calculations, a constant ratio of 50 was enforced 
everywhere along the blades. The resulting curves of constant overall n for two-bladed propellers are shown on 


a J-versus-t plot in Figure 6.1.19. The corresponding nj curves are included for comparison. They are slightly 
different from those in Figure 6.1.18 because the blade viscous drag constitutes a negative contribution to the 
thrust, so that for a given total t the inviscid part of the thrust must be larger, and the induced losses are greater 
than in the inviscid cases. First, note that n is everywhere lower than 1, as it should be, given that an additional 
loss mechanism is included. Note also that the efficiency drops off dramatically for low values of J, as we ex- 
pected above. The maximum efficiency for a given tT now occurs at nonzero advance ratio. Over the practical 
range of J shown in Figure 6.1.19, we see no sign of the dramatic drop-off in efficiency that we expect for very 
high values of J. 


Figure 6.1.19 Curves of constant overall efficiency 7 (solid) in terms of J versus t for two-bladed propellers, 
calculated by QMIL (Drela, 2006) for a blade-section lift/drag ratio of 50. The corresponding constant-n; 
curves (long dashed) are included and are slightly different from those in Figure 6.1.18 because blade viscous 
drag constitutes a negative contribution to the thrust, so that for a given total t the inviscid part of the thrust 


must be larger, and the induced losses are greater than in the inviscid cases 
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Figure 6.1.20 summarizes the maximum-efficiency trends, showing nNmax and the corresponding J at nmax 
as functions of t. This makes clear the relationship between higher induced velocity and reduced efficiency as 
thrust loading increases. It also shows how the optimum advance ratio decreases as thrust loading increases. 
Going to four blades instead of two increases efficiency slightly and increases J at nmax. It makes intuitive sense 
that having more blades decreases the losses due to discrete blade effects and would favor slower rotation speed 
for a given forward speed, that is, higher advance ratio. 


Figure 6.1.20 Trends for the point of maximum efficiency as functions of t, for two-bladed and four-bladed 


propellers, calculated by QMIL (Drela, 2006) for a blade-section lift/drag ratio of 50 
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For a practical perspective, the operating points for some propellers in real applications are indicated in Fig- 
ure 6.1.19. The points for a light plane cruising and climbing are the same ones for which the ideal propul- 
sion streamtubes were sketched in Figure 6.1.15. The Human-Powered-Aircraft (HPA) point shows the result of 
seeking very high efficiency, that is, a very low thrust loading and high advance ratio. The light plane accepts 
more of a penalty in efficiency to avoid having to use a larger-diameter, slower-turning propeller, which would 
require a longer landing gear and a gearbox to reduce speed between the engine and propeller. 

Figure 6.1.21 shows the induced axial and circumferential velocities, at the blade locations and circumferen- 
tially averaged, calculated for t = 1.0, J = 0.5, which is near the operating condition for a lightplane climbing as 
shown in Figure 6.1.19. It is clear that the axial velocity is not constant as is assumed in the simple actuator-disc 
theories. The averaged circumferential velocity goes roughly as 1/r from r/R = 0.3 to 0.8, which is consistent 
with the flowfield we described earlier in which most of the vorticity shed from the blades is shed in an inner 
region near the axis and an annular outer region. 


Figure 6.1.21 Radial distributions of induced velocities for a two-bladed propeller designed and operating at t 


= 1.0, J=0.5, calculated by QMIL (Drela, 2006) for a blade-section lift/drag ratio of 50 
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For flight at high-subsonic and supersonic speeds, turbojets and turbofans have been strongly preferred over 
open propellers. The reasons for this involve several aspects of how speed affects the different propulsion sys- 
tems. 

First, consider the factors affecting energy consumption by an open propeller. For a given flight mission, 
practical airframes tend to have optimum lift-to-drag ratios that fall within a narrow range regardless of the tar- 
get speed, and propellers tend to have practical optimum efficiencies that also fall within a narrow range. Thus 
the amount of shaft work required to drive an airplane of a given weight over a given distance is roughly fixed. 
Furthermore, piston and turbo-shaft engines tend to burn a roughly fixed amount of fuel per unit of work done, 
regardless of the target rate (power level) for which they're sized. So far, this indicates that the amount of fuel 
required for a given mission should be relatively insensitive to the target speed up to the point where compress- 
ibility effects begin to reduce propeller efficiency. And there are other considerations that come into play, such 
as the fact that higher speeds favor higher cruise altitudes, and the effects of altitude on engine efficiency and 
on the power available from an engine of a given size and weight. The upshot is that the use of conventional 
open propellers has mostly been limited to cruise Mach numbers below 0.60. 

Turbojets and turbofans allow operation at higher Mach numbers because their inlets slow the flow down 
before it encounters the rotating blades. But these engines don't just allow operation at higher speeds, they actu- 
ally favor it because of the way their fuel consumption varies with speed. They don't share the open propeller's 
characteristic of requiring a roughly fixed amount of fuel per unit of thrust work. Instead, they favor speed (or 
are penalized for low speed), a bit like a pure rocket propulsion system. A pure rocket tends to require a fixed 
amount of fuel per unit thrust per unit time, regardless of speed, so that the fuel required for a given thrust over 
a given distance decreases with increasing speed, as 1/V. Because turbojets and turbofans are air-breathing, they 
don't favor speed quite as much a pure rocket. The fuel consumption for a given thrust over a given distance 
varies approximately as v ©? for the turbojet and as Vv °° for the modern high-bypass-ratio turbofan. Thus 
even the turbofan favors flight at high subsonic Mach numbers, where airfoils should be designed specifically 
for transonic flow, as we'll discuss in Section 7.4.8, and wings are usually swept, as we'll discuss in Section 8.6. 

Even a turbofan tends to operate at relatively high thrust loadings, in the neighborhood of 0.6 in cruise, com- 
pared with 0.2 for the open propeller of a light plane in Figure 6.1.19. The other major difference in the flow- 
field is that while an open propeller producing positive thrust always speeds the flow up ahead of it, a ducted 


engine can slow the flow down before it enters the fan. Part of this deceleration takes place ahead of the inlet, 
and part inside. The result is that the pressure ahead of the fan is higher than freestream, a situation that is pos- 
sible only because of the presence of the inlet duct. To support the high pressure inside, the inlet duct carries a 
load radially outward, effectively acting as a ring-shaped wing lifting outward. Deceleration of the flow ahead 
of the fan is essentially taking place in the large-scale flowfield of this ring-wing. We'll discuss airfoil and wing 
flowfields in detail in Sections 7.3.4 and 8.11. 

So the flow ahead of a turbofan in cruise actually slows down ahead of the fan: And while the propulsion 
streamtube ahead of an open propeller always contracts, the “captured streamtube” of a turbofan in cruise starts 
off smaller than the inlet and grows larger approaching the fan, as sketched in Figure 6.1.22. However, at the 
low forward speeds and higher thrust loadings typical of takeoff and initial climb, the captured streamtube of a 
turbofan starts off larger than the inlet and contracts, more like that of an open propeller. 


Figure 6.1.22 Sketch showing the general character of the inlet and exhaust streamtubes of a turbofan engine 
operating at a thrust loading typical of cruise. The flow entering the engine slows down as it approaches the in- 
let, and the inlet streamtube grows larger. At higher thrust loadings typical of takeoff and climb, the opposite 
happens, more like the propeller streamtubes of Figure 6.1.15 
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The actuator-disc approach can be extended to compressible flow and to include heat addition. For modeling 
ducted engines, the duct must generally be included. Actuator-disc modeling of ducted engines is used mainly 
in CFD applications, as we'll discuss in Section 10.4.6. 

This concludes our treatment of isolated propulsors. Now let's return to the fully integrated ideal. Propulsion 
according to the Ackeret ideal as in Figure 6.1.14 can also be called boundary-layer propulsion, in which the 
boundary-layer air leaving the tail or trailing edge is reenergized to freestream total head, so that the dissipa- 
tion that would otherwise take place in the wake (at least the part associated with the viscous drag) would be 
prevented. The airfoil example in Figure 6.1.5 indicates that the potential power saving might be equivalent to 
eliminating on the order of 10% of the viscous drag. Propulsion systems 1n some nautical applications (ships, 
submarines, and torpedoes) take at least some advantage of this effect. But the concept has never found an ap- 
plication in aeronautical practice because it would require distribution of propulsion energy from an engine to 
points throughout the airframe (all along the wing trailing edge, for example), and the resulting internal losses 
and extra weight and complexity would more than offset the benefit. 

There is one vehicle configuration where such penalties would be minimal, and that is a body of revolution, 
as might be used for a lighter-than-air (LTA) airship or an underwater vehicle. Goldschmied (1982) developed 
a design for such a body that carried the concept of boundary-layer propulsion a step further, combining 
boundary-layer propulsion with boundary-layer control by suction to prevent separation. The body was designed 
to produce a pressure distribution similar to that of the Griffith-type laminar-flow airfoils (see Thwaites, 1958), 
with the pressure recovery concentrated over a single suction slot where most of the boundary-layer air is re- 
moved. There is no adverse pressure gradient on any of the external surface of the body, as there would have to 


be on the aft portion of a conventional body designed to achieve attached flow passively. The body shape and 
incompressible potential-flow pressure distribution are sketched in Figure 6.1.23. 


Figure 6.1.23 Shape and incompressible potential-flow pressure distribution of a self-propelled body of re- 
volution using a combination of boundary-layer control and boundary-layer propulsion. (Based on Gold- 
schmied, 1982) 
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The suction air is pressurized by a fan and expelled through a central exhaust pipe at the tail. Because the 
boundary layer on the tailcone develops at very low velocity and in a favorable pressure gradient, its momentum 
deficit when it reaches the tail (the exhaust-nozzle exit) is small, and the exhaust air needs very little momentum 
excess to make up for it. Thus if the tail pipe is sized properly, the body achieves self-propulsion with an exhaust 
total head only slightly above that of the freestream, and, in principle, the body comes very close to achieving 
the ideal of Ackeret propulsion. Goldschmied's experimental results verified that attached flow was achieved at 
a suction flow rate consistent with the criterion that all boundary-layer air with total pressure lower than the stat- 
ic pressure on the downstream attachment surface must be sucked away (Taylor's criterion, see Thwaites, 1958). 
They also seemed to indicate that self-propulsion was achieved at the predicted low exhaust total pressure, but 
there was an unresolved discrepancy between the drag determined by the force balance and that determined by 
the wake surveys. 


6.2 Drag Estimation 


Predictions of viscous drag range from crude, preliminary estimates to predictions based on very detailed (and 
often laborious) calculations, usually combined with experimental test results. What level of prediction is appro- 
priate depends of course on the purpose. In the initial stages of preliminary design, estimates based on empirical 


correlations are often sufficient. In the final stages of development, very accurate predictions can be needed, for 
example, to support performance guarantees. At this stage, a great deal of effort 1s usually justified, including 
very careful model testing and extrapolation to full scale, making use of the best CFD tools available. Between 
these extremes, particularly with regard to detailed design and design optimization, lies a gray area where the 
right balance between empiricism, testing, and computation 1s difficult to define. 

CFD is playing an ever-increasing role in the detailed design process, and with good reason. In general, CFD 
is reasonably successful at predicting the pressure distribution on an aircraft, at least near the cruise condition 
where the flow tends to be well-behaved. CFD-based inverse design and optimization methods now make it easy 
to design shapes that produce good aerodynamic performance, and this has speeded up the design process con- 
siderably. However, the evaluation of the drag of a configuration, especially the viscous drag, is currently one 
of CFD's weaker suits (along with the prediction of separated-flow phenomena). While CFD drag predictions 
can provide a useful indication and are improving rapidly, a good deal of caution 1s still in order. For example, 
current turbulence models shouldn't be relied on to provide very accurate predictions of swept-wing flow, as 
we'll see in Section 8.6.2. For some time to come, testing, and the experience of the designer, will continue to 
play a role in drag evaluation during detailed design. 


6.2.1 Empirical Correlations 


6.2.1.1 Flat-Plate Skin Friction 


The simpler a flow is, the more likely it is that it can be represented by an empirical correlation involving only 
a few dimensionless parameters. For viscous drag prediction, the simplest flows are those in which the drag is 
dominated by laminar or turbulent skin friction, and pressure gradients are mild enough that flat-plate flow is a 
good model. For slender fuselage-like bodies or thin flying surfaces at very low angles of attack, flat-plate skin 
friction can provide a very reasonable viscous-drag estimate. In Section 4.3.1, we discussed flat-plate boundary 
layers in some detail. For purposes of drag estimation, we use the average skin friction coefficient given by 
Equation 4.3.1 for laminar flow and Equation 4.3.2 for turbulent flow. These curves are plotted in Figure 4.3.1, 
along with curves for flows that transition from laminar to turbulent at a range of locations. For a flow that is 
either all laminar or mostly turbulent, only the wetted area and length Reynolds number need be known. For 
flows with transition, an accurate estimate is difficult, because the drag depends strongly on transition location, 
as 1s Clear in Figure 4.3.1, and the transition location is sensitive to pressure gradients, surface imperfections, 
and freestream turbulence (see Section 4.4.1). For high subsonic speeds and above, an adjustment for Mach 
number should be applied, such as the one given for turbulent flow by Equation 4.3.7 and illustrated in Figure 
4.3.4. 


6.2.1.2 Airfoils with Significant form Drag 


Moving up the complexity ladder, we come to effectively 2D wing sections with significant thickness and/or lift 
in subsonic flow. Of course, if there is flow separation, the drag can increase dramatically, and no simple correl- 
ation can be expected to work. Even if the flow remains attached all the way to the trailing edge, the significant 
pressure gradients associated with thickness and lift require correction to the flat-plate skin friction. First, the 
average skin friction is different from that on a flat plate, and second, the form drag must be added. These two 
effects are generally lumped together into an empirical form factor applied to the corresponding flat-plate skin- 
friction drag. In Section 7.4.2, we'll look at how the form factor varies with lift for a typical subsonic airfoil. 
Correlations relating form factors to airfoil geometric parameters can be found in several sources (see Hoerner, 


1965, and the ESDU, 1990, data units). Such simple correlations for profile drag are risky, as illustrated by the 
following example. An airfoil with a Stratford recovery pressure distribution (zero skin friction throughout the 
pressure-rise region; see Smith, 1975) can have a very low skin-friction drag, which would be at least partly 
compensated for by a relatively high form drag. The distinction between this and a more conventional airfoil 
would be missed by any simple correlation scheme based on airfoil geometry. 


6.2.1.3 Full Configurations 


Preliminary-design methods for estimating the drag of an entire configuration make use of the above ideas for 
estimating the drag contributions of the major components (1.e., flat-plate skin friction corrected by an appro- 
priate form-drag factor) and then add induced drag, interference drag, and the other viscous-drag contributions 
such as junctions, excrescences, and so on. Probably the most wide ranging compilation of empirical data on 
which to base estimates of all these contributions is Hoerner (1965). More recent data and theoretical deriva- 
tions can be found in the ESDU (1990) data units. 


6.2.1.4 Excrescence Drag 


The drag contribution of small excrescence items such as fastener heads, steps, and gaps depends strongly on 
how deeply the object is immersed 1n the boundary layer. In the correlation of these effects, the height of the ob- 
ject can be scaled in two distinctly different ways, that is, relative to boundary-layer wall variables or boundary- 
layer thickness. It is important to use a correlation that is scaled in the way that is appropriate to the situation at 
hand, as illustrated in Figure 6.2.1. 





Figure 6.2.1 Types of scaling appropriate for excrescence items, depending on height relative to boundary-lay- 
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If the object occupies a small fraction of the boundary layer thickness, the height should be expressed in 
terms of wall variables h” = huy/v, because the velocity profile in the near-wall region correlates best with these 
variables. If the object protrudes higher than about 10—20% of 6, it is more appropriate to use a correlation in 
terms of y/o. Once a suitable correlation has been found in the literature, 1t should be assumed only to give the 
local effect of the excrescence. To estimate the effect on total drag, a correction for the “magnification” effect 
of pressure gradients should be applied, such as the one by Nash and Bradshaw (1967), or the local-q!° model, 
both discussed in Section 6.1.9. 


6.2.2 Effects of Surface Roughness on Turbulent Skin 
Friction 


In Section 6.1.8, we looked at the behavior of the turbulent skin friction for various types of rough surfaces and 
discussed physical explanations for the skin-friction increase. In this section, we'll look at how empirical data 
from one situation can be applied in other situations for purposes of drag estimation. 


6.2.2.1 Scaling of Roughness Effects 


While it is often important to be able to assess the drag impact of deviations from a smooth surface condition, 
it is rarely practical to measure the effects in place in the actual application. Instead, we must usually rely on 
measurements made in other flow situations, usually in ground-based test facilities. Most such measurements 
are made in simple flow situations, such as a turbulent boundary layer on a flat plate in a wind tunnel or towing 
tank, a pipe or channel flow, or flow between rotating concentric cylinders. In such experiments, skin-friction 
levels with and without roughness may be measured directly or inferred from pressure and/or flowfield meas- 
urements. In any case, the use of the resulting increments for drag estimates in flow situations different from 
that of the basic experiment requires some analysis. Below we look at the theoretical basis for two different 
ways of doing the required analysis, one using detailed CFD calculations and the other based on an equivalent 
flat plate. In Section 6.1.8, we looked at curve-fits based on some of the available empirical data for several 
types of roughness, and in Section 6.3.2, we'll look at typical data for riblets. 

We wish to estimate the drag increment due to a particular surface texture exposed to a turbulent boundary 
layer in the flow environment of some practical application. We have available experimental measurements of 
the skin friction of the surface in question, preferably taken in some simple flow situation such a flat-plate flow, 
pipe flow, or rotating-cylinder test rig. Getting accurate increments from the experiment generally requires that 
measurements were also made for a reference smooth surface. To use such experimental results for estimation 
purposes, we must first put the measured smooth-to-rough increments in a form that is universally applicable 
and then convert them into increments for the application in question. In this section, we describe the theoretical 
basis for two practical ways of doing the required analysis. 

The effect of a given rough surface on the flow depends on the flow conditions, particularly the Reynolds 
number. What complicates the situation, and the basic reason that analysis 1s required, is that the drag increment 
due to roughness depends on both the local roughness Reynolds number and the global Reynolds number of the 
flow, and that we can't generally match both of them between a generic experiment and a specific application. 
The best we can usually hope for is to match the roughness Reynolds number, since the drag increment has the 
strongest dependence on it, and to adjust by analysis for the unavoidable mismatch in the global Reynolds num- 
ber and for differences in other variables such as Mach number. As we saw in Section 6.1.8, the local roughness 
Reynolds number is defined by k” = ku;/v, where k is a length scale characterizing the size of the roughness 
elements, and ur is the “friction velocity,” which as we saw in Section 4.4.2 is the velocity scale of choice for 
the “wall region” of a turbulent boundary layer. The global Reynolds number can be characterized in different 
ways depending on the situation, for example, in terms of Rx, Ro, or Rs*. 

The approaches outlined here are applicable to compressible flows, but we assume that the flow around the 
roughness elements themselves is not locally “compressible.” The local density and temperature in the wall re- 
gion may be significantly different from freestream, but the local Mach number applicable to the roughness, say 
M’ =uz/a, is assumed to be small, so that the single parameter k” suffices to correlate the local roughness effect. 
This assumption is probably valid for most applications with small roughness levels in the transonic and low 


supersonic regimes. For stronger compressibility effects, we would presumably reach a point where something 
like M" should be included as a correlation parameter in addition to kK, 

It is clear that for the results of a generic experiment to be useful, the measurements must cover a sufficiently 
wide range of k” to include what will be encountered in the application. One way to do this is to run the experi- 
mental rig at a fixed overall Reynolds number and vary the size of the roughness, while keeping the character of 
the roughness the same. But because it 1s difficult to accurately reproduce a given surface texture at more than 
one scale, this is very rarely attempted for surfaces other than standard sand grains. Instead, the usual way is to 
use a single rough surface and to vary k* by varying the overall Reynolds number in the test rig. In either case, 
the results of such an experiment can be expressed fairly directly as a smooth-to-rough increment AC¢ as a func- 
tion of k", where each value of k” corresponds to a particular value of the overall Reynolds number, say Rs«. 
What we need now is a way to adjust ACf for the fact that at a given ke Rg*, and possibly other flow variables 
in the application will generally be different from what they were in the experiment. To start, we recall from 
Section 6.1.8 that the one universal local effect of roughness is to shift the logarithmic part of the wall-region 
velocity profile by an amount Au’ that depends on k", as expressed in Equation 6.1.16. 

The variation of Au’ as a function of k", that is, the log-law-shift function Au" (k"), is a universal character- 
istic of the particular roughness shape in question, and within the limitations of the Law-of-the-Wall approxim- 
ation, is independent of the type of flow (e.g., boundary layer, duct, pipe) as long as the Law of the Wall applies 
to it. What is most useful to us is that Au" (k") is also independent of the overall Reynolds number of the flow 
and that it can, with some calculation, be related to ACf. 

We will discuss two ways of relating Au" (k*) to ACf: a “turbulence-modeling approach” for detailed CFD 
calculations, and an “equivalent-flat-plate approach” for preliminary estimates. 


6.2.2.2 The Turbulence-Modeling Approach 


In this approach, we modify a turbulence model so that when it is used in CFD calculations, the flow solutions 
match the known Au’ (k’) for the roughness in question. We then use the modified turbulence model to calculate 
the flow in the experiment or target application with and without roughness and thus to predict ACf. The advant- 
age of this method is physical fidelity. It takes into account whatever level of flow complexity can be handled by 
the CFD code and provides as accurate an estimate of ACf as the turbulence model is capable of. Furthermore, 
the total drag calculated by the CFD code contains an estimate of how AC¢ affects the viscous pressure drag 
of the body through the change in displacement thickness. The disadvantage of this approach 1s that it requires 
modifying a CFD code and running detailed CFD calculations. 

Turbulence models that allow accurate calculation of the mean flowfield in the near-wall region of a turbulent 
flow generally have an explicit dependence on the distance y from the wall, usually expressed as y". This func- 
tional dependence on y" is typically tailored so that numerical solutions using the model match experimental 
velocity profiles in the Law-of-the-Wall region, including the viscous sublayer and the logarithmic region. The 
algebraic eddy-viscosity model of Mellor (1967) as used in the 3D boundary-layer code by the author (McLean, 
1977) is one example of such a model. 

A CFD solution for a turbulent flow can be made to simulate the effects of surface roughness through a 
simple modification to the turbulence-modeling part of the calculation. The modeling equations themselves are 
not changed. Only the value of y" input to the near-wall part of the model is modified by a shift Ay" added to 
the geometric y’. When a Ay" shift is applied, the resulting flow solution no longer realistically represents the 
flow at the very bottom of the near-wall region, where part of the volume in the real flow domain 1s occupied 
by roughness elements. However, the logarithmic part of the calculated velocity profile experiences a shift Au’ 
like the shift in a real flow with roughness, and the skin friction increases by the right amount. With the right 
input for Ay’, the shift Au’ can be made to match the Au” that a real flow would have at the same k”. When 


that is done, the calculated AC¢, relative to a baseline calculation for a smooth wall, should closely match ACf 
for the corresponding real flows. 

Thus what is needed is a way to determine the right Ay’ to input to the turbulence model so that flow solu- 
tions match the Au” (k") behavior for the particular type of rough surface in question. This is made easier by 
the fact that the near-wall turbulence model determines a unique relationship between Ay” and Au’, which in 
practice can be determined numerically by a series of flow solutions with different constant input values of Ay”. 
When both the Au '(Ay’) behavior of the turbulence model and the desired Au’ (k’) behavior for the roughness 
in question are known, it is just a matter of interpolation to define a function Ay" (k*) to be used to determine 
the input Ay’. In the author's 3D boundary-layer code, Ay™ (k") functions for several different types of rough- 
ness are included in the code in the form of numerical tables. For a calculation of a flow over a rough surface, 
the user specifies which of the roughness functions 1s to be used and inputs the roughness size k as a function of 
location on the surface (it needn't be constant). Every time the eddy-viscosity subroutine is called in the course 
of such a calculation, the friction velocity uz is known from the previous iteration of the flow solution, k” is 
calculated based on the local k, and the corresponding value of Ay" is determined by numerical table lookup. 

The Au '(Ay") behavior of Mellor's eddy-viscosity model is shown in Figure 6.2.2. The resulting Ay* (k") 
curves corresponding to the Au’ (k*) roughness curves of Figure 6.1.9a are plotted in Figure 6.2.3. Note that 
over the entire transitional roughness regime (k* > 70) the Au’ values in Figure 6.1.9 never exceed 8, so that 
Ay” is nearly the same as Au’, and Figure 6.2.3 is therefore very similar to Figure 6.1.9. 


Figure 6.2.2 Illustration of how the logarithmic wall-region velocity profile, as calculated with a typical near- 
wall turbulence model, responds to an input Ay". This Au'(Ay') response is used to determine Ay" (k’) curves 
like those shown in Figure 6.2.3 that can be used to predict roughness effects. These calculations used the al- 
gebraic eddy-viscosity model by Mellor (1967) 
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Figure 6.2.3 Illustration of Ay'(k’) curves that can be used in a near-wall turbulence model to simulate the ef- 
fects of roughness. These curves are the result of combining the Au'(k’) roughness curves of Figure 6.1.9 with 
the Aw'(Ay’) response of the turbulence model in Figure 6.2.2 
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6.2.2.3 The Equivalent-Flat-Plate Approach 


In this approach, we define an equivalent-flat-plate flow to represent the experiment or target application, and 
then calculate the development of the equivalent-flat-plate flow with and without roughness, thus determin- 
ing ACf. In these calculations, we use an explicit relation between Au’ (k*) and Cf that is valid locally for a 
flat-plate boundary layer, in combination with the von Karman momentum integral equation and an empirical 
smooth-wall skin-friction law that defines the local skin-friction coefficient for the smooth case. The roughness 
size k and even the type of roughness can vary with location along the plate. The advantage of this method is 
that it does not require running detailed CFD calculations. The disadvantage is that the accuracy of predicted 
Cf increments is compromised by the neglect of flow features in the application that are different from a flat- 
plate flow, for example, the effects of pressure gradients or three dimensionality. A further disadvantage 1s that 
it provides no estimate of the effect of ACf on pressure drag. 

For flat-plate flow, we have the following approximate relation between Cr and Au’ (see Hama, 1954, for a 
derivation): 
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which holds not at the same x station in the flows over the rough and smooth walls, but at stations where the two 
flows have the same Rg*. Because the shape factor H of a flat-plate velocity profile varies slowly with Reyn- 
olds number and is only weakly affected by roughness, stations with the same R9g* also have approximately the 
same Rg, and in practical calculations, we will find it more convenient to assume that Equation 6.2.1 applies at 
constant Rg. Our other two basic equations are the von Karman momentum integral equation specialized for a 
flat plate: 


6.2.2 dr 7 dk. — Z : 
and a smooth-wall skin-friction law: 
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where Cfi(smooth) on the right-hand side is not an analytic expression, but instead has to be evaluated by inter- 
polation from a numerical table derived to be consistent with the expression for local turbulent skin-friction as 
a function of Rx, Equation 4.3.5. 
For purposes of calculating the smooth equivalent-flat-plate flow, Equation 6.2.2 is integrated as follows: 
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6.2.4 Res 
For purposes of calculating the rough equivalent-flat-plate flow, Equation 6.2.1 can be solved for Cf(rough), 
and Equation 6.2.2 can be integrated as follows: 
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The numerical table to be used for Equation 6.2.3 and for the value of Cf(smoorn) on the right-hand side of 
Equation 6.2.5 is generated by numerical integration of Equation 6.2.2, with Cr from Equation 4.3.5. Note that 
Cf(smooth) on the right-hand side of Equation 6.2.5 is evaluated at Ro(rough) in keeping with the discussion above 
regarding how Equation 6.2.1 should be applied. 

The numerical calculations required to implement this method can be programmed 1n a spreadsheet. A fixed 
array of points in Rx is set up, and the solutions for both a smooth wall and rough wall are advanced step by 
step by trapezoidal integration. Note that Equations 6.2.3 and 6.2.4 for the smooth case and Equations 6.2.5 and 
6.2.6 for the rough case represent circular references, that 1s, Cf(smoorh) from Equation 6.2.3 is used on the right- 
hand side of Equation 6.2.4 but also depends on the result (left-hand side) of Equation 6.2.4. In a spreadsheet, 
these circular references can be resolved by automatic iteration. 

Figure 6.2.4 shows an example of such a calculation, for an all-turbulent flat-plate flow at RL = 10/ with 
roughness only over a portion of the length (0.2 < x/L < 0.4). At the start of the rough region, the local Cf in- 
creases by about 30%. At the end of the rough section, Cr decreases to a level slightly below that of the smooth 
baseline because the roughness thickened the boundary layer more than in the smooth case. In terms of integ- 
rated skin friction, this “undershoot” effect offsets only a small fraction of the drag penalty of the roughness. 


Figure 6.2.4 Local turbulent Cy for a flat plate at RL = 10’ with roughness only over a portion of the length 
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6.2.3 CFD Prediction of Drag 


As we saw in Section 6.1, drag involves subtle higher-order phenomena in the flowfield. Thus it 1s more difficult 
to compute accurately than the other force components, and it places higher demands on the accuracy of the 
numerical method and on the adequacy of the grid. We'll defer further discussion of these issues to Chapter 10. 


6.3 Drag Reduction 


Efforts to reduce aerodynamic drag can be aimed at the lift-induced drag, the shock drag, the skin-friction drag, 
or the viscous pressure drag. Reducing one component of the drag often entails compromise with the other drag 
components or with nonaerodynamic considerations. In this section, we'll concentrate mainly on the skin-fric- 
tion drag, but first let's put all the components in perspective: 


¢ Lift-induced drag: In Sections 8.3 and 8.4, we'll discuss the induced drag and how reducing it generally 
requires increasing the effective span and thus the structural weight of wings, and must therefore be sub- 
ject to compromise. 

¢ Shock drag: The wing is often the dominant contributor to the shock drag of an airplane in the transonic 
regime, and for high-aspect-ratio wings, the shock drag is largely attributable to the sectional behavior of 
the airfoils. The shock drag of a transonic airfoil depends on how well the airfoil was designed. Assum- 
ing good design, shock drag can generally be reduced by thinning the airfoil, which tends to increase the 
structural weight of the wing, or by increasing the chord, which can increase viscous drag. We'll look at 
these airfoil-design issues in more detail in Section 7.4.8. 

¢ Viscous drag: The dominant issues in viscous-drag reduction depend on whether the body is bluff or 
streamlined. 


In Section 6.1.6, a bluff body was defined as one that fails to produce attached streamline flow at the back end 
and instead has a large separated wake. We saw that because of the large region of separated flow, the drag of 


such a body is relatively high and is mostly pressure drag. Still, such bodies can benefit from careful contouring 
forward of the bluff back end, so as to avoid early separation. This is why the front ends of some cars and trucks 
have become more slender and rounded as fuel mileage has become more important. 

The viscous drag of a streamlined body is mostly skin friction, but there 1s always some pressure drag due 
to the displacement effect of the attached boundary layer. Reducing the wetted area by shortening the body can 
reduce the skin-friction drag, but it tends to increase the pressure drag. Generally speaking, a major goal of good 
design of the body shape is to optimize the trade between skin-friction drag and pressure drag, subject to all of 
the nonaerodynamic considerations that apply. We'll see how this applies to airfoil design in particular in Sec- 
tion 7.4. Finally, there are various ways of reducing the skin friction directly, which is where we'll concentrate 
the rest of this section. 


6.3.1 Reducing Drag by Maintaining a Run of Laminar 
Flow 


In Figures 4.3.1 and 4.3.2, we saw that laminar skin friction is much lower than turbulent skin friction and that 
delaying the transition of the boundary layer from laminar to turbulent should therefore be a powerful means of 
drag reduction. To delay transition, we must generally do something to delay the onset and reduce the growth of 
the instabilities that beset the laminar boundary layer. In Section 4.4.1, we discussed the associated theory and 
saw that instability growth is strongly affected by the pressure gradient and can also be influenced by distributed 
surface suction. 

There are three main approaches to implementing laminar flow for low drag: 


¢ Natural laminar flow (NLF): Instabilities are controlled through the pressure distribution alone. 

¢ Laminar flow control (LFC): Instabilities are controlled through a combination of the pressure distri- 
bution and distributed suction. 

¢ Hybrid laminar flow control (HLFC): A specialized technology for swept wings, in which LFC is ap- 
plied only in the leading-edge region of a swept wing, and NLF persists for some further distance aft. 


In practice, the distributed suction required for LFC or HLFC is usually approximated by suction through 
closely spaced narrow slots or small round holes. 

The application of NLF to drag reduction on 2D airfoils is discussed in Section 7.4.6, and laminar flow on 
swept wings is discussed further in Section 8.6.2. State-of-the-art design for laminar flow generally requires 
doing calculations of instability growth rates and integrated growth factors, using the theory discussed in Sec- 
tion 4.4.1. Implementing laminar flow in high-Reynolds-number applications generally requires very smooth, 
wave-free surfaces, which can be difficult to produce and maintain. 


6.3.2 Reduction of Turbulent Skin Friction 


Like death and taxes, turbulent skin friction on major parts of a vehicle's surface is practically unavoidable. Both 
through the direct sheer force at the wall and it's indirect effect on pressure drag, it accounts for a substantial 
fraction of the drag of most airplanes and water-born vessels. In Section 6.1.8, we saw that surface roughness 
can increase turbulent C¢, so that simply making the surface smooth enough to avoid this penalty 1s the first op- 
tion that should always be considered. Beyond that, to reduce Cf below the smooth-surface level, many schemes 
have been proposed and tested, but most either don't work or are impractical for one reason or another. Because 
being proposed and rejected doesn't prevent their being proposed again and again, it's good to be familiar with 


the things that have already been rejected and the reasons why they didn't work. To this end, I've compiled the 
following list (Table 6.3.1) of turbulent-skin-friction reduction concepts, with brief explanations of their pur- 
ported mechanisms and their actual workability: 


Table 6.3.1 Turbulent skin-friction reduction concepts 


assive turbulent skin-friction reduction 

Slick surfaces or coat- Intuitively, it seems like a slick surface ought to allow the air to just slide along and thus greatly re- 
duce Cr. But in reality, such surfaces have the same no-slip condition that other surfaces do. Coat- 
ings that smooth the surface can sometimes delay laminar-to-turbulent transition and reduce aver- 
age skin friction as a result, but to reduce turbulent skin friction locally, a surface would have to al- 
low significant molecular “slip” at the surface, and no material has ever shown this capability at or- 
dinary atmospheric conditions. 
2D surface waves of low amplitude can reduce average skin friction, but the reduction is more than 
offset by the additional pressure drag caused by the waves (see Lin et al., 1983). 


have never worked in ordinary turbulent air flows (see Hough, 1980). 
Outer-layer manipu- /These are thin plates or airfoils positioned in the outer part of the boundary layer with the intent of 
lators or large-eddy- [interfering with the large eddies. In experiments at low Reynolds numbers, these have produced 
breakup devices drag reductions ranging from none to substantial, but at Reynolds numbers typical of full-scale 
(LEBUs) flight, they have never demonstrated a net drag reduction (Anders, 1989). 
pattern of tiny, alternating ridges, and grooves aligned longitudinally (approximately in the direc- 
ion of the flow), these have been demonstrated to work in many different experiments and applica- 
ions, but the reduction is modest, a maximum reduction in turbulent skin friction of about 6.5% in 
practice at high Reynolds numbers. Because these actually work, they are discussed further below 
Shark-skin surfaces |The tiny scales (dermal denticles) that make up the outer surface of shark skin typically have longit- 
dinal ridges that look like riblets (3—9 ridges on each scale) and seem to be scaled to the flow con- 
ditions in the same way riblets would be (see Bechert and Bartenwerfer, 1989). These have not been 
investigated extensively, but they probably are no better than riblets and would be harder to pro- 
duce. 
“Negative dispersion” [These were a mosaic-like pattern of diamond-shaped protuberances that were predicted theoretic- 
surfaces (Gao and ally to extract turbulence kinetic energy and return it to the mean flow. Initial experimental results 
Chow, 1992) looked promising but have not been replicated. 
Chevrons with ran- [This concept involved spanwise rows of small chevron-shaped protuberances with random phases 
dom phases (Sirovich |between rows and was derived from theoretical calculations. Experiments were not conclusive. 
and Karlsson, 1997) _| 
Active turbulent skin-friction reduction 
pstream slot injec- /These injection schemes can reduce skin friction, but the reduction is more than offset by the ram- 
tion or distributed in- |drag penalty of taking the injection air on board. 
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Electromagnetic tiles (Oscillating electric currents and magnetic fields introduced through a mosaic of surface electrodes 

(Nosenchuck and applies spatially and temporally periodic body forces to the fluid that reduce turbulence production. 

Brown, 1993) This scheme works only in fluids with much higher electrical conductivity than ordinary air. 

MEMS (micro- Arrays of tiny actuators on the surface that operate on the turbulence locally, these will probably 

electro-mechanical never be practical for applications at full-scale Reynolds numbers. The scheme would require very 

systems) actuators large numbers of very small actuators and sensors, covering large areas, and they would be easily 
damaged or contaminated by dirt or ice. 





Of this entire list, only riblets have been shown to have a chance of working in practice, so let's discuss them 
further. 


6.3.2.1 Riblets 


Riblets are a pattern of tiny, alternating ridges, and grooves aligned longitudinally (approximately in the dir- 
ection of the flow). They reduce turbulent Cf by a modest amount by inhibiting the lateral turbulence motions 
near the bottom of the boundary layer, primarily the motions associated with the near-wall streamwise vortices, 
which are associated with the streaks in the sublayer (see Figure 2.1f). Inhibiting the near-wall vortices slightly 
reduces the overall production of turbulence in the boundary layer. 

The effect scales with turbulent-boundary-layer wall variables, just like the effects of surface roughness in 
the transitional regime that we looked at in Sections 6.1.8 and 6.2.2. The Cr reduction correlates with s", the 
ridge-to-ridge spacing in wall units, and typical behavior 1s shown in Figure 6.3.1. In this example, Cris reduced 
over a range of s* up to about 23, a range that varies somewhat depending on the particular riblet profile. The 
peak Cr reduction is around s" ~ 14, which also varies somewhat with the profile. The behavior of the effect 
in the range of low s" below about 5 is not well established by the data, just as we saw in the case of surface 
roughness in Section 6.1.8. At high s", Cr increases above the smooth-surface value, much as it would with 
surface roughness. The optimum spacing of about 14 wall units means that riblets must typically be quite small, 
just as the permissible roughness of a surface is small. We'll look at a specific sizing example next. 


Figure 6.3.1 Typical turbulent-Cyreduction by riblets as a function of the dimensionless spacing s* 
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The general shape of the curve in Figure 6.3.1, with a Cf reduction over a midrange of s' and a Cr increase 
at high s", reflects a competition, as s" increases, between increasing effectiveness of the ridges in inhibiting 
lateral motions, and increasing penetration of high-velocity fluid into the grooves. How this competition plays 
out seems to be strongly related to how the dominant scale of the sublayer vortices in the instantaneous flow- 
field varies relative to s as s’ increases. Based on direct numerical simulation (DNS) calculations of the tur- 
bulent flowfield, Choi, Moin, and Kim (1993) point out that the vortices' average diameter is d” ~ 30 and that 
they tend on average to be centered at y" ~ 20. They present a schematic diagram, reproduced here in Figure 
6.3.2, indicating that at high s", the vortices are able to settle farther into the grooves and to bring high-velocity 
fluid close to the groove walls, thus increasing the average shear stress. This also seems to be reflected in the 
time-averaged flow in an interesting way. Suzuki and Kasagi (1993) found, based on flowfield measurements, 
that there is a time-averaged downward flow into the middle of the groove coupled with upward flow along the 


groove walls. The pattern is very weak in the drag-reducing range of s", but it becomes much stronger in the 
drag-increasing range, as shown in Figure 6.3.3. 


Figure 6.3.2 Schematic diagram of the mechanisms for drag increase and drag reduction by riblets. . (a) s” = 
40 (b) s’ = 20 
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Figure 6.3.3 Time-averaged flow patterns in a riblet groove. From Suzuki and Kasagi, (1993). Used with per- 
mission. Reference vector at the bottom of each plot represents 0.005 we. (a) s* ="5 (b) s° =31 
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The drag-reduction effect of riblets seems to be fairly forgiving of misalignment of the ridges in yaw. There 
have not been many measurements of the yaw effect, so there is no well-established curve. A few degrees of 
misalignment seems to make little difference. The drag reduction is roughly half gone at 15°, and completely 
gone at about 30°. With this level of tolerance, groove alignment is not a major practical issue in most applica- 
tions. 

The effects of profile shape on the C¢ reduction are known almost entirely through experimental investig- 
ations, with the most extensive single collection to be found in Bechert et al. (1997). It has been found that 
sharp and narrow ridges are favored. For ridges with a triangular cross section and a depth ratio h/s = 0.5, the 
dependence on the included angle at the ridge is as shown in Figure 6.3.4. The C¢ reduction is maximum for a 
0° -included-angle “razor blade” ridge and decreases to less than half the maximum at 60°. A profile with a 30° 
included angle, as shown in Figure 6.3.5, gives up a little performance but is a more practical compromise than 


the razor blade. The depth of the grooves seems to be less important than the shape of the ridges, as long as the 
depth is about half the spacing or greater. 


Figure 6.3.4 Cyreduction of riblets with triangular-cross-section ridges as a function of the ridge included 


angle. (Data from Bechert et al., 1997) 
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Figure 6.3.5 Riblet profile with flat valleys and ridges with 30° included angle 


The effects of riblets on the near-wall turbulence have been studied using DNS calculations, but only at rel- 
atively low Reynolds numbers (see Choi, Moin, and Kim, 1993, for example). Besides DNS, there is no quant- 
itative theory that deals with these effects at the level of the actual turbulence physics and explains how the 
Cr reduction varies as a function of the riblet profile. The closest thing we have to a theory is the concept of 
protrusion height, which 1s defined as follows. Consider the riblet profile to be immersed at the bottom of a 
theoretical laminar shear flow at very low Reynolds number (Stokes flow). Far above the tops of the ridges, the 
flow is uniform in horizontal planes, and the vertical gradient of the velocity is uniform. Extrapolating the linear 
velocity profile to zero defines an effective origin that is always found to be between the bottoms of the grooves 
and the tops of the ridges. The protrusion height hp is the distance that the ridges protrude above this effective 
origin, as shown in Figure 6.3.6. The idea of protrusion height has been used in two different ways. Bechert and 
Bartenwerfer (1989) correlated turbulent Cf reduction with the protrusion height calculated for flow parallel to 
the ridges (streamwise protrusion height). Lucini, Manzo, and Pozzi (1991) noted that the cross-flow protrusion 
height is always greater than the streamwise protrusion height and used the difference between the two to cor- 
relate turbulent Cf reduction. The applicability of these theories to the actual physics is questionable, but both 
correlate with the observed qualitative trends. For example, they both predict that sharp ridges are best and that 
Crf-reduction effectiveness increases with decreasing included angle at the ridges. 





Figure 6.3.6 The protrusion height 1s defined as the vertical distance between the riblet ridge and the effective 
origin of an idealized Stokes flow (the point of zero velocity extrapolated from the uniform velocity gradient 
far above the riblets) 
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Choosing the appropriate ridge spacing and estimating the drag reduction for a particular riblet application 
can be done to varying levels of accuracy. A rough estimate can come from a plot like Figure 6.3.1, 1f one is 
available for the riblet profile in question, combined with an estimate of the average s" for the surface, based 
on the dimensional s for the riblets and the average turbulent Cr from Equation 4.3.2. But Cé/Cfsmoorn for a rib- 
let surface varies with Reynolds number and other aspects of the flow conditions, and more accurate estimates 
require using the same kinds of calculations outlined for distributed roughness in Section 6.2.2. For many ap- 
plications, the equivalent-flat-plate approach can be adequate, but detailed CFD calculations with a turbulence 
model modified to simulate the particular riblet profile can provide more accurate accounting for complications 
such as pressure gradients and 3D flow. 

When experimental data are available for the riblet profile in question, they are usually expressed in terms 
of AC¢/Cf£smooth, aS a function of the Reynolds number that was varied in the test facility. With appropriate cal- 
culations applied to the situation in the test facility, ACf/Cfsmooth (R) can be converted to Au’ (s'). This can be 
used directly in the equivalent-flat-plate approach as described in Section 6.2.2. Figure 6.3.7 shows the Au’ (s*) 
curve derived in this way from experimental data for the riblet profile of Figure 6.3.5. 


Figure 6.3.7 Au'(s") curve derived from experimental data for the riblet profile of Figure 6.3.5. Analogous to 
the Au’ (k’) roughness curves of Figure 6.1.9a. Can also be used as a Ay'(s") curve for simulating riblet effects 
in calculations with a turbulence model. It is thus analogous to the curves for simulating surface roughness in 
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In Section 6.2.2, we described how Au" (s*) data for a rough surface can also be converted to a Ay" (s*) table 
for inclusion in a turbulence model for CFD calculations. In the case of riblets, such a conversion isn't generally 
required because their Au’ values are small and are generally in the linear part of the Au '(Ay") response curve 
for near-wall turbulence models, where Au” = Ay’, as can be seen in Figure 6.2.2. The Au’ (s*) curve of Figure 
6.3.7 can therefore also be used directly, without conversion, as the Ay™ (s*) curve for inclusion in a turbulence 


model. We also noted in Section 6.2.2 that using detailed CFD calculations to predict AC¢ due to roughness has 
the advantage that the indirect effect on pressure drag is also predicted. The same advantage applies to using 
CFD to predict the effects of riblets. In applications of riblets to bodies with pressure gradients, the effect on 
pressure drag 1s typically a significant portion of the total drag reduction. 
The fact that the riblet effect scales with turbulent-boundary-layer wall variables makes the use of riblets in 
applications surprisingly simple. First, turbulent Cr tends to vary slowly along the length of a surface, and s* 
0. 


varies as Cf, which varies even more slowly. Thus it turns out that a single riblet spacing can be used along 
the length of a long aerodynamic surface, and the average Cf reduction can be nearly equal to the maximum that 
could be achieved with a variable spacing. To illustrate how this works, Figure 6.3.8 shows the C¢ reduction as 
a function of riblet spacing at 3 and 100 ft from the leading edge of a flat plate at a Reynolds number of 1.7 
million per foot, typical of flight at Mach 0.85 at 35 000 ft. In this example, a spacing of between 0.003 and 
0.004 in. works well over the whole length. This example also gives an indication of how much AC¢/Cfsmooth 
varies with length Reynolds number, and thus illustrates the inaccuracy that can be incurred by using a single 
Cf/Cf£smooth curve like the one in Figure 6.3.1 to make drag estimates. 


Figure 6.3.8 Turbulent-Cy reduction by riblet profile of Figure 6.3.5, as a function of spacing, calculated for a 
flat plate at Mach 0.85 and a Reynolds number of 1.7 million per foot 
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The fact that optimum riblet spacings tend to be so small raises a host of practical issues. Producing sharp 
ridges on such a small scale is difficult, and whatever material the ridges are made of, durability is a problem. 
So far, the favored approach is to mold the profile into a plastic or elastomeric appliqué that can then be applied 
to the surface. But covering large areas of an airplane's surface with “tape” poses many problems, and this tech- 
nology has not yet found its way into routine service on airplanes, though it may in the future. 


| Note that IENs and 20/c should agree far downstream, but that in this case, the discrepancy at x/c = 2 1s lar- 
ger than it was for the flat plate in Figure 6.1.4. Presumably this 1s due to cumulative integration error, which 
could be reduced by grid refinement. The trailing-edge region was very likely the major contributor to the 
error buildup, since the calculations were in the direct mode, and the trailing-edge singularity was simply 
glossed over. 


Chapter 7 


Lift and Airfoils in 2D at Subsonic Speeds 


In a way, lift is the most visible of the aerodynamic forces. We see heavier-than-air animals and machines flying 
through the air every day, and something has to be holding them up there. Of course, we can predict the exist- 
ence of aerodynamic lift mathematically by solving equations of motion for the flow around the lifting object. 
The accuracy of such predictions depends on the level of fidelity of the equations we choose to solve and varies 
with the type of lifting-surface shape and with the flow situation. Some types of lifting flow are easier to predict 
accurately than others. In principle, however, if we had the computing power available to carry out a direct nu- 
merical simulation (DNS) solution of the Navier-Stokes (NS) equations for the flow, we would be able to predict 
any lifting flow, 2D or 3D, with high accuracy. So in one sense, the physics of lift is perfectly understood: Lift 
happens because the flow obeys the NS equations with a no-slip condition on solid surfaces. 

On the other hand, physical explanations of lift, without math, pose a more difficult problem. Practically 
everyone, the nontechnical person included, has heard at least one nonmathematical explanation of how an airfoil 
produces lift when air flows past it. Such explanations fall into several general categories, with many variations. 
Unfortunately, most of them are either incomplete or wrong in one way or another. And some give up at one 
point or another and resort to math. This situation is a consequence of the general difficulty of explaining things 
physically in fluid mechanics, a problem we've touched on several times in the preceding chapters. 

In the real world, lifting flows are never precisely two dimensional, even when we try to make them so, as we 
do in so-called “2D” wind-tunnel testing. It seems like it should be possible to produce precisely 2D airfoil flow 
in a wind tunnel: Just mount a 2D airfoil model so that it spans the space between parallel tunnel sidewalls. But 
in reality 2D flow is practically impossible to achieve because of viscous effects on the tunnel sidewalls and in 
the junctions between the model and the sidewalls, and results of “2D” wind-tunnel testing are always question- 
able to some extent. On many flight vehicles, however, wings are of high enough aspect ratio that the local flow 
at stations over most of the span behaves at least qualitatively like the 2D ideal. Thus exploring the physics and 
doing some of our design work in the ideal 2D world makes sense. Even trying to simulate 2D flow in the wind 
tunnel can be useful in spite of the generally imperfect results. In this chapter, we'll concentrate on nominally 2D 
flow because it's simpler, and we can learn a lot that is generally applicable. 

In this chapter, we'll start with the easy part: the mathematical prediction of lift through solutions of equations 
of motion, and the closely related (and still mathematical) explanations of lift in terms of circulation and vorticity. 
Then we'll look at physical explanations of lift in 2D, starting with a discussion of the strengths and weaknesses 
of many of the explanations already in circulation. With these cautionary tales in mind, we'll try to develop our 
own “best we can do” physical explanation of lift in 2D. Then we'll discuss some of the major physics and design 
aspects of airfoils. In Chapter 8, we'll extend the discussion from 2D to 3D. 


7.1 Mathematical Prediction of Lift in 2D 


The earliest attempt that we know of to predict lift mathematically was Newton's theory for the lift of an inclined 
flat plate. Unfortunately, Newton got it wrong, but that is forgivable, given that his theory predates any rational 
theory of continuum fluid dynamics by more than a 100 years. Newton assumed a “bullet” model for the flow, 
as Shown in Figure 7.1.1, in which particles do not interact with each other before they strike the forward-facing 
surface of the body, and no particle strikes the aft-facing surface, a model whose shortcomings we've already 
discussed in Section 5.1. 


Figure 7.1.1 Newton's model for the lifting flow around an inclined flat plate (an application of the “bullet 
model” of Figure 5.1.1) 
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Assuming that the particles that strike the plate transfer the perpendicular component of their momentum 
to the plate and retain the parallel component, Newton predicted that lift is proportional to the square of the 
angle of attack, for small angles. The square relationship arises because the theory predicts that lift is propor- 
tional to both the number of particles intercepted per unit time and the momentum absorbed from each particle 
and that both increase with angle of attack. Later we'll see that in continuum lifting flows, we get a linear re- 
lationship (ideally) between lift and angle of attack, largely because the angle of attack has very little effect on 
the amount of fluid per unit time that the lifting surface effectively interacts with. So the square relationship 
is simply wrong, and for reasonable angles of attack, the magnitude of the lift predicted by Newton's model is 
much too small and was cited even into the late 1800s as a reason why heavier-than-air flying machines would 
never be possible. 

To get the right answer, we must model the continuum behavior of the fluid correctly. I've already claimed 
that a DNS solution of the full unsteady NS equations does this and should, in principle, predict any lifting flow 
with high accuracy. This claim is untested, however, because such a calculation has yet to be carried out for 
any Reynolds number typical of an aeronautical application. For routine calculations with current computers, 
we must scale our ambitions back to solutions of the Reynolds-averaged Navier-Stokes (RANS) equations with 
turbulence modeling (see Section 3.7). At this level of fidelity, we can predict lift with reasonable accuracy, 
provided that the airfoil or wing is nicely shaped, has a reasonably sharp trailing edge, and 1s at a low enough 
angle of attack that the flow stays attached to both surfaces at least to within a short distance of the trailing edge. 
Fortunately, most airfoils that are of practical interest meet these requirements, and the attached-flow regime 
covers much of the flight envelope of most flight vehicles, so that this predictive capability is highly useful. 

As the angle of attack is increased, however, and boundary-layer separation moves forward on one surface, 
the accuracy of RANS predictions generally deteriorates, primarily because turbulence models are less accurate 
for separated flows than they are for attached boundary layers. As the angle of attack increases further, the sep- 
arated region becomes larger yet, the real flow becomes increasingly unsteady, and most numerical schemes for 


the steady RANS equations will at some point fail to converge to a steady solution. Obtaining realistic solu- 
tions for flows with large separated-flow regions requires solving for at least the large-scale unsteady motions 
associated with the separation. It is not enough simply to solve the unsteady Reynolds-averaged Navier-Stokes 
(URANS) equations. The calculation procedure must somehow enforce a proper distinction between the large- 
scale motions that can be computed and the small-scale turbulence that must still be modeled. Large-eddy sim- 
ulation (LES) and detached-eddy simulation (DES) are approaches that are under exploratory development, and 
we'll look further at them in Chapter 10, but they are not yet in routine use. 

At the next step down in fidelity from steady RANS solutions with turbulence modeling, we find the various 
viscous-inviscid coupling schemes in which the flow is divided into an outer inviscid region and an inner bound- 
ary layer. With the right kinds of coupling and solution schemes, such methods can predict much of the same 
class of flows that can be predicted by RANS, including flows with small separated regions, and within the 
range where they can produce converged solutions, their accuracy can be comparable to that of RANS solu- 
tions. The widely used ISES and MSES codes (Drela and Giles, 1986; Drela, 1993) are examples of this class 
of methods. A disadvantage of coupled methods is that they tend not to handle geometric complexity as well as 
RANS, especially in 3D. 

Even viscous-inviscid coupling solutions require extensive computation and have been practical only since 
the early 1970s. Before that, flowfield prediction for airfoils was restricted to inviscid flow. The loss of fidelity 
incurred by the neglect of viscous effects depends on the type of airfoil and the flow conditions. At low Reyn- 
olds numbers, boundary layers are thick and usually have major effects on the pressure distribution. In the tran- 
sonic regime, the pressure distribution is extremely sensitive to the effective shape of the airfoil and is there- 
fore sensitive to the displacement effect of the boundary layer. But for airfoils with sharp trailing edges in the 
attached-flow regime, at moderate-to-high Reynolds numbers, and outside the transonic regime, predictions of 
lift and pressure distributions from inviscid-flow theory can be adequate for many purposes. In Section 7.4.1, 
we'll look at this issue further, using computational examples to assess the magnitude of viscous effects on lift 
and pressure distributions in incompressible flow at moderately high Reynolds numbers. 


The easiest way to obtain solutions for inviscid flow is through potential-flow theory, which for much of 
the twentieth century provided our only means for predicting the lift and pressure distributions of airfoils in 
2D flow. As simple and powerful as potential-flow theory 1s, however, applying it to lifting flows gives rise 
to a couple of interesting mathematical difficulties. The first difficulty is in representing a lifting flow at all in 
terms of a velocity potential in the manner defined by V = V 9 (see Section 3.10 for an introductory discus- 
sion of potential-flow theory). Two requirements must be satisfied. First, if a potential function is to represent 
a continuous velocity field, the potential must be continuous and have continuous first derivatives. Second, the 
Kutta-Joukowski theorem, which we'll discuss in detail in Section 7.2, tells us that we must have circulation 
around the airfoil if we are to have lift. These requirements conflict: A single potential function that is continu- 
ous throughout the domain surrounding the airfoil cannot represent a flow that has nonzero circulation. This 
follows from the definition of the velocity as the gradient of the potential. The line integral of the velocity on a 
contour from any point A to any other point B must be equal to the change in the value of the potential between 
the two points. If there is nonzero circulation on a closed contour from point A back to point A, then the po- 
tential must have two different values at point A. Thus if we are to be able to represent flows with nonzero 
circulation, we must relax the requirement for continuity of the potential by defining a branch cut from some 
point on the surface of the airfoil to infinity, as shown in Figure 7.1.2, across which a jump in the value of the 
potential, but not the first derivatives, can take place. Note that the jump in potential across the cut 1s equal to 
the circulation around the airfoil and must therefore be the same everywhere along the cut. Note also that for 
any given velocity field with circulation, 1t doesn't matter where we put the branch cut. The only requirement 1s 
that there be a cut. 


Figure 7.1.2 Illustration of a branch cut to allow a velocity potential to represent a flow with circulation 
around an airfoil 
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Even after we've introduced a branch cut to allow for flows with circulation, another difficulty remains. The 
potential jump across the branch cut is now a free parameter in the problem that our potential equation and 
boundary conditions are not sufficient to determine. A solution exists for any value of the potential jump and 
the circulation. Thus the lift, which is the main thing we seek to predict, is indeterminate. Figure 7.1.3 illustrates 
this problem, showing streamline patterns for the same airfoil shape with different values of circulation, all of 
which satisfy the potential equation and the boundary conditions. Kutta (Durand, 1967a) proposed a resolution 
to this indeterminacy, observing that the only physically reasonable solution is the one in which the flow leaves 
the trailing edge smoothly, as in Figure 7.1.3b. This criterion for choosing the preferred value of the circulation 
is known as the Kutta condition. Note that it resolves the non-uniqueness of the potential-flow solution only if 
the airfoil has a single sharp trailing edge. 


Figure 7.1.3 Sketches of potential-flow streamline patterns around the same airfoil shape with different values 
of circulation. (a) Zero circulation; zero lift. (b) Circulation such that flow leaves trailing edge smoothly; some 
lift. (c) More circulation; higher lift than (b) 
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Over the many years since Kutta introduced it, there has been much discussion of what the Kutta condition 
really means, physically speaking. Some critics have seen it as a mathematical artificiality and argued that the 
need for it demonstrates the physical inadequacy of potential-flow theory. The holders of this harsh view seem 
to be in a minority, however. The more prevalent view is that when a physical theory leaves out some physical 
effect, 1t is reasonable to adopt a rule or adjustment if it can be shown to effectively compensate for what was 
left out. In the case of potential-flow theory, what was left out is the combination of viscosity and the no-slip 
condition. It is often observed that viscous flow cannot generally negotiate a sharp corner, as in Figure 7.1.3a,c, 
without separating, and that the presence of viscosity is therefore the reason that the flow pattern of Figure 
7.1.3b is the only physically reasonable one. In this view, the Kutta condition 1s seen as a perfectly reasonable 
proxy for one of the major effects of viscosity; that is, that viscous flows tend to separate from sharp edges. 


The compensation that the Kutta condition provides is not perfect, but it is quite effective. It resolves the 
indeterminacy of the lift and enables potential-flow theory to predict something close to the right lift-versus- 
alpha curve and airfoil pressure distributions in the attached-flow regime, but it leaves the displacement effect 
of the boundary layer unaccounted for (see Section 4.1.3 for a discussion of the displacement effect in general, 
and Section 7.4.1 for examples of the effects of displacement on the lift curve and pressure distribution of an 
airfoil). 

Of course, an “edge” doesn't have to be perfectly sharp to provoke viscous-flow separation, and in the real 
world separation will still anchor itself near the trailing “edge” of an airfoil even if there 1s a considerable degree 


of rounding. Thus even if the trailing edge is rounded, something like the Kutta condition still applies, albeit 
with some uncertainty as to where it applies. 

So the Kutta condition is reasonably seen as accounting for the major effect of viscosity. A logical extension 
of this line of thinking 1s that lift would not exist without viscosity. It can be shown, based on starting the flow 
from rest, that in the absence of viscosity, the nonlifting flow pattern of Figure 7.1.3a is the one that would 
occur (see Gentry, 2006). There is some experimental support for this conclusion, though it is not extensive. 
Fluids without viscosity exist only in the form of superfluids such as liquid helium, and these have been pro- 
duced in the laboratory only in small volumes in which the usual kinds of aerodynamics experiments can't be 
done. There has been one experiment in which a tiny propeller was suspended on a slender fiber in a flow of 
superfluid liquid heltum (Donnelly, 1967). There was no detectable torsional deflection of the fiber, indicating 
that there was no lift on the propeller's blades. I don't know how definitive this result is. Given the small scale 
of the experiment, it's unlikely that the trailing edges were very sharp on the scale of the blade chord. 

A possible alternative mechanism for the Kutta condition in gas flows is compressibility. Compressible flow 
around a sufficiently sharp trailing edge would expand to a vacuum condition, with no plausible way for the 
flow to reach the stagnation point on the other surface. Thus even in the absence of viscosity, we might still see 
the Kutta condition obeyed through the effects of compressibility, a possibility we consider in more detail in 
Section 9.1.5. 

In the early years of our discipline, even solutions for incompressible potential flow around general airfoil 
shapes required too much computation to be practical. As we noted in Section 3.10, analytic conformal mapping 
provides a way to generate solutions for fairly “complicated” shapes with practically no computing. By ap- 
plying this method, Joukowski (Durand, 1967a) was able to define some reasonably practical-looking airfoil 
shapes and predict their pressure distributions. Another simplification, developed by Munk around 1920 (Dur- 
and, 1967a), is the linearized inviscid theory, applicable in the limit of small thickness and angle of attack. We'll 
consider the linear theory further in Section 7.4.1, where we find that it provides interesting insights into the 
effects of shape and angle of attack on an airfoil's pressure distribution. 


7.2 Lift in Terms of Circulation and Bound 
Vorticity 


A major relationship that can be derived from momentum conservation and kinematic considerations (circula- 
tion and vorticity theorems) is that lift is always accompanied by circulation and vorticity. In Section 7.1, we 
saw how this relationship must be accounted for in potential-flow solutions for 2D airfoil flows. It is also a key 
ingredient in quantitative theories of lift and induced drag in 3D, as we'll see in Sections 8.2 and 8.3. And it 
plays a role in a particular kind of explanation of lift in general, as we'll see below, though not the kind of phys- 
ical explanation we'll be seeking in Section 7.3. 

The most basic statement of this relationship is the Kutta-Joukowski theorem (Joukowski, 1906). As origin- 
ally derived, it applies to lift in 2D, but it also applies to stations along the span of a 3D wing in the limit of 
high aspect ratio. The theorem states that the lift per unit span on a 2D airfoil of any shape in a steady, inviscid, 
irrotational flow is proportional to the circulation T on a closed contour enclosing the airfoil: 
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where p and Uw are the density and velocity at infinity. The contour on which T is defined can be any closed 
contour enclosing the airfoil, because the flow everywhere outside the airfoil surface is assumed irrotational, 


and Stokes's theorem (Equation 3.3.4) then requires that [ must have the same value regardless of what closed 
contour is chosen. In a real, viscous flow, the theorem should still apply with high accuracy, provided the con- 
tour is outside the boundary layer and doesn't cut the viscous wake too close to the trailing edge. 

Now, given that lift requires circulation around the airfoil, Stokes's theorem also requires that any closed 
contour enclosing the airfoil must have a net spanwise vorticity flux passing through it, equal to the circulation. 
This vorticity is referred to as bound vorticity because it is associated with the airfoil itself and is not convected 
away from the airfoil by the flow. Consider four ways that the bound vorticity can be thought to reside with the 
airfoil, in decreasing order of realism: 

1. In a real, viscous flow with a physical boundary layer on the surface of the airfoil, the bound vorticity 
is the actual physical vorticity in the boundary layer, as illustrated in Figure 7.2.la. The bound vorticity 
is therefore distributed over some short distance off the surface into the flowfield. In a steady boundary- 
layer flow, vorticity is replaced by convection from upstream and diffusion from above or below as fast 
as it is convected away by the flow, so that it is as if the vorticity were stationary, and we can still think 
of it as bound vorticity (see Section 4.2.4 for a discussion of the vorticity “budget” of a boundary layer). 


2. In an inviscid flowfield like we assumed in the derivation of Equation 7.2.1, the bound vorticity cannot 
reside anywhere in the interior of the field because the flow is irrotational. The vorticity must therefore 
reside on the airfoil surface, as shown in Figure 7.2.1b. It takes the form of an ideal vortex sheet, which 1s 
a sheet of zero thickness in which the vorticity is infinite, but the vorticity flux (strength) per unit width 
of sheet is finite, a concept we discussed in Section 3.3.7. The strength of the sheet is equal locally to the 
magnitude of the potential-flow velocity at the surface, which is consistent with having the interior of the 
airfoil filled with fluid at rest and with the idea that a vortex sheet defines a velocity jump, as we saw in 
Section 3.3.8. 


3. In either of options (1) or (2) above, the vortex sheets on the upper and lower surfaces consist of vor- 
ticity of opposite signs, and their effects are partially offsetting. What really counts out in the field is the 
sum of the upper and lower surface contributions, or the net vorticity at each longitudinal station along 
the chord, which in some models of the flow is taken to reside in a vortex sheet on a camber line between 
the upper and lower surfaces, as illustrated in Figure 7.2.1c. This picture simplifies things considerably 
and is usually what is meant when the term “bound vorticity” is used. We'll find that thinking just of the 
net vorticity makes it easier to think about what happens in the flowfield around a 3D wing, which we'll 
take up in Section 8.1.2. The picture also applies naturally to an airfoil with zero thickness, in which case 
the vorticity strength is equal to the magnitude of the velocity difference (jump) between the upper- and 
lower-surface flows. 


4. In some highly simplified models that are used when only the farfield effect 1s important, the bound 
vorticity is represented by a single vortex, usually placed at the quarter chord, as shown in Figure 7.2.1d. 


Figure 7.2.1 Illustrations of the bound vorticity associated with a lifting airfoil, represented in different ways, 
in order of decreasing physical fidelity. (a) As the vorticity in the physical boundary layer of a real viscous 

flow. (b) As an ideal vortex sheet on the surface itself, in an inviscid flow. (c) As the net vorticity of the upper 
and lower surfaces, located on a camber line between the surfaces. (d) As a single vortex at the quarter-chord 
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Of course, the viscous wake downstream of the trailing edge in 2D flow also contains vorticity, but of op- 
posite signs in the upper and lower portions of the wake. At any longitudinal station along the wake, the net 
vorticity (integrated over the thickness of the wake) is thus a difference between contributions of opposite sign. 
The near wake of a lifting airfoil usually has some curvature and therefore a pressure difference and a velocity 
jump across it, both usually small. This means that the net vorticity convected past the trailing edge is small 
but not exactly zero. As the wake straightens out downstream, the net vorticity in it must go to zero. For this 
to happen, any net vorticity that was there in the near field must be destroyed by viscous diffusion between the 
vorticity of opposite signs in the upper and lower portions of the wake. Then practically no net vorticity is con- 
vected out of the near field by the wake. The global situation 1s as if all the vorticity resided close to the surface 
(the bound vorticity), and there were no other vorticity in the field, just as 1t was for the inviscid-flow case. 

The usual derivation of Equation 7.2.1 invokes momentum conservation applied to a control volume with 
its inner boundary at the airfoil surface and its outer boundary far enough away that the velocity field can be 
approximated by the combination of the uniform freestream and a single vortex of strength I’ located on or near 
the airfoil, representing the integrated bound vorticity on the airfoil surface. The original derivation assumed 
incompressible flow, but it is sufficient to assume subsonic flow and small disturbances 1n the farfield. 

So conservation of momentum in control-volume form, combined with Stokes' theorem, tells us that lift must 
be accompanied by circulation and bound vorticity. Note, however, that Equation 7.2.1 is just a relationship 
between the lift and the circulation, and that it doesn't predict what the lift or circulation will be for a particular 
airfoil shape and flow condition. To predict the lift, we must still solve for the flowfield to some level of fidelity, 
as discussed 1n Section 7.1. And, of course, Equation 7.2.1 doesn't explain what causes the circulation. 


7.2.1 The Classical Argument for the Origin of the Bound 
Vorticity 


An argument to explain the origin of the circulation and thus the existence of lift is given by Prandtl and Tietjens 
(1934) and by Batchelor (1967). It follows the airfoil through the process of a start from rest and relies more on 
observation of experimental flow visualizations and on circulation and vortex theorems than on direct physical 
reasoning. The argument can be paraphrased as follows: 


With the air initially at rest relative to the airfoil, the circulation around any closed contour enclosing the 
airfoil is zero. The air is then suddenly put into “uniform translatory motion” (motion that is uniform in 
the limit in the farfield). One of Kelvin's theorems states that when a nonviscous fluid is suddenly put into 
uniform motion, the circulation around any closed contour is unchanged, so that the circulation around the 
airfoil is still zero. Immediately after this sudden start, the velocity field looks like the steady potential-flow 
solution without circulation, as in Figure 7.1.3a. At this point, the same argument comes into play that was 
used to support the Kutta condition that determines the lift in steady potential flow: The flow around the 
sharp trailing edge has infinite velocity, and now, because air actually has a small amount of viscosity, the 
flow separates smoothly from the trailing edge. Prandtl and Tietjens state that as a result of the separation, 
a “surface of discontinuity,” or vortex sheet, begins to shed from the trailing edge. According to one of 
Helmholtz's theorems, this vorticity must be convected with the flow, and it will therefore be carried down- 
stream. This sheet rolls up into what is called the starting vortex. Prandtl and Tietjens don't explain why the 
separation from the trailing edge leads to the shedding of a vortex, but appeal instead to flow-visualization 
photos that show the vortex forming. The photos also show that the formation of the starting vortex 1s ac- 
companied by the formation of a circulatory flow around the airfoil in the direction opposite to that of the 
starting vortex. Prandtl and Tietjens point out that this is consistent with another one of Kelvin's theorems, 
which requires that the total circulation around the airfoil and the starting vortex continues to be zero, and 
that the individual circulations around them must therefore be equal and opposite. Finally, the flow around 
the airfoil settles into a steady state with nonzero circulation, and the existence of lift follows from the 
Kutta-Joukowski theorem. 


Note that this scenario is more of a description of the process by which circulation is established than it is 
an explanation. To the extent that it is an explanation, it is more logical and mathematical than physical, as all 
the crucial steps along the way are justified by mathematically derived theorems: Kelvin, Helmholtz, and Kutta- 
Joukowski. The theorems are correctly applied, so the logical inferences are correct. But they do amount more 
to logical inferences than to physical explanations. 

A further shortcoming of this argument as an explanation of lift in general is that it concentrates on a partic- 
ular time sequence by which lift can be established (an impulsive start from rest at a fixed angle of attack) and 
“explains” only one aspect of the final steady state (the circulation). One might wrongly infer from the Prandtl- 
and-Tietjens scenario that the final steady-state lift depends on the details of the initial time history. But the final 
steady state 1s nearly always a unique “stable attractor” that is the end point of any one of an infinity of time 
histories (impulsive starts, gradual starts, varying angle of attack, etc.). Thus the final steady state is in a sense 
more fundamental than any of the possible starting sequences, and a more satisfying explanation would deal 
directly with it and show us why it does what it does. And this would require establishing more details than just 
the circulation. 

The Prandtl-and-Tietjens scenario falls short of being a real physical explanation in other ways as well. First, 
it assumes the existence of the starting vortex and supports it only by appealing to experimental flow visualiz- 
ations. Just how or why the vortex forms is never explained. Then it strays into questionable cause and effect 
when it says that “Owing to the formation of the starting vortex, the velocity field is changed... ” This implies 
that vorticity 1s somehow a cause of the velocities that occur elsewhere, which reflects an incorrect interpreta- 
tion of the Biot-Savart law, as we saw in Section 3.3.9. Otherwise, the Prandtl-and-Tietjens scenario, as presen- 
ted, neglects to assign any cause-and-effect relationships or at least fails to make a distinction between logical 
inference and physical cause. 

The general order in which the argument is presented leads the reader to infer that lift is a result of the form- 
ation of the starting vortex. The overall logical inference is true, that if a starting vortex was shed, then there 
must be lift on the airfoil. But this particular inference works just as well in the opposite direction: If there 1s lift 
on the airfoil, then a starting vortex must have been shed. The general direction of physical cause and effect is 


such that it would be closer to the truth to say that the lift force 1s the prime mover, and the starting vortex is a 
result of it, not a cause. 

Batchelor's version of the explanation also refers to experimental flow visualizations, but it dwells in greater 
detail on the initial phase of the formation of the starting vortex, and it makes a more convincing argument that 
the shedding of vorticity is a necessary consequence of viscosity. Overall, however, it also implies that the shed- 
ding of the starting vortex somehow causes the establishment of the circulation on the airfoil. 

A close analogy to the relationship between lift and the starting vortex is the relationship between a bear 
walking through the woods after a fresh snowfall, and the paw prints he leaves behind in the snow. The pres- 
ence of the prints allows the logical inference that a bear has passed by since the snowfall. But this works in the 
opposite direction as well: Knowing that a bear has passed through soft snow allows the equally logical inferen- 
ce that prints must have been left. The logical inference can work either way, but the physical cause-and-effect 
relationship is clearly one-way: The passage of the bear caused the prints, but the prints did not cause the pas- 
sage of the bear. Like a paw print in the snow, the starting vortex is a mostly passive trace left behind by other 
physical events. It is not a cause of those events. 

In “The Origins of Lift,’ Gentry (2006) recounts essentially the same scenario as Prandtl and Tietjens, but he 
goes on to describe some key features of the airfoil flowfield, such as the upwash ahead of the airfoil, the higher 
velocity over the upper surface, and the downwash behind the airfoil, as being caused by the “circulatory flow.” 
Actually, these features are just parts of the circulatory flow pattern, and it isn't logical to say they are caused by 
it. Assigning causation to “circulatory flow” in this way is also closely related to the idea that vorticity at one 
location can “‘cause” velocities elsewhere, which is incorrect, as we saw in Section 3.3.9. 


7.3 Physical Explanations of Lift in 2D 


It's easy to explain how a rocket works, but explaining how a wing works takes a rocket scientist. 


—Philippe Spalart 


In this section, we seek to explain lift in 2D flows, in the context of continuum fluid mechanics, but without 
appealing to mathematics, both to further our own physical understanding and to have a satisfactory explanation 
that we can share with nonexperts. We've already alluded to what a difficult task this is, and because it is so 
difficult we'll end up devoting a lot of attention to it. Not only do we have to deal with the complexity of the 
physics, but we will also have to address an extensive collection of nontechnical lore that this topic has gener- 
ated over the years, most of which is deeply flawed. Because of this, we'll end up spending more time on the 
background than on the explanation itself. But all this effort is justified by the general importance of the issue. 
An explanation for what causes lift is probably the single most important thing most laymen want to know about 
aerodynamics. 

First, we'll look at some of the general characteristics of many of the explanations that are already in circu- 
lation (no pun intended) and discuss their strengths and weaknesses. With that as background, we'll set down 
our “requirements” for a more satisfying explanation, and then we'll proceed to compile our “best” explanation. 
We'll divide this into a basic part that can be shared with a nontechnical audience, followed by some additional 
technical details. 


7.3.1 Past Explanations and their Strengths and 
Weaknesses 


The purpose of this section is not to provide an exhaustive recounting of the explanations that have been offered 
in the past, but to look at some of the general lines of argument they share and to assess their strengths and 
weaknesses. 


7.3.1.1 A General Observation on the Nature of the Problem 


I propose that an underlying reason most of the explanations we'll discuss below fall short is that they set out to 
do more than is logically possible, given the nature of continuum fluid mechanics. In Section 3.5, we discussed 
how predicting what will happen in a fluid flow requires solving the equations of motion, something we can't 
generally do in our heads with sufficient precision to choose the “right” solution from the many kinematically 
possible flow patterns. Yet, in most of the proposed physical explanations of lift, there is an implied assumption 
that a simple linear argument starting from a few basic principles can both predict and explain the existence of 
lift and that no prior knowledge of the characteristics of the flowfield 1s required. We'll discuss this issue further 
when we set down our “desired attributes of a more satisfactory explanation” in Section 7.3.2. 


7.3.1.2 The Flowfield-First Fallacy 


A line of thinking that characterizes many of the physical explanations for lift that have been proposed is 
something I call the “flowfield-first” fallacy. It shows up most seriously in Bernoulli-based explanations, but it 
creeps into other explanations as well. The general line of argument is first to determine, by some argument or 
other, what the flow around the airfoil does and then to deduce that the flow exerts a lift force on the airfoil. 
There is no mention of whether the lift force influences what the flow does. It is not always explicit that the flow 
“causes” the force, but causation 1s implied by the way the inferences run. What is implied is one-way causation 
of the kind that I argued in Section 3.5 1s incompatible with continuum fluid mechanics, in which cause-and- 
effect relationships tend to be reciprocal. Any argument that claims to establish what a flowfield does, without 
reference to the forces that it exchanges with its environment, must be at least incomplete. Note that the math- 
ematical predictions of lift that we discussed in Section 7.1 don't share this failing because, in solving equations 
of motion for the continuum behavior of the fluid, they determine the flowfield and the force together, taking 
into account the proper circular cause-and-effect relationships. 

So the flowfield-first approach is faulty at the outset, in assuming that we can determine key features of the 
flowfield that will lead to a lift force, and that we can make the determination without knowing that the lift force 
is there. The Bernoulli-based explanations are the most serious offenders in propagating this fallacy, but they 
tend to have other faults as well, as we'll see. 


7.3.1.3 Bernoulli-Based Explanations 


The key flow feature that Bernoulli-based explanations appeal to is a region of high velocity that forms over the 
upper surface of the airfoil, which is then said to imply, or even cause, low pressure on the upper surface, as a 
consequence of Bernoulli's principle. Causation 1s not always explicitly stated, but it is implied. We'll see later 
that lower pressure and higher velocity over the upper surface are indeed necessary for lift. But the implication 
that the high velocity causes the low pressure amounts to the same kind of one-way causation that we just saw in 
connection with the flowfield-first fallacy, applied now at another level of detail in the flowfield, and it is wrong 


for the same reason. Again, one-way causation is simply not compatible with the physics. In this case, the high 
velocity and the low pressure are indeed related, but it is wrong to explain the causation as running only in one 
direction. And this error effectively dooms any explanation built along these lines. Because the low pressure 1s 
seen only as a result of the high velocity, and not as part of the cause, it is impossible to explain correctly how 
the high velocity got there in the first place. Most attempts to explain the high velocity without appealing to the 
pressure follow either of two main approaches, both unsatisfactory. 


7.3.1.4 Longer Path and Equal Transit Time 


This is an argument that is widespread 1n explanations aimed at the layman. In this approach, it is assumed that 
the upper surface of the airfoil is more convex than the lower surface, which is often true but not always, and 
that the path the air must follow around the upper surface is therefore longer than the path around the lower 
surface. It is further assumed that fluid parcels that are split apart at the leading edge to traverse the upper and 
lower surfaces must rejoin at the trailing edge as shown in Figure 7.3.1. Thus fluid parcels negotiating both 
paths must do so in equal transit times, and we conclude that the velocity over the upper surface must be higher 
than that over the lower surface. 


Figure 7.3.1 Fluid parcels splitting at the leading edge of an airfoil and rejoining at the trailing edge according 
to the erroneous equal-transit-time assumption 





This explanation is wrong for several reasons that we'll address next, but first we should note that relative 
path length doesn't work well as an indicator of how much lift an airfoil can produce. First, lift can be produced 
with zero difference in path length. For an airfoil consisting only of a camber line (no thickness), there is an 
ideal angle of attack at which the flow attaches smoothly to the leading edge, and the upper- and lower-surface 
flows see the same path length. For example, a camber line consisting of a segment of a circular arc produces 
lift at its ideal angle of attack of 0°. Then, even among airfoils that do have a difference in path length, the dif- 
ference tends to be no more than a small percentage. This is an order of magnitude too small for the path-length 
explanation to account for the lift that airfoils actually produce, as Craig (1997) has pointed out. Airfoils under 
real lifting conditions can produce much lower pressures and much higher velocities over the upper surface than 
any reasonable path-length difference can account for. So where does the path-length explanation go wrong? 

First, the assumption of equal transit time is wrong. There is no reason why fluid parcels that split at the 
leading edge must rejoin at the trailing edge. But to say more about relative transit times, we need to observe 
one careful distinction regarding what fluid parcels we are going to follow to measure the transit time. In real 
viscous flows, we assume that the no-slip condition holds, with zero velocity at the solid surface, so that transit 
times for parcels passing close to the surface approach infinity. Even in ideal inviscid flows there is a similar 
problem. In practically all 2D airfoil flows, the initial attachment of the flow is at a stagnation point at which the 
velocity 1s zero and away from which the velocity initially increases linearly, and again, transit times for parcels 
passing close to the surface approach infinity. This is an example of the infinite delay effect for blunt-nosed 
obstacles in general that we discussed in connection with Figure 5.1.3c. To get around this difficulty, we must 
measure the transit time for parcels that start their journeys at some arbitrary distance above or below the stag- 
nation streamline upstream, and in the case of viscous flow, the distance should be chosen large enough so that 
the parcels remain outside the boundary layers on the upper and lower surfaces. When we do this, we find that 


under lifting conditions, parcels that traverse the upper surface make the trip in less time and get to the trailing 
edge before the corresponding parcels that traverse the lower surface. Parcels that started close together near 
the attachment streamline ahead of the airfoil end up permanently displaced from each other after they pass the 
trailing edge, as shown in Figure 7.3.2. 


Figure 7.3.2 Fluid parcels splitting at the leading edge of an airfoil and ending up displaced from each other 
downstream in an ideal lifting flow that satisfies the Kutta condition 





The equal-transit-time explanation is problematic on another level as well, and that is that it is not a real 
physical explanation. Simply saying that something must go faster than something else to get somewhere at the 
same time does not explain how it goes faster, for example, by identifying the physical force that accelerates it 
to a higher velocity. 


7.3.1.5 Hump, Half-Venturi, or Streamtube Pinching 


As usually presented (see Anderson, 2008, Section 5.19, for example), this approach also assumes that the upper 
surface is more convex than the lower surface. The argument then assumes a general flow pattern along the 
lines of what we discussed in Section 5.1, what I called the “obstacle effect.” The upper surface acts as a kind 
of “hump” or larger obstacle to the flow than does the lower surface, with the result that the streamlines over 
the upper surface are pinched together more than those over the lower surface. Then, as a result of streamtube 
mass-flux conservation, the velocity is higher. As an alternative to the “hump,” an analogy is sometimes drawn 
between the upper surface of the airfoil and the inner surface of a Venturi tube, but this is essentially the same 
argument. 
This explanation is better than the longer-path-length explanation in that the velocity difference that it might 
account for is not so limited. But as usually presented, it has two major flaws: 
1. It doesn't really explain how streamtube pinching comes about at all, let alone why it is greater over the 
upper surface than over the lower surface. Streamtube pinching is not a kinematic necessity, as we saw 
in Section 5.1 in the discussion of Figure 5.1.4, so a dynamical explanation for it is needed. Anderson's 
(2008) version offers only that the flow somehow “senses the upper portion of the airfoil as an obstruc- 
tion” and pinches down to go around it. I assume this is not meant to say that a fluid flow actually has 
some kind of remote-sensing capability, just that in pinching down the flow behaves as if it were sensing 
the presence of the airfoil. Still, this leaves unexplained what physical principle is at work in the pinching- 
down response. Really explaining streamtube pinching requires getting into the details of the dynamics, 
as we did in our discussion of the obstacle effect in Section 5.1. 


2. Appealing to mass-flux conservation (continuity) isn't very satisfying as a physical explanation for 
higher velocity. Conservation of mass is a fundamental physical principle, but at the flowfield level it is 
really more of a kinematic constraint than a dynamical relation (see Section 3.4.1). Really understanding 
why something speeds up requires looking at the forces. 


Some versions of this explanation argue that the airfoil needn't be more convex on the upper surface and that a 
positive angle of attack is sufficient to cause the leading edge to act as a hump and produce high velocity over 
the upper surface near the leading edge. The explanation by Eastlake (2002) is in this category. This version of 
the hump argument has some appeal in that it provides a basis for the variation of lift with angle of attack, but it 


still has the major fault of not providing a satisfactory explanation for how the streamtube pinching and the high 
velocity happen. We can also counter this argument with the observation that it doesn't rule out the zero-lift flow 
pattern of Figure 7.1.3a, in which the trailing edge also acts as a hump, producing high velocity on the lower 
surface near the trailing edge. 


7.3.1.6 Confusion Regarding Low Pressures 


Confused thinking about what low pressure actually means, physically speaking, seems to be widespread, and 
the confusion isn't limited to explanations of airfoil lift. We all tend to think intuitively of lower-than-ambi- 
ent pressure as something that can exert a pull on surfaces that it touches. For example, Shevell (1989) uses a 
tornado as an example in his discussion of the low pressure in the core of a vortex, and states that “The low 
pressure is the force with which a tornado removes the roof from a house.” This of course can't be true in a 
literal mechanical sense. Even the lowest pressure reachable in the core of a tornado must still be positive in an 
absolute sense, and therefore cannot by itself be the force that lifts a roof. If a roof is lifted, it is lifted by the 
pressure beneath it. The low-pressure air above the roof plays its part by not pushing down as hard as it ordin- 
arily would. The low pressure doesn't directly provide the force that lifts the roof. 

The idea that low pressure can exert a pull finds its way into some airfoil explanations that discuss the low 
pressure on the upper surface. It is mostly explanations aimed at popular audiences that are guilty of this, and it 
is often seen 1n both the graphic illustrations and the words. 

In the graphics accompanying many explanations, the pressure distribution around the airfoil surface is plot- 
ted as an array of vectors (arrows) normal to the surface. The trouble is that the vectors' magnitudes are made 
proportional to the local pressure difference, relative to ambient, so that high pressure 1s shown as arrows point- 
ing toward the surface, and low pressure is shown as arrows pointing away from the surface, as seen in Fig- 
ure 7.3.3a. One could argue that this convention is perfectly fine for conveying quantitative information and 
that it shouldn't cause confusion for a technically literate audience, provided the basis is explicitly spelled out. 
However, the quantitative data could be conveyed more easily and clearly in a conventional Cartesian plot. 
In representing the pressure as arrows with directions, the intent is clearly not just to convey data, but also to 
provide a physical feel for how the pressure acts as a force. In this regard, an illustration like Figure 7.3.3a is 
misleading, at least to a nontechnical reader, because it gives the impression that the air in regions of low pres- 
sure 1s pulling on the surface. 


Figure 7.3.3 Airfoil pressure distributions represented graphically as vectors. (a) Arrows proportional to the 
pressure difference p — po. (b) Arrows proportional to the absolute pressure 





To provide the physical feel that Figure 7.33a seeks to provide, but to do it without misleading, is problem- 
atic. We could make the lengths of the arrows proportional to the absolute pressure, as in Figure 7.33b. In this 
case, we have had to assume the equivalent of a relatively high subsonic Mach number, so that the pressure 
differences are large enough to stand out relative to the magnitudes. And this is the only drawback to this type 
of presentation: It exaggerates pressure differences compared to what would actually occur, say, in a general- 
aviation application at low Mach numbers where the pressure differences would be too small to see clearly. 

The words that accompany an illustration like Figure 7.33a are often as misleading as the illustration. The 
word “suction” is often used to refer to the pressures that are lower than ambient. As a technical usage, there 1s 
nothing wrong with this, as long as everyone understands that it just means “lower pressure than ambient.” But 
for a nontechnical reader, it 1s potentially confusing because it tends to evoke an image of the air pulling on the 
surface, like the arrows in Figure 7.33a. 

Another idea that is often put forward 1s that the upper surface produces most of the lift, because the negative 
pressure differences on the upper surface, relative to ambient, are on average larger in magnitude than the pos- 
itive pressure differences on the lower surface. Of course, if we look at it in terms of the actual physical forces, 
it is more correct to say that the air pushing against the lower surface is responsible for all of the lift, and then 
some, and that the air over the upper surface helps out by not pushing down as hard as the lower-surface air 
pushes up. But the view in terms of differences from ambient also has some merit. After all, the absolute am- 
bient pressure would be there even if the airfoil were at rest, and it is the differences from ambient that result 
from the dynamics of the flow and are responsible for the lift. And it is true that the negative differences on the 
upper surface are generally larger than the positive differences on the lower surface. 

It is often implied that this is a general characteristic of the production of lift, which is only partly true. Ac- 
cording to the linearized theory we'll discuss in Section 7.4.1, lift results from the angle of attack and camber 
shape only, and airfoil thickness makes no contribution, to first order. For a cambered-plate airfoil (zero thick- 
ness) in the linear limit, lift is characterized by equal-and-opposite differences on the upper and lower surfaces. 
So for a very thin airfoil at a low lift level, the pressure differences associated with the lift are almost equal- 
and-opposite. But in most practical situations, both thickness and non-linear effects tend to reduce the pressure 
over most of the chord on both surfaces of the airfoil, so that the negative differences on the upper surface are 
actually larger than the positive differences on the lower surface. 


7.3.1.7 Momentum-Based Explanations and the Coanda Effect 


In momentum-based explanations, it is generally argued that the airfoil produces a flowfield in which some of 
the air is “deflected” downward and thus has downward momentum imparted to it. To acquire downward mo- 
mentum, the air must have a downward force exerted on it by the airfoil, and thus, by Newton's third law, the 
airfoil must have an upward force exerted on it by the air. At the highest level, this general approach avoids the 
flowfield-first fallacy that mars most of the Bernoulli-based explanations. At least the mutuality of the force ex- 
change between the airfoil and the air is explicitly acknowledged. But most explanations of this type fall short 
of providing a complete explanation of how the airfoil accomplishes the downward deflection of the stream. 
Some momentum-based explanations emphasize that it is not just the lower surface of the airfoil that deflects 

the flow, and that the flow pattern over the upper surface also contributes strongly to the overall downward de- 
flection. This general assertion is correct, but it is often followed by incorrect reasoning as to how the upper-sur- 
face flow does what it does. For example, Anderson and Eberhardt (2001) and Craig (1997) invoke the Coanda 
effect as the reason that the flow is able to follow the curved upper surface. This is problematic on more than 
one level: 

1. Applying the term “Coanda effect” to an airfoil flow 1s inaccurate and therefore confusing. The Coanda 

effect usually refers to the tendency of a powered jet flow (in which the jet has higher total pressure than 


the surrounding fluid) to attach to an adjacent solid surface and to follow the contour of the surface. Al- 
though the attached boundary layer on the surface of an airfoil is a shear layer, it is not the same as a 
powered jet, and is not usually considered an example of the Coanda effect. As we'll see below, there is a 
limited way in which something like the Coanda effect can be construed as playing a role in airfoil flows, 
but it is a bit of a stretch. 


2. The Coanda effect is erroneously seen as implying that viscosity plays a direct role in the ability of a 
flow to follow a curved surface. Anderson and Eberhardt assert that viscous forces in the boundary layer 
tend to make the flow turn toward the surface, specifically, as they put it, that the “differences in speed 
in adjacent layers cause shear forces, which cause the flow of the fluid to want to bend in the direction 
of the slower layer.” Actually, there is no basis in the physics for any direct relationship between shear 
forces and the tendency of the flow to follow a curved path. 


First, we'll look at what the term “Coanda effect” properly encompasses, and then we'll consider what “flow 
attachment” or “boundary-layer attachment” in ordinary aerodynamic flows really involves and how it differs 
from the Coanda effect. 

The phenomenon that Coanda himself investigated, for which others later coined the term “Coanda effect,” 
was limited to powered jet flows in which the jet is of the same phase (gas or liquid) as the surrounding fluid 
and has higher total pressure. A relatively thin 2D or annular jet (a jet in the form of a sheet) has a tendency 
to bend and attach itself to an adjacent solid surface, and to follow the surface even if the surface has strong 
convex streamwise curvature. This tendency is a result of the jet's entrainment of surrounding fluid and of the 
requirements imposed on the flow pattern by the continuity equation. 

The effect is illustrated in Figure 7.3.4. In (a), we see an isolated 2D jet flowing into otherwise quiescent 
air. Whether the flow issuing from the nozzle is turbulent or not, at high Reynolds numbers the jet downstream 
will be turbulent, and the important thing for our purposes is that a turbulent jet strongly entrains fluid from the 
surroundings as it spreads downstream. Outside of the turbulent jet itself, the fluid flows toward the jet, thus 
feeding the entrainment. The velocities required to feed the entrainment are not large, but they are important, as 
we'll see. In (b), we've introduced a solid surface adjacent to the jet, with the leading edge of the surface close to 
the edge of the jet, but with the rest of the surface curving increasingly away from the jet. Because we've placed 
the surface in a region that in the case of the isolated jet was nearly quiescent air, we might naively expect it 
not to have much effect on the high-velocity flow in the jet, and for the jet to continue to flow straight, as it 
did in (a). But blockage of the flow feeding the entrainment makes it impossible for the flow pattern of (b) to 
be sustained. The air between the jet and the surface is entrained faster than it is replaced from the surround- 
ings, and the flow quickly switches to the pattern in (c), where the jet bends to flow along the curved surface. 
The pressure field simultaneously adjusts (Remember circular cause and effect between velocity and pressure!) 
so that a pressure gradient normal to the local flow direction balances the centrifugal force associated with the 
curvature of the flow. So we see that even in a jet flow that exhibits the Coanda effect, the curvature of the flow 
is not a direct result of viscous forces (or their turbulent counterparts), but an indirect one. 


Figure 7.3.4 Illustration of the Coanda effect. (a) Isolated free jet in otherwise quiescent air. (b) Curved sur- 
face added adjacent to the jet, but not altering the flow. (c) Jet attached to the curved surface 





(c) 


In ordinary aerodynamic flows without powered jets, flow attachment has very little in common with the 
powered-jet effect that Coanda investigated and is really just the absence of boundary-layer separation. In Sec- 
tion 4.1.4, we saw that boundary-layer separation from a smooth surface in a 2D flow generally requires rising 
pressure (an adverse pressure gradient) to stagnate the low-velocity fluid. Counteracting the effect of an ad- 


verse pressure gradient is the favorable viscous force by which the higher velocity fluid farther from the surface 
drags the low-velocity fluid along. Boundary-layer separation thus involves a tug-of-war between the adverse 
pressure gradient and an opposing viscous force. At any given station along a surface subjected to an adverse 
pressure gradient, the pressure gradient will generally be winning the tug-of war locally, slowing the fluid near 
the wall, and reducing the velocity slope at the wall. How far the boundary layer perseveres into the adverse 
pressure gradient before it separates depends on the rate at which the pressure gradient wins and the velocity 
slope at the wall decreases. Until the slope of the velocity profile at the wall is brought to zero, the boundary 
layer remains attached, just like the corresponding inviscid flow would under the same conditions. Separation 
occurs only when the adverse pressure gradient has acted over a long enough distance to produce reversal of 
the velocity profile. Thus the role of viscous forces in maintaining boundary-layer attachment is to reduce the 
deceleration caused by the pressure gradient and to help the low-velocity fluid at the bottom of the boundary 
layer keep moving, and the viscous forces are needed only in situations where the pressure gradient is adverse. 
Viscous forces have nothing direct to do with causing the flow to turn and follow a curved surface. 
There are two things that very likely have contributed to confusion regarding this issue: 
1. The boundary layer on the surface of the airfoil is effectively a sheet of vorticity. In Section 3.3.5, we 
saw that one of the kinematic properties of vorticity is that fluid parcels have a solid-body-rotation com- 
ponent to their motion, with angular velocity w/2. If we picture an airfoil oriented so that the flow is from 
left to right, fluid parcels in the upper-surface boundary layer will have a solid-body-rotation component 
to their motion, in the clockwise direction. It seems intuitively natural that fluid parcels that are rotating 
will tend to follow curved paths (curved downward in this case). This is a pre-Newtonian sort of intu- 
ition, however, that has no basis in the physics (the meaning of “pre-Newtonian” is discussed at the end 
of Chapter 2). There is no connection between fluid-parcel rotation and curved paths in the flow. 


2. There 1s an indirect association between surface curvature and the need for viscous effects in the main- 
tenance of flow attachment. Convex surface curvature is often, though not always, associated with an 
adverse pressure gradient, in which case favorable viscous forces are needed to prevent separation. But 
the viscous forces prevent separation by dragging fluid along in the direction of the local flow, not by 
directly contributing to the turning of the flow. 


If viscous forces make no direct contribution to the turning of the flow when the surface is curved, what 
actually causes the flow to turn? The answer to this question lies in the interplay between the velocity field 
and the pressure field, which works in the same way whether the fluid is viscous or not. When a flow turns to 
follow a curved surface, it is able to do so because the pressure field adjusts so as to provide the force needed 
to accelerate the fluid toward the center of curvature. Thus the centrifugal force generated when the flow fol- 
lows a curved path is countered by a pressure gradient perpendicular, or normal, to the local flow direction. The 
normal pressure gradient and the flow curvature have a reciprocal relationship in which they cause and support 
each other simultaneously. This is just another aspect of the circular cause-and-effect relationship between the 
pressure and velocity fields that we discussed in Section 3.5 and that will figure heavily in the physical explan- 
ation for lift that we'll develop in Section 7.3.3. Note that the normal pressure gradient is perpendicular to the 
streamwise pressure gradient that we considered earlier and that it 1s the streamwise pressure gradient that plays 
the important role in determining whether the viscous boundary layer separates or remains attached. 

So viscous forces play no direct role in ordinary flow attachment, contrary to Anderson and Eberhardt's ex- 
planation. A good counterexample to the Anderson and Eberhardt argument is the flow around a rotating cir- 
cular cylinder with the freestream perpendicular to the cylinder axis. This is an example of a flow in which the 
tangential motion of the surface, due to rotation, affects the location of separation. In this case, the flow follows 
the curved surface farther around the side of the cylinder where the surface is moving with the flow, and it sep- 
arates earlier from the side on which the surface 1s moving against the flow. But if we apply the Anderson and 
Eberhardt argument to this flow, it predicts the opposite of what is observed. On the side where the surface is 


moving with the flow, the viscous stresses are reduced or even reversed, and the ability of the flow to follow 
the curved surface would be reduced, according to their argument, but in fact it is enhanced. And vice versa for 
the other side of the cylinder. The observed effects on both sides are consistent with ordinary boundary-layer 
theory, which correctly accounts for the effects of pressure gradient and surface motion. 

We've looked in some detail at what the Coanda effect actually is, as it is usually understood to apply to a 
“powered” jet flow, and why it is not needed for ordinary flow attachment. But might there still be a way that it 
can properly be said to apply to airfoil flows? Perhaps, but it is at most a very limited way. Figure 7.3.5 shows 
two candidate flow patterns around an airfoil at a moderate angle of attack. 


Figure 7.3.5 Candidate flow patterns around an airfoil at a moderate angle of attack. (a) Separation ahead of 
the trailing edge. (b) Attached flow to the trailing edge 
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In Figure 7.3.5a, the flow separates ahead of the trailing edge and doesn't reattach to the surface, and in Fig- 
ure 7.3.5b, the flow remains attached all the way to the trailing edge. At a low enough angle of attack, where the 
attached-flow pattern of (b) is the correct one, we could argue that what rules out the alternative separated-flow 
pattern of (a) is the same entrainment mechanism that is responsible for the Coanda effect. The separated shear 
layer of (a) entrains fluid faster than it can be replaced from downstream of the trailing edge. So for airfoils 
with attached flow to the trailing edge, we could say that it 1s something like the Coanda effect that rules out 
a separated-flow pattern. But this is not the same as saying that the Coanda effect is needed for the attached- 
flow pattern to be possible. The fact is that we needn't appeal to any viscous-flow mechanism, boundary layer, 
Coanda, or otherwise, to explain how flows can follow curved surfaces. Even ideal inviscid “flows” represen- 
ted by solutions to the potential equation have no trouble following curved surfaces. And in real viscous flows, 
the natural tendency of the boundary layer is to remain attached unless it is provoked to separate by an adverse 
pressure gradient that is too strong, as we saw in Section 4.1.4. 


7.3.1.8 The Water-Faucet Demonstration 


In some explanations that refer to the Coanda effect (Anderson and Eberhardt, 2001, for example), a “simple 
experiment” is cited as a demonstration. If a curved object, like a spoon or a water glass, is held tangent to a 
small vertical stream of water from a faucet, the stream will deviate from its original vertical path and follow 
the curved surface, as shown in Figure 7.3.6a. This effect is obviously not a result of the same turbulent-jet en- 
trainment that is responsible for the regular Coanda effect. Instead, it seems to be due to molecular attraction 
between the liquid and the solid and to the fact that the stream resists being torn apart, because of the surface 
tension at the water-air interface. These forces should be independent of the speed of the flow. Thus the amount 
of deflection they can produce should decrease as the speed of the stream increases, and, indeed, this seems to 
be the case. You can demonstrate to yourself that if the stream is sufficiently slow, the entire stream will follow 
the curvature of the glass, but if the stream is too fast, it will break up, and much of it will leave the surface of 
the glass without being deflected much, as shown in Figure 7.3.6b. In any case, such water-stream demonstra- 
tions are not relevant to explaining how aerodynamic flows follow convex surfaces. 





Figure 7.3.6 Water-faucet experiment that is sometimes proposed as a demonstration of the “Coanda effect” 
but actually demonstrates molecular attraction and surface tension (Photos by the author). (a) The slowest co- 
herent stream that an ordinary garden nozzle can produce enters vertically downward on the left, adheres to the 
surface of a drinking glass, and is deflected through more than 90°. (b) A somewhat faster stream from same 


nozzle breaks up and is not deflected nearly as much 
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7.3.1.9 A Momentum-Based Argument in Which Flow Turning Comes 
First 


A variation on the momentum-based argument is put forward by Weltner and Ingelman-Sundberg (2000) on 
their web site. It is noteworthy because it explicitly rejects the one-way causation from velocity to pressure that 
is common in Bernoulli-based explanations, but then implies one-way causation in the other direction. Their 
argument goes as follows (paraphrased): 


It is wrong to argue that the high flow speed over the upper surface of an airfoil “causes” the low pressure 
there because the pressure difference whose existence we're trying to justify must have been there in the 
first place to accelerate the flow to higher speed. So where does the pressure difference come from? It 
arises because the airfoil deflects the flow, or causes it to change direction. So the change in flow direction 
causes the reduction in pressure, which in turn causes, or at least implies, the increase in flow speed. 


This argument claims, in effect, that it is not correct to invoke a longitudinal acceleration (a change in speed) 
as the sole reason for a pressure change, but in the case of an airfoil flow, it is permissible to invoke the normal 
acceleration (a change in flow direction). The implied justification is that the primary effect of the airfoil surface 
is to force the flow to change direction and that it is therefore logical for the normal acceleration to precede the 
pressure change in the chain of cause and effect. 

This idea has considerable intuitive appeal, but it is not entirely correct. The problem is that the interaction 
of most of the flow with the solid surface is not as direct as this argument implies. Only one vanishingly thin 
streamtube (the stagnation streamline) actually comes into contact with the airfoil surface, and the normal ac- 
celerations of all other streamtubes happen out in the field, just like the longitudinal accelerations do. For most 
fluid parcels, there is no direct interaction with the airfoil surface, only with adjacent parcels, and in this situ- 
ation, there is no basis in the physics for making a distinction between the normal and longitudinal components 


of the acceleration. They are both just accelerations, and neither one has a one-way causal link to the pressure. 
The original argument correctly states that a change in flow speed requires a pressure difference. We can say 
the same thing about a change in flow direction: The only thing that can cause a change in the velocity vector 
is a pressure gradient. Thus for the normal acceleration to happen, the normal pressure gradient must already 
be there. And then if we incorrectly limit ourselves to one-way causation, we leave unanswered the question 
of what causes the pressure gradient. A correct explanation must acknowledge circular causation between the 
pressure and velocity fields. 


7.3.1.10 The Bernoulli-versus-Momentum “Controversy” 


Despite their flaws, both the Bernoulli approach and the momentum approach manage to get at parts of the truth. 
As we saw in Section 5.4, it is the pressure that transmits the lift force to any lifting body. And lift is invariably 
accompanied by low pressure on the upper surface and must therefore also be accompanied by elevated velocity 
outside the boundary layer, consistent with the Bernoulli explanation. Generating lift also requires deflecting 
the flow downward and imparting downward momentum, as in the momentum explanation. The Bernoulli and 
momentum explanations just appeal to different aspects of the same global phenomenon. The Bernoulli explan- 
ation is based on the near-surface flow and the surface pressure, while the momentum explanation is based on 
a manifestation of lift that extends into the far-field. They are not contradictory. However, having two different 
explanations that both seem “right” has led to some heated arguments, especially on some web sites that offer 
explanations of lift. 

For example, there has been a tendency for advocates of either the Bernoulli approach or the momentum 
approach to argue that one is correct and the other is wrong, leading to a Bernoulli-versus-momentum (or 
Bernoulli-versus-Newton) “controversy,” as seen, for example, in much of the discussion accompanying the 
Wikipedia article on “Aerodynamic lift.” An alternative put forward by some other commentators is that both 
approaches are correct, but that they represent two separate mechanisms and two separate types of lift. The ex- 
planation on the ALSTAR instructional web site (Florida International University) 1s in this camp. According 
to this view, a flat plate at an angle of attack produces only “reaction lift,” and an airfoil with camber produces 
“Bernoulli lift” at zero angle of attack and adds “reaction lift” as angle of attack increases. The Bernoulli-versus- 
momentum controversy and the Bernoulli-lift/reaction-lift distinction are false, of course. Lift is a phenomenon 
that always involves a seamless combination of pressure difference and momentum transfer. 

Another argument that is often made, as in several successive versions of the Wikipedia article “Aerodynam- 
ic Lift,” is that lift can always be explained either in terms of pressure or in terms of momentum and that the two 
explanations are somehow “equivalent.” This “either/or” approach also misses the mark. It's true that lift can be 
accounted for quantitatively either by the integrated pressure difference or by the momentum transfer. But such 
accounting doesn't constitute a satisfying physical explanation for how it all happens. A complete explanation 
must address all of the necessary aspects of the flowfield, and neither the Bernoulli nor the momentum approach 
is complete in this sense. As we'll see in Sections 7.3.3 and 7.3.4, a complete explanation must address both the 
velocity magnitude (Bernoulli) and direction (downward turning) because the flow we're trying to explain is not 
one-dimensional, but two-dimensional. 


7.3.1.11 An Explanation Based on Flow Curvature 


Babinsky (2003) begins by explaining the relationship between streamline curvature and the pressure gradient 
in the cross-stream (normal) direction, and he does so without any incorrect implication that the cause-and-ef- 
fect relationship is one way. He then presents graphic illustrations of streamline patterns in flows around air- 
foils, showing that under lifting conditions the streamlines above and below the airfoil are generally curved 


downward. The pressure gradient (low pressure to high) is thus upward, and given that the pressure far above 
and below the airfoil must be close to ambient, the pressure on the airfoil upper surface must be low, and the 
pressure on the lower surface must be high, so that there is lift. Babinsky further shows that if the airfoil is thick, 
the high pressure on the lower surface may not materialize, but the pressure difference between the upper and 
lower surfaces can still provide lift. This explanation is correct as far as it goes, but it is incomplete in that it 
doesn't explain how the pressure gradients in the streamwise direction are sustained. 


7.3.1.12 Lanchester's Explanation 


An interesting early explanation was put forward by Lanchester (1907) and is paraphrased in the historical 
sketch by Giacomelli and Pistolesi, in Durand (1967). Lanchester first imagines a horizontal 2D flat plate (finite 
chord, infinite span) moving vertically downward, in effect a 2D planar parachute. There is, of course, a lift 
force on the plate, with low pressure above the plate and high pressure below, and air both above and below the 
plate is dragged downward. Because there can be “no permanent change of density or accumulation of matter 
in the lower strata of the atmosphere,” the downward current above and below the plate must be accompanied 
by upward currents around the edges of the plate, driven by the difference in pressure. This is the same general 
flow pattern that we discussed in Section 5.1 in connection with the “obstacle effect.’’ Lanchester imagined this 
motion to be associated with “a field of force established around the plane when the load was first applied: a 
field of force everywhere defined by the acceleration of the air particles.” 

Lanchester then considers adding a horizontal motion to the vertical motion, so that the plate becomes a 2D 
“slider” descending along a sloped path. He deduces the resulting velocity field not by superposition of velo- 
cities, but by arguing that fluid particles passing through the same “field of force” that was there in the case of 
pure vertical motion 


“will receive an upward acceleration as they approach the aerofoil, and will have an upward velocity as 
they encounter its leading edge. While passing instead under or over the aerofoil, the field of force is in 
the opposite direction, viz. [that 1s,] downward, and thus the upward motion is converted into a downward 
motion. Then, after the passage of the aerofoil, the air is again in an upwardly directed field, and the down- 
ward velocity imparted by the aerofoil is absorbed.” 


Note that although the velocity field with only vertical motion was symmetrical fore and aft, with upwash 
off both the leading and trailing edges, the field in the presence of forward motion is asymmetric, with down- 
wash behind the trailing edge rather than upwash. Lanchester deduced from this flowfield picture that a slightly 
cambered airfoil, with the trailing edge turned down to enhance the downwash and the leading turned down to 
meet the oncoming upwash, should be superior to an airfoil with zero camber. Lanchester concludes that the 
vertical velocity of the air particles far ahead of and far behind the airfoil is zero, by the same argument as 
before, that a nonzero velocity would result in an accumulation of matter. He also concludes that there is no 
“continual transmission of energy to the air’ and that the drag associated with such a 2D lifting motion 1s zero, 
ignoring skin friction. 

Lanchester's explanation 1s intriguing for several reasons. The first is that 1t was developed so early in the his- 
tory of the field, before the Kutta-Joukowski theorem was known and before potential-flow solutions for flows 
around airfoils had been derived. Further, and to its credit, Lanchester's explanation acknowledges a mutual re- 
lationship, rather than one-way cause and effect, between the velocity and the pressure (the “field of force’), 
and it provides an essentially correct account of some key features of 2D lifting flow: upwash ahead of the air- 
foil, and downwash behind, decaying to zero in the farfield. It does not explicitly address how lift varies with 
angle of attack, but it could easily be extended to do that. One weakness is that the 2D-parachute flowfield is 
not a very good model for the flow in the presence of forward motion. Of course, a steady parachute-type flow, 


with a massive separated wake above the plate, would be a terrible model. Lanchester avoided the problem of a 
separated wake by stipulating that we take the “field of force” to be that which existed at the moment when the 
load was first applied (the pressure field immediately after an impulsive start). Even so, this leads to a pressure 
field that is wrong in one important detail: The pressure field for a flat plate in vertical motion alone is sym- 
metrical fore and aft, while the pressure field associated with forward motion at an angle of attack has a marked 
fore-and-aft asymmetry, as we'll see later. So Lanchester's explanation avoids some of the shortcomings of the 
other explanations that we've seen, but his flow model is inaccurate in some details. 


7.3.1.13 An Unusual Argument for Downward Turning in the Nearfield 


Hoffren (2001) argues that the streamline coming off the trailing edge of an airfoil at an angle of attack is dir- 
ected downward and asymptotically levels off at a level below that of the trailing edge. Then, using essentially 
the same “no accumulation” argument that Lanchester (1907) used, he argues that the stagnation streamline ap- 
proaching the leading edge must have started at the same low level. This means that the flow generally rises 
ahead of the airfoil and descends behind it and must therefore experience downward turning in the neighbor- 
hood of the airfoil. This kind of argument establishes that if the flow leaves the airfoil at a downward angle, 
there is a kinematic necessity for downward turning. But it doesn't establish a physical mechanism for the down- 
ward turning. 


7.3.1.14 Two Other Recent Explanations 


Craig (1997), in Stop Abusing Bernoulli—How Airplanes Really Fly, a book that has been popular in the air- 
plane home-builder community, proposes an interesting wrinkle on the conventional momentum-based explana- 
tion, but unfortunately ends up generating more confusion than enlightenment. He draws attention to the upwash 
ahead of the leading edge of an airfoil and outboard of the wingtips of a finite wing, invoking the same reason 
for it that Lanchester did, that is, that to avoid an accumulation of matter, the downwash above and below the 
lifting surface must be accompanied by upwash elsewhere. For this upwash, he coins the term “recirculation,” 
apparently to reflect the idea that the air involved in the upwash motion was previously part of the downwash. 
But this idea correctly reflects the actual flowfield only far downstream of a wing of finite span, where the flow 
actually circulates around each member of a pair of trailing vortices. (We'll discuss the trailing vortex wake of a 
finite wing in some detail in Section 8.1.2.) 

While the air immediately around the airfoil is part of a flow with circulation, as we saw in Section 7.2, none 
of 1t remains in the vicinity of the airfoil long enough to “recirculate” in any reasonable sense of the word, so 
the terminology is confusing at best. Furthering the confusion, Craig's Figure 2.5 shows a perspective view of 
a wing with arrows indicating that air from below the lower surface circulates forward, then upward in front of 
the leading edge, and then back, ending up above the upper surface. This picture might be correct if we isolated 
the circulatory part of the velocity field and regarded the arrows as streamlines of that part of the field, but Craig 
doesn't stipulate that that is what his Figure 2.5 represents. 

If we interpret the arrows as particle paths, the picture is simply wrong. (Ordinarily in steady flows, particle 
paths and streamlines coincide, but not in this case, because the circulatory streamlines were constructed based 
on only a part of the velocity field.) In any case, the fact is that the air involved in the upwash ahead of the lead- 
ing edge (and directly outboard of the tips of a finite wing) has not yet been involved in any of the downwash. 
Nevertheless, Craig refers to the upwash ahead of the airfoil as “recirculation” and assigns to it a major role in 
the production of lift. The accompanying discussion of transfers of kinetic energy from downwash to upwash 
and of lift being a “regenerative” process is confusing, and it is difficult to assess its physical correctness or lack 
of it. 


A very clear description of the flowfield around an airfoil is given in See How It Flies, a web site for pilots, 
produced by a physicist (Denker, 1996). Many of the details of the flowfield and the pressure field around an 
airfoil are described and illustrated through excellent diagrams and animations. Cause-and-effect relationships 
at the local level, such as the relationship between pressure differences and flow curvature, and pressure dif- 
ferences and flow speed are described. At the global level, circulation and the Kutta-Joukowski theorem are 
discussed. Denker's presentation provides the reader with a wealth of information and physical interpretation, 
all to a high technical standard. Its only shortcoming, to my mind, is that it doesn't quite manage to show how 
all the pieces fit together, or to establish a complete web of cause-and-effect relationships at the global level. 


7.3.2 Desired Attributes of a More Satisfactory Explanation 


Every explanation we've looked at so far has been flawed in one way or another. We would of course like our 
explanation to avoid such flaws. But beyond that, what other attributes would we like it to have? Remember 
that we seek a physical explanation that does not require mathematics. In this regard, observing that we can pre- 
dict lift by solving the NS equations is not a satisfactory explanation. Some, for example, Shevell (1989), have 
concluded that circulation and the Kutta-Joukowski theorem provide the only satisfactory explanation, but this 
is really just math as well. Besides, even if we were to accept the Kutta-Joukowski theorem as part of our ex- 
planation, it would just shift our problem from that of explaining where the lift comes from to that of explaining 
where the circulation comes from. 

We acknowledge at the outset that it 1s not possible, without math, to predict the existence of lift, without 
knowing a priori some things about the flow that produces it. Providing ironclad “proof” that lift must exist is 
too much to expect, because that would require quantitative precision that is not available without computation 
or at least a simplified quantitative theory. Our objective is therefore to explain, not to predict or prove. Even 
though we've thus limited our objective, we still want our explanation to be as complete as possible, consistent 
with the correct physics. By “complete,” we mean the following: 

1. Though it is fair game to assume a priori some knowledge of what the flow does, we should explain in 
a satisfying way how the flow does it. 


2. Our explanation should be based on a logical application of physical principles and leave no gaps. 


3. Not only should the logical inferences be correct, but the direct physical cause-and-effect relationships 
should be made clear. 


4. The explanation should make clear not just how lift is produced, but also how the magnitude of the lift 
can be “controlled,” that is, how it varies with angle of attack and airspeed. 


Although we have strong justification for relying on prior knowledge of what the flow does, we still tend to 
think of it as “cheating” somehow. We therefore have an inclination to try to define the minimum prior know- 
ledge that needs to be assumed, and to try to deduce everything else from first principles. There is a strong 
appeal to the idea of an explanation that is maximally “self-sufficient” in this sense. But would such self-suffi- 
ciency really serve a pedagogical purpose? I think not. It seems to me that it would be better to assume the level 
of prior knowledge that makes for the clearest explanation, and that there 1s no disadvantage in assuming more 
than the minimum, provided it 1s all explained in the end. 

So completeness and clarity are high on our list of objectives. There are also some faults we want to avoid: 

1. Misrepresenting the phenomenon as being simpler than it really is. Given the subtlety of the phenomen- 
on we're trying the explain, oversimplification is a great temptation, and it is a key weakness in many of 
the explanations we discussed in the previous section. Von Karman is reported to have said that in ex- 


plaining things to a nontechnical audience, a Plausible Falsehood 1s preferable to the Difficult Truth (see 
Sears, 1994). I don't share this pessimistic view. 


2. One-way causation of the kind we know is inconsistent with the physics, for example, “Lift 1s due to 
circulation” or “The pressure difference is caused by a velocity difference.” 


3. Assuming something that isn't subsequently explained, for example, “Circulation means there is lift,” 
but not explaining where the circulation comes from. 


4. Dependence on things that we know are not needed, for example, a sharp trailing edge or a curved 
upper surface. 


5. “Naming” as a substitute for explaining, as, for example, in saying that a jet flow follows a curved 
surface because of the Coanda effect, where “Coanda effect” is just a name for the tendency of jet flows 
to follow curved surfaces. 


An explanation that meets these requirements is not going to seem as tidy as most of the explanations we 
discussed in the previous section. However, as much as we'd like our explanation to be short and simple, we 
should resist the temptation to oversimplify and should aim for a level of detail and fidelity that does justice to 
the physics and meets our “completeness” requirement. A satisfactory explanation must resolve the Bernoulli- 
versus-momentum “controversy” and show how both pressure differences and momentum transfer are neces- 
sary parts of the picture. And it must blend these two seemingly independent aspects of the phenomenon into a 
coherent whole. 

Ease of sharing favors an explanation that can be conveyed in relatively simple words, with no more graphic 
aid than can easily be sketched on a napkin. A drawback to simple graphics, however, is that the levels of detail 
and physical fidelity are limited. One way to increase the level of detail and to ensure fidelity is to introduce 
graphics with real quantitative accuracy. Going further in this direction, digital animation could probably be 
used to provide a level of “feel” for the physics that would be difficult to get any other way, for example, by 
illustrating time histories of forces and velocities as fluid parcels move through the field. But either of these 
high-tech options would reduce “portability.”” My choice here 1s to try to achieve a satisfactory basic explana- 
tion without leaving the “talking/ sketching” realm. 

Because our intent is to share our basic explanation with a “public” that will not have read the background 
in this book, and because we'd like our explanation to have some staying power, it behooves us to give some 
thought to its “fitness for survival” in the marketplace of ideas. Practically everyone we share our explanation 
with will have heard other explanations before ours and will hear more in the future. Superior correctness on 
the part of our explanation can help, but is not likely to be sufficient, because the general public receives little 
real-world feedback regarding the correctness of its understanding of aerodynamics. Dawkins (1976) drew an 
analogy between the survival of ideas in a human population and the survival of genes 1n a biological gene pool, 
and proposed that Darwin's insights on natural selection apply in both universes. He coined the word meme to 
represent units or groups of ideas that can be passed from brain to brain and whose survival depends on their 
ability to “reproduce” (spread) and on their resistance to being displaced by other ideas. The survival fitness of 
our explanation as a meme will be greatly enhanced if, in addition to providing superior correctness, 1t inocu- 
lates its hosts (the audience) against the errors they are likely to encounter in other explanations. To do that, our 
explanation should discuss some of the most common errors and explicitly call attention to what is needed to 
avoid them. 

The basic explanation that follows is intended to stand alone and therefore repeats in simplified form some 
of the background in other parts of the book and some of the critiques of other explanations in Section 7.3.1. 
It comes as close to achieving the desired attributes stated above as I have been able to manage. It is longer 
than other explanations, but I have not found any way to shorten it significantly without compromising either 
completeness or correctness. 


7.3.3 A Basic Explanation of Lift on an Airfoil, Accessible 
to a Nontechnical Audience 


When a relatively thin, flat /ifting surface such as a wing, a sailboat sail, or a shark's fin moves through air or 
water, it can produce a force perpendicular to its direction of motion. This force is called /ift, whether or not it 
is in an upward direction. 

Mathematical theories of lift have been agreed on by the experts since the early twentieth century, but there 
has been a long history of disagreement on how to explain lift in simple physical terms, without math. Over the 
last 100 years or so, many different nonmathematical explanations have appeared in books, popular magazines, 
pilot-training materials, museum exhibits, and so forth. These explanations follow a variety of approaches, but 
they generally try to make lift simpler than it really is, explaining too little about the flow and leaving import- 
ant parts of the phenomenon unexplained. We'll consider the common examples below under “How the main 
popular explanations are incomplete.” And some explanations resort to erroneous arguments to explain features 
of the flow. These are discussed below under “Popular misconceptions.” Naturally the existence of numerous 
different explanations has been a source of confusion and controversy. 

The following is a nonmathematical explanation of lift that attempts to resolve the controversies and to be 
scientifically complete and correct. It is necessarily more complicated and longer than earlier popular explana- 
tions. 


7.3.3.1 Airfoil Shape and Angle of Attack 


The cross-sectional shape of a lifting surface, as illustrated in Figure 7.3.7, 1s called an airfoil. How much lift 
a lifting surface produces depends on the shape of the airfoil, on the angle of attack at which it approaches the 
oncoming flow, and on flow speed and density. A positive angle of attack means that the /eading edge (front) of 
the airfoil is positioned higher than the trailing edge (back), relative to the direction of the oncoming flow, as in 


Figure 7.3.7. 


Figure 7.3.7 The cross-sectional shape of a wing is called an airfoil, and the angle at which oncoming air ap- 
proaches it 1s called the angle of attack 





Almost any shape, as long as it is not too thick, will work as an airfoil and produce some lift when the angle 
of attack is in the right range. The main types of airfoil shapes are shown in Figure 7.3.8. A simple flat plate 
shape (a) 1s used on many toy gliders. The Wright brothers used a curved-plate airfoil shape (b) for the wings 
of their early airplanes because they found that adding curvature, or camber, increased lift. A toy glider will 
also fly much better with a curved-plate airfoil than with a flat-plate airfoil. Sailboat sails also generally take 
on the curved-plate form. Finally, larger model airplanes and full-size airplanes use streamlined airfoil shapes 
that are rounded at the /eading edge and sharp at the trailing edge, (c) and (d). Aerobatic airplanes often use 
the symmetrical type (c), which works equally well right-side-up or inverted, while most other airplanes use 
streamlined airfoils with camber (d). Streamlined airfoils, especially those with camber, can produce more lift 
with less drag than other shapes, and they enable an airplane to fly over a wider range of speeds. Birds' wings 
tend to have a combination of the curved-plate and streamlined shapes (e). 


Figure 7.3.8 Types of airfoil shapes. (a) Flat plate. (b) Curved (cambered) plate. (c) Symmetrical streamlined 


shape. (d) Cambered streamlined shape. (e) Bird's wing oe 


(a) (b) 


7.3.3.2 The Airfoil Reference Frame 


The phenomenon of lift generation is the same regardless of what reference frame we view it from. We could 
watch the airfoil move through the air, but everything is easier to understand if we imagine ourselves moving 
along with the airfoil, so that the airfoil appears to us to be standing still and the fluid appears to flow past. And 
for simplicity we'll refer to the “upper” and “lower” surfaces of the airfoil assuming that the wing is positioned 
horizontally and is lifting upward. 


7.3.3.3 Lift Involves Action and Reaction (Newton's Third Law) 


An intuitive way to imagine that lift is possible is to think of the airfoil shape and angle of attack as working 
together so that the airfoil pushes downward on the fluid as it flows past. The fluid must then push back with an 
equal and opposite (upward) force, which 1s the lift. This is an example of Newton's third law, that every action 
has an equal and opposite reaction. Thus lift 1s an interaction in which the airfoil and the fluid exchange equal 
and opposite forces. The aspects of the interaction that need to be explained further have to do with how the 
moving fluid actually pushes back. 


7.3.3.4 Lift Is Felt as a Pressure Difference on the Airfoil Surfaces 


A fluid always exerts pressure, which just means that it always pushes against itself and against any surface that 
it touches. When the fluid around an airfoil is at rest, the pressure 1s practically the same everywhere, so that it 


pushes upward on the lower surface and downward on the upper surface equally, and there is no lift. When the 
fluid is moving and there is lift, the fluid exerts the lift force directly on the airfoil's surfaces as a difference in 
pressure: higher pressure on the lower surface than on the upper surface. Under lifting conditions, the average 
pressure on the lower surface is usually higher than ambient unless the airfoil is very thick, and the average 
pressure on the upper surface is always lower than ambient. 

How large is this pressure difference, typically? In the atmosphere at sea level, the ambient pressure is about 
2100 pounds per square foot. Compared with this, it doesn't take much of a pressure difference to provide a 
practical amount of lift. Even the heaviest airplane requires no more than about 150 pounds of lift per square 
foot of wing area. So the pressure difference that an airfoil must produce to support any airplane is much less 
than the “background” atmospheric pressure, and even when a wing is lifting, the pressure on the upper surface 
pushes down almost as hard, in absolute terms, as the pressure on the lower surface pushes up. The difference 
is what counts, and even a relatively small pressure difference, spread over a large enough area, can lift a 747. 


7.3.3.5 Lift Involves Force and Acceleration (Newton's Second Law) 


Explaining how the flow maintains the pressure difference described above requires looking at the forces exer- 
ted on the air and the resulting accelerations of the air, not just at the surface of the airfoil, but in an extended 
region around the airfoil. In the explanation below, we first identify the major features of the flow that are essen- 
tial to maintaining the pressure difference, and then we consider how the whole combination satisfies Newton's 
second law. 


The outline of the explanation is as follows: 


¢ The fluid flows as if it were a continuous material that deforms to follow the contours of the airfoil. 

¢ The airfoil affects the direction and speed of the flow within a deep swath above and below the airfoil in 
what is called a velocity field. Flow above and below the airfoil is deflected downward. Flow above the 
airfoil always speeds up, and flow below usually slows down. 

¢ The airfoil affects the pressure over a wide area in what is called a pressure field. When lift is produced, a 
diffuse cloud of low pressure always forms above the airfoil, and a diffuse cloud of high pressure usually 
forms below. Where these clouds touch the airfoil they constitute the pressure difference that exerts the 
lift on the airfoil. 

¢ The velocity field and the pressure field support each other in a reciprocal cause-and-effect relationship, 
an interaction in accordance with Newton's second law of motion. 


Let's look at each part in more detail. 


7.3.3.6 The Fluid Flows as if It were a Continuous Material that Deforms 
to Follow the Contours of the Airfoil 


Fluids such as air and water consist of huge numbers of individual molecules that move randomly in all direc- 
tions, even when the fluid appears to be at rest. In water, molecules are in constant contact with their neighbors 
as they move. In air, they are not in constant contact, but they collide frequently and travel only very short dis- 
tances between collisions. If the molecules didn't have this random motion and never interacted with each other, 
they would fly directly into the forward-facing parts of the airfoil and not touch the aft-facing parts, as in Figure 
7.3.9. Instead, because of the random motion and the frequent contact between molecules, the fluid flows as if it 
were a continuous material. It deforms and changes course to flow around the airfoil, and it fills all of the space 
around the airfoil and touches all of its surfaces, as illustrated in Figure 7.3.10. 


Figure 7.3.9 If molecules had no random motion and never interacted with each other, they would fly directly 
into the forward-facing parts of the airfoil and not touch the aft-facing parts 





Figure 7.3.10 Because of the random motion and many collisions among its molecules, a fluid actually flows 
like a continuous material, deforming and changing course to flow around the airfoil, in what is called a flow- 


field 


7.3.3.7 The Airfoil Affects the Direction and Speed of the Flow within a 
Deep Swath above and below the Airfoil in What Is Called a Velocity Field. 
Flow above and below the Airfoil Is Deflected Downward. Flow above the 
Airfoil Always Speeds Up and Flow below Usually Slows Down 


Because the fluid deforms continuously as it flows, changes in direction are gradual, and the speed and direction 
of the flow vary over a wide area around the airfoil. A spread-out pattern of variations like this is often referred 
to as a flowfield or velocity field. 

The airfoil's solid surface forces the flow very close to it to follow the direction of the airfoil contour, with 
the result that the speed and direction of the flow are affected over a wide area. And the flow 1s thus affected by 
both the airfoil shape and angle of attack. 

When an airfoil produces lift, 1t deflects the fluid flow downward, as indicated by the streamlines sloping 
downward to the right in Figure 7.3.10 and by the downward-turning arrows in Figure 7.3.11. To produce the 
downward turning, the aft portion of the airfoil surfaces must have a predominantly downward slope. Thus to 
produce lift, the airfoil must have either camber or a positive angle of attack, or a combination of the two. 


Figure 7.3.11 Illustration of the pressure field and the velocity field around a lifting airfoil. Minus signs indic- 
ate pressure lower than ambient, and plus signs indicate pressure higher than ambient. The tighter the spacing 
of the symbols, the larger the pressure difference 
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It seems obvious that downward-sloping airfoil contours should force the flow very close to the surface to 
be deflected downward, but it is important to understand that it is not just the flow close to the surface that 1s 
affected, and that the downward turning is spread over a deep swath of flow above and below the airfoil. The 
amount of downward turning is greatest close to the airfoil and dies away gradually far above and below. 

In addition to the downward turning, the flow above the upper surface is always speeded up, as indicated by 
the longer arrow above the airfoil in Figure 7.3.11, and the flow below the airfoil is usually slowed down, as 
indicated by the shorter arrow. Like the downward turning described above, this effect is also strongest close to 
the airfoil surfaces and dies away gradually far above and below. 


7.3.3.8 The Airfoil Affects the Pressure over a Wide Area in What Is 
Called a Pressure Field. When Lift Is Produced, a Diffuse Cloud of Low 
Pressure Always Forms above the Airfoil, and a Diffuse Cloud of High 
Pressure Usually Forms below. Where These Clouds Touch the Airfoil 
They Constitute the Pressure Difference That Exerts the Lift on the Airfoil 


Because the fluid moves as if it were a continuous material, the airfoil influences much more fluid than its 
surfaces touch, producing pressure changes over a wide area. When an airfoil produces lift, there is always a 
diffuse cloud of low pressure above the airfoil, and there is usually a diffuse cloud of high pressure below, as 
illustrated by the clouds of minus signs and plus signs in Figure 7.3.11. Note that the minus signs don't mean 
that the pressure is negative, just that it is lower than ambient. These pressure differences are generally largest 
at the airfoil surface, where they actually exert the lift force, and away from the airfoil they die away gradually 
in all directions: above, below, ahead, and behind. A spread-out pattern of pressure differences like this is often 
called a pressure field. 


7.3.3.9 The Velocity Field and the Pressure Field Sustain Each Other in 
a Reciprocal Cause-and-Effect Relationship, an Interaction in Accordance 
with Newton's Second Law of Motion 


The downward turning of the flow, the changes in flow speed, and the clouds of low and high pressure described 
above are all necessary for the production of lift. They support each other in a reciprocal cause-and-effect re- 
lationship, and none would exist without the others. The pressure differences exert the lift force on the airfoil, 
while the downward turning of the flow and the changes in flow speed sustain the pressure differences. 

Although the clouds of low and high pressure depicted in Figure 7.3.11 don't have sharply defined bound- 
aries, they are still effectively “confined” to a limited area, both vertically and horizontally. “Sustaining” the 
pressure differences essentially means maintaining this spatial “confinement” in both the vertical and horizontal 
directions. Neither direction can be said to be more important than the other. Both are essential. 

To understand how this “confinement” of the pressure differences works, we start by imagining the flow to 
be divided into tiny “parcels” of fluid passing through the region around the airfoil. The interaction these parcels 
are involved in reflects Newton's second law of motion, a general relationship among mass, force, and motion. 
Every parcel is subject to Newton's second law because the fluid in the parcel has mass, and because the parcel's 
neighbors can exert a net force on it, through differences in pressure. When the pressure is higher on one side of 
a parcel than on the other, the forces pushing the parcel in opposite directions are not balanced, and there is a net 
force pushing the parcel in the direction from higher pressure to lower pressure. Examples of this are illustrated 


in Figure 7.3.12. 





Figure 7.3.12 Examples of how pressure differences exert net forces on fluid parcels. (a) Pressure higher on 
left: Net force to the right. (b) Pressure higher above: Net force downward. (c) Pressure higher on right: Net 
force to the left 


(a) 
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Net forces on fluid parcels can also come from internal friction due to the viscosity of the fluid. However, 
when the flow around an airfoil follows both surfaces all the way to the trailing edge, as it usually does, viscosity 
has only a small effect on the overall flow pattern, and for purposes of understanding lift, we need only consider 
the forces due to pressure differences. 

Newton's second law tells us that when a pressure difference imposes a net force on a fluid parcel, it must 
cause a change in the speed or direction (or both) of the parcel's motion. But in fluid flows, this cause-and-effect 
relationship not a one-way street. The pressure difference causes a change in the parcel's motion, but the exist- 
ence of the pressure difference depends on the parcel's motion. The relationship is thus mutual, or reciprocal: A 
fluid parcel changes speed or direction in response to a pressure difference, and its resistance to changing speed 
or direction (its inertia, due to its mass) sustains the pressure difference. The pressure difference can exist only 
if something 1s there to “push back,” and what pushes back 1s the inertia of the fluid, which is why the mass of 
the fluid is so important. 

To show how this applies in the airfoil flowfield, Figure 7.3.13 illustrates how typical fluid parcels are acted 
upon by the pressure in different parts of the field. In different parts of the high- and low-pressure clouds, the 
pressure is increasing or decreasing in different directions, exerting forces on the fluid parcels as shown by the 
arrows. 





Figure 7.3.13 Pressure differences exert net forces on fluid parcels in different parts of the flowfield as indic- 
ated by the block arrows, and these forces are balanced by accelerations of the fluid parcels in the same direc- 
tions 





+ 
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Fluid parcels directly above and below the airfoil see higher pressure above them than below and thus see a 
net force that is mostly downward. The downward force is resisted by the downward acceleration, or downward 
turning, of the flow. This interaction between pressure and downward turning 1s reciprocal: The pressure differ- 
ence acting on a fluid parcel causes the parcel's path to be deflected downward, and the pressure difference is 
sustained because the fluid parcel has mass and therefore resists having its path deflected from a straight line. 

On the upper left, where flow is entering the region of low pressure above the airfoil, the net force on a 
parcel is from left to right, which 1s resisted by an acceleration in the direction of the flow, or an increase in 
flow speed. On the upper right, where flow is leaving the region of low pressure, it sees a pressure difference in 
the opposite direction and is slowed down. Thus a parcel passing through the cloud of low pressure above the 
airfoil is speeded up and then slowed back down. On the other hand, a parcel passing through the cloud of high 
pressure below the airfoil experiences the opposite sequence: It is slowed down and then speeded back up. 

These changes in flow speed follow the relationship that is often referred to as Bernoulli's principle, which 
can be expressed mathematically as Bernoulli's equation. Bernoulli's principle applies to regions of steady flow 
that have not been affected significantly by viscous friction. There is a small part of the airfoil flowfield in 
which Bernoulli's principle doesn't apply, and that is in the thin viscous boundary layer next to the airfoil surface 


and a thin viscous wake downstream, where the flow is strongly affected by viscosity and is usually turbulent. 
However, the pressure field is largely determined in the larger part of the flowfield in which viscous effects 
are small, outside of the boundary layer and wake. There, Bernoulli's principle applies, according to which low 
pressure means high speed, and high pressure means low speed. This relationship is reciprocal: The differences 
in pressure in the horizontal direction cause the changes in flow speed, and the fluid's resistance to acceleration, 
because of its mass, sustains the pressure differences. 

To summarize these interactions: The downward turning of the flow provides the vertical “confinement” of 
the clouds of low and high pressure, and the changes in flow speed provide the horizontal “confinement. ”’ 

Thus downward deflection of the flow and different flow speeds above and below the airfoil are both es- 
sential parts of the reciprocal cause-and-effect relationship that sustains (or “confines”’) the pressure differen- 
ces. The pressure differences cause the flow to change speed and direction, and the changes in flow speed and 
direction cause the pressure differences to be sustained. This circular cause-and-effect might seem a bit like 
“something for nothing” or “perpetual motion,” but it's not. The pressure differences follow naturally from the 
Newton's second law and from the fact that the flow close to the surface is forced to follow the predominantly 
downward-sloping contours of the airfoil. And of course the fact that the fluid has mass 1s crucial to the interac- 
tion. 

In a way, it is surprising that mass is so important, given that air feels so light to our senses. However, air 
actually has more mass (and thus weight) than our intuition gives it credit for. In air at sea level, a volume of 
only about 14 cubic feet (a cube 2.4 ft on a side) weighs a pound. Thus even a modest volume of air contains 
significant mass, and an object moving through the air can influence a large mass of air in a short time. For 
example, the wing of a small airplane (a two-seat Cessna) flying at 100 mph causes significant changes 1n speed 
and direction in thousands of pounds of air every second. 


7.3.3.10 The Roles of Camber, a Sharp Trailing Edge, and a Rounded 
Leading Edge 


The downward-turning action that an airfoil produces is affected by both the airfoil shape and the angle of at- 
tack. The airfoil surfaces themselves need not be curved, and even the flat-plate shape of Figure 7.3.8a can pro- 
duce downward turning when it has an angle of attack. However, adding camber to the airfoil shape, as in Fig- 
ure 7.3.8b,d, aligns the airfoil surfaces better with the desired curved flow and enhances the downward-turning 
action. As a result, cambered shapes in which the upper surface 1s more convex than the lower surface produce 
more lift at a given angle of attack than shapes without camber. 

What happens in the flow near the trailing edge plays an important role in controlling the downward-turning 
action. Figures 7.3.14 and 7.3.15 illustrate streamline patterns for low-to-medium angles of attack. Note that in 
both of these cases the flow from both the upper and lower surfaces leaves the airfoil smoothly from the trailing 
edge. If it didn't do this, the flow from one side would have to go around the trailing edge to the other side, as in 
Figure 7.1.3a. But viscosity generally prevents flow from going around a sharp edge in this way. So, with some 
help from viscosity, the trailing edge has the effect of directing the flow as it leaves the airfoil to flow in the 
direction of the contours of the aft part of the airfoil. This tendency of the trailing edge to direct the flow is a 
major reason why angle of attack and camber are so effective in influencing lift. 


Figure 7.3.14 Streamline pattern at a low angle of attack: flow attaches near leading edge and follows the up- 


per surface 
Figure 7.3.15 Streamline pattern at a medium angle of attack: flow attaches below the leading edge, flows 
around the leading edge, and follows the upper surface 


The situation at the leading edge is very different. Referring again to Figures 7.3.14 and 7.3.15, note that 
the flow around the leading edge changes dramatically when the angle of attack is increased. At a low angle of 
attack (Figure 7.3.14) the flow approaches the leading edge almost head on, while at a higher (medium) angle 
of attack (Figure 7.3.15) it approaches the leading edge from below and flows around the leading edge from the 
lower surface to the upper surface. In fact, the reason for having a rounded leading edge is to accommodate the 
different flow patterns at different angles of attack with a minimum of disruption. 


7.3.3.11 How Lift is Controlled 


Lift depends on the speed of the flow. An increase in speed increases the lift by increasing both the amount 
of fluid that the airfoil influences in a given time and the downward acceleration imparted to each bit of fluid. 
These two effects multiply each other, so that lift 1s approximately proportional to the square of the speed, at a 
given angle of attack. 

Lift can also be increased by an increase in either the camber or the angle of attack, as long as the flow fol- 
lows both surfaces of the airfoil, as it usually does at low-to-moderate angles of attack, as shown in Figures 
7.3.14 and 7.3.15. For a given airfoil shape, this increase in lift continues until an angle of attack is reached 
above which the effects of viscosity prevent the flow from following the upper surface. The flow then breaks 
away, as shown in Figure 7.3.16, which is called stalling. As an airfoil passes through the stalling angle of at- 
tack, the lift generally decreases dramatically, though it doesn't disappear altogether. 


Figure 7.3.16 Flow around an airfoil at a high angle of attack breaks away from the upper surface, and the air- 
foil is said to be stalled 


Controlling lift through angle of attack 1s an essential ingredient in achieving controlled flight over a range 
of speeds. For steady level flight, the total lift on an airplane must be kept equal to the weight. When an airplane 
speeds up, it reduces its angle of attack in order to keep the lift from exceeding its weight. When an airplane 
slows down, it increases its angle of attack in order to keep the lift equal to the weight, but this works only up 
to the stalling angle of attack. Thus an airplane can slow down only so much, to what is called the stall speed, 
which is the minimum speed at which an airplane can fly steadily. 

Lift also depends on the density of the fluid. A decrease in density decreases lift at a given angle of attack 
and speed by decreasing the mass of the fluid the airfoil acts on. Thus when an airplane flies at high altitude 
where air density 1s lower, it must either fly faster or increase its angle of attack in order to maintain lift equal 
to the airplane's weight. 

Lift can also be controlled by what is called a control surface, which 1s just an aft portion of an airfoil that 
is hinged so that it can be deflected up and down, as shown in Figures 7.3.17 and 7.3.18. Deflecting a control 
surface changes both the effective angle of attack and the camber of the airfoil and thereby changes the flow- 
turning action. Examples of control surfaces are the e/evators on the horizontal tail, which can pitch an airplane 
nose up or nose down, the ailerons near the wing tips, which can rol/ an airplane to one side or the other, and 
the rudder on the vertical tail, which can yaw an airplane in one direction or the other. 





Figure 7.3.17 A control surface deflected upward to decrease lift 








Figure 7.3.18 A control surface deflected downward to increase lift 


A flap is similar to a control surface and is deflected downward at the trailing edge of a wing to increase both 
the lift at a given angle of attack and the maximum lift that can be generated without stalling. Flaps enable an 
airplane to take off and land at lower speeds than it could without flaps. A flap often has one or more s/ots that 
direct air from the lower surface to flow over the aft upper surface, as shown in Figure 7.3.19, which helps to 
delay stalling. 


Figure 7.3.19 A flap deflected downward to increase lift, with a slot to allow air to flow from the lower sur- 
face to the upper surface 






Pine. 


Leading-edge devices such as leading-edge flaps or slats, as shown in Figure 7.3.20, also help increase max- 
imum lift and allow slower takeoff and landing speeds. 


Figure 7.3.20 A leading-edge slat deflected downward to increase maximum lift 





Spoilers are devices that hinge upward from the upper surface of a wing, as shown in Figure 7.3.21, disrupt- 
ing the upper-surface flow by deflecting it upward, and causing a large decrease in lift and an increase in drag. 
When spoilers are used in flight, the purpose is not to decrease lift, but to increase drag and allow the airplane 
to descend steeply without speeding up. When spoilers are use in this way, the decrease in lift caused by the 
spoilers is compensated by an increase in angle of attack. When spoilers are used on the ground during the land- 
ing roll, the purpose is both to increase drag and to decrease the lift on the wing, putting more of the airplane's 
weight on the wheels to allow harder braking without skidding. 





Figure 7.3.21 A spoiler deflected upward to disrupt the upper surface flow, decreasing lift and increasing drag 





7.3.3.12 How the Main Popular Explanations Are Incomplete 


The two most widely circulated explanations of lift are the momentum based (Newtonian) and the Bernoulli 
based. Both contain elements of the truth, but neither provides a complete explanation. 

The momentum-based explanation generally goes as follows: When an airfoil produces lift, it deflects the 
flow downward, as indicated by the streamlines sloping downward to the right in Figure 7.3.10. Newton's 
second law tells us that to deflect the flow downward, the airfoil must push downward on the fluid. Then New- 
ton's third law tells us that for every action there is an equal-and-opposite reaction, and the fluid must therefore 
push upward on the airfoil, and thus there is lift. This explanation is correct as far as it goes, but 1s incomplete in 
several ways. First, it doesn't point out that the force 1s actually transmitted to the airfoil by the pressure, and it 
doesn't explain how the airfoil can impart downward turning to a much deeper swath of the flow than it actually 
touches. Then although it hints at explaining how the pressure differences 1n the vertical direction are sustained, 
it doesn't explain at all how the pressure differences in the horizontal direction are sustained. That is, it leaves 
out the Bernoulli part of the interaction. 

A Bernoulli-based explanation starts by arguing that the flow over the upper surface is speeded up, either be- 
cause the path length over the upper surface is longer, or because of an “obstacle,” “hump,” or “Venturi” effect. 
Because of the higher speed, the pressure over the upper surface must be lower, according to Bernoulli's prin- 
ciple, and thus there is lift. Explanations of this type are incomplete in that they don't adequately or correctly 
explain what causes the flow to speed up. The longer-path-length explanation is simply wrong (see below, under 
“Popular misconceptions’’). The “obstacle,” “hump,” or “Venturi” explanations are better, but only a little. They 
often mention “pinching” or “necking down” of the flow over the upper surface, but they don't provide a con- 
vincing physical explanation for the pinching. A common fault in all of these explanations is that they imply 
that a speed difference can arise from causes other than a pressure difference, and that the speed difference then 
causes, or at least implies, a pressure difference, according to Bernoulli's principle. This implied one-way caus- 
ation 1s a misconception that we'll discuss below. Finally, a Bernoulli-only explanation doesn't really explain 


how the pressure differences in the vertical direction are sustained. That 1s, it leaves out the downward-turning 
part of the interaction. 

There have been at least three schools of thought among proponents of the momentum-based and Bernoulli- 
based explanations. One is that only one or the other can be correct. Another is that both are correct, and that 
they apply to two different kinds of lift. A third is that both are correct, and either one suffices to explain lift in 
general. These are all misconceptions that we'll discuss next. 

A third major category of explanation involves circulation around the airfoil. An explanation of this type 
starts with the observation that when an airfoil starts its motion through the fluid, a starting vortex 1s left be- 
hind. The formation of the starting vortex is accompanied by the establishment of a “circulatory flow,” or cir- 
culation, around the airfoil, which is responsible for the lift according to an aerodynamic theory known as the 
Kutta-Joukowski theorem. This is not a proper physical explanation for two reasons. First, it depends on sev- 
eral advanced mathematical and aerodynamic theorems instead of direct physical arguments. Second, although 
the argument is mathematically and logically correct, its general progression does not reflect physical cause 
and effect, and is therefore misleading. It implies that lift is somehow caused by the formation of the starting 
vortex and the resulting circulation. The starting vortex and the circulation are actually more properly seen as 
byproducts of the lift than as causes. 


7.3.3.13 Popular Misconceptions 


We've seen that lift generation involves subtle cause-and-effect relationships, so it shouldn't be surprising that 
many of the attempts to explain it to a popular audience have made errors of one kind or another. To solidify 
our understanding and make it less likely we'll be taken in by incorrect ideas, let's identify some of the miscon- 
ceptions and consider where they went wrong. 


One-way causation: This is a misconception we've already discussed, that a velocity difference can be de- 
duced first, based on some argument that does not depend on the pressure, and that a pressure difference 
follows, according to Bernoulli's principle. 


This implication that the causation runs in only one direction is not consistent with the physics of fluid flows. 
If you try to explain a speed difference without referring to the pressure difference, you'll inevitably get the reas- 
ons for the speed difference wrong. One example of this, an erroneous reason for high velocity over the upper 
surface of an airfoil, is the next item on our list. 


Longer path length and equal transit time: This is an argument that is widespread in explanations aimed at 
the layman. It is assumed that the upper surface of the airfoil is more convex than the lower surface, and 
that the path the fluid must follow around the upper surface is therefore longer than the path around the 
lower surface. It is further assumed that fluid parcels that are split apart at the leading edge to traverse the 
upper and lower surfaces must rejoin at the trailing edge. Thus fluid parcels negotiating both paths must do 
So in equal time, and we conclude that the velocity over the upper surface must be higher than that over the 
lower surface. 


First, this isn't a proper kind of physical explanation. Just saying that something has to arrive somewhere at 
a particular time doesn't explain why the thing might speed up. To explain why something speeds up, we must 
identify and explain the force that makes it speed up. And this explanation is wrong on another level. There is 
no reason why fluid parcels that start together ahead of the airfoil must rejoin at the trailing edge, and in fact, 
they generally don't. A parcel that traverses the airfoil near the upper surface typically arrives at the trailing edge 
well ahead of one that traverses near the lower surface. So no difference in path length is required, and there are 
many situations in which lift 1s produced without a difference. And on airfoils where there is a difference, it is 
typically much too small to explain the speed difference that actually occurs when lift is produced. 


Bernoulli is applicable, and Newton is not, or vice versa: Some proponents of the Bernoulli-based explan- 
ations argue that lift is produced solely by a pressure difference, according to Bernoulli's principle, and that 
there is no downward momentum imparted to the fluid (Newton). Some proponents of momentum-based 
explanations argue the opposite: that imparting momentum is everything (Newton), and that the Bernoulli 
principle is not applicable. 


We've seen that a pressure difference between the upper and lower surfaces and downward turning of the 
flow are both essential parts of the picture. 


Bernoulli and Newton are both right, and they explain two different kinds of lift: This line of argument 
maintains that “Bernoulli lift” and “reaction lift” represent two distinct physical mechanisms. 


There is only one kind of lift, and explaining it requires both a pressure difference and downward turning. 


Bernoulli and Newton are both right, and either one suffices: According to this line of argument, the 
Bernoulli-based and the momentum-based explanations are just different but equivalent ways of looking at 
the same thing. 


Again, we've seen that a complete explanation must include both the pressure difference and the downward 
turning of the flow. 


Invoking the Coanda effect as the reason the flow is able to follow the curved surfaces of the airfoil: Some 
explanations argue that viscosity plays a crucial role in enabling the flow to turn and follow the curved 
upper surface of the airfoil. They refer to this purported coupling between viscosity and flow turning as the 
Coanda effect. 


This reflects a misunderstanding of the role of viscosity 1n fluid flows and of what the Coanda effect actually 
entails. As fluid flows over the surface of an airfoil, there 1s no direct coupling between viscosity and flow turn- 
ing, and none is needed. Viscosity plays a significant role in lift generation only in the immediate vicinity of the 
airfoil trailing edge, by preventing the flow from going around the trailing edge from the lower surface to the 
upper surface. The real Coanda effect refers to the tendency of a turbulent jet flow with higher energy than the 
surrounding fluid to attach itself to an adjacent surface and to follow the surface even if it is curved. This is not 
so much a viscous effect as it is an indirect effect of the jet turbulence. It arises because of the tendency of jet 
flows to entrain surrounding fluid, and it plays no role in ordinary airfoil flows. 


The low pressure on the upper surface pulls upward on the airfoil: Many popular explanations of lift de- 
scribe the effect of low pressure in these terms. The idea of the airfoil being pulled upward has a strong 
intuitive appeal, but it is incorrect. 


Pressure, especially in gasses such as air, is always a push, never a pull. The pressure on the upper surface of 
an airfoil pushes downward on the airfoil, but the higher pressure on the lower surface pushes upward harder, 
and the net effect is lift. 


Not acknowledging the importance of angle of attack: Some explanations, such as the one based on the 
longer-path-length-and-equal-transit-time argument, never mention the angle of attack. 


The angle of attack is a key factor that determines how much lift an airfoil produces at a given flow speed 
and is an essential ingredient in achieving controlled flight. 


7.3.3.14 Why There Have Been So Many Misconceptions 


Explaining lift in physical terms is more difficult than most people realize, and the difficulty is inherent in the 
basic nature of fluid mechanics. We are dealing with countless little parcels of fluid that move in coordination 
with their neighbors and exert forces on their neighbors, all while separately and simultaneously obeying New- 


ton's second law. It simply isn't possible to look at an airfoil and deduce, by mental effort alone, what flow 
pattern satisfies the physical laws everywhere at once. There are too many simultaneous relationships to keep 
track of. And the difficulty 1s compounded by the circular nature of the cause-and-effect relationship between 
pressure and velocity. 

This kind of complexity isn't easy to deal with mathematically, either. Mathematically expressing all the re- 
lationships a fluid flow must satisfy results in a set of partial-differential equations called the Navier-Stokes 
equations. By solving these equations we can predict in detail what the flow around an airfoil does and how 
much lift is produced. But solving the equations means mathematically determining how the pressure and the 
flow velocity vary throughout a large volume of space surrounding the airfoil. For any given flow situation it 
requires lengthy calculations that are practical only on a high-speed computer. Computer programs are available 
that can make such calculations routinely. However, all they do is provide a simulation of what would happen 
in a real flow; they don't provide a physical explanation of how it happens. 

Simplified theories have also been developed, such as potential-flow theory, which ignores viscosity, and the 
Kutta-Joukowski theorem, which relates lift to a circulatory component of flow (circulation) around the airfoil, 
but these don't provide a direct physical explanation for lift either. 

So we see that predicting the existence of lift using nothing but the properties of the fluid and the laws of 
physics would essentially require solving the NS equations or the potential-flow equation for the flow around 
the airfoil, which is not something we can do in our heads. Explaining what happens, with words and simple 
diagrams instead of laborious calculations, thus requires some prior knowledge of what the flow does. The ex- 
planation above started with knowledge of some basic features of the pressure and velocity fields around an 
airfoil, and then showed how the pressure field and the velocity field support each other in a manner consistent 
with the laws of physics, including the proper reciprocal cause-and-effect relationships. 

Faulty explanations often assume too little prior knowledge and then try to do more than 1s logically possible 
by mental effort alone. As a result, they tend to leave important things unexplained (such as what really causes 
the high velocity over the upper surface) and to resort to logical fallacies such as one-way causation. 


7.3.4 More Physical Details on Lift in 2D, for the 
Technically Inclined 


The above explanation of the interaction between the pressure field and the velocity field explicitly distin- 
guished between vertical and horizontal confinement of the pressure differences. Readers with technical back- 
grounds will recognize this as one way of explaining to a lay audience an interaction involving a vector equa- 
tion. In this case, the equation is the Euler momentum equation in 2D. Of course this equation has two compon- 
ents, both of which must be satisfied, and thus the necessity to explain two components of the interaction. 

Above, in trying to explain why there have been so many misconceptions, we claimed that predicting lift 
strictly from first principles can't be done just through mental effort, that it requires laborious computation. To 
avoid going into too much detail for a lay audience, we deliberately neglected to mention that simplified the- 
ories such as conformal mapping or linear theory can greatly reduce the required computational effort. This 1s 
a justified omission because the simplified theories are still heavily mathematical, and they provide no aid to 
physical understanding by a nontechnical audience. But we will make use of the linear theory to further our own 
understanding of airfoil pressure distributions in Section 7.4.1. 

The main thrust of our basic explanation was that lift is transmitted to the airfoil by a pressure difference 
between the upper and lower surfaces and that it requires at the same time a downward turning of the air stream 
above and below the airfoil. We ignored the upward turning, both ahead of the airfoil and behind, that accom- 


panies lift in general and can be seen in Figure 7.1.3b. Referring again to Figure 7.3.11, note that the flow ap- 
proaching the front of the airfoil sees higher pressure below it than above, and thus must receive a net push 
upward. It therefore turns upward, as indicated by the upward-curving streamlines ahead of the airfoil in Figure 
7.1.3b. Then comes the downward turning above and below the airfoil that we've already discussed and con- 
nected with the generation of lift. Finally, the air behind the airfoil feels lower pressure above than below, and 
is turned upward again, as shown by the curving streamlines behind the airfoil, but only enough to cancel the 
downward direction 1t acquired as it passed the airfoil. 

The flow angle actually approaches zero far from the airfoil, behind the airfoil as well as ahead. Does this 
mean that the downward turning that we associate with the lift has been canceled by the upward turning? Well, 
yes, but only in terms of the local flow direction. The net momentum change imparted to the air by the airfoil 1s 
still downward and still accounts for the lift. 

To explain how this can be, we start with the fact that lift requires circulation, according to the Kutta- 
Joukowski theorem. An airfoil flow with circulation effectively consists of a uniform flow with disturbances 
associated with a collection of bound vorticity superimposed on it. In the simplest way of deriving the Kutta- 
Joukowski theorem, we define a control volume as shown in Figure 7.3.22, with vertical outer boundaries at the 
front and back, extending to infinity vertically. Conservation of momentum requires that the lift on the airfoil 
be balanced by the forces and momentum fluxes at these outer boundaries. Because the boundaries are vertical, 
there is no net vertical force contribution by the pressure, and the lift must be accounted for by the net flux of 
vertical momentum into the control volume. It can be shown that as long as the two faces of the control volume 
enclose all of the bound vorticity, the flux of upward momentum into the front face and the flux of downward 
momentum out of the back face account for half the lift each, regardless of the distances from the airfoil to the 
faces. And from this, we also arrive at a simple derivation of the Kutta-Joukowski theorem, Equation 7.2.1. 


Figure 7.3.22 In the simplest derivation of the Kutta-Joukowski theorem, the farfield flow around an airfoil is 
modeled as a potential vortex superimposed on a uniform flow, and a control volume with vertical faces 1s 
used to assess the momentum flux due to lift 
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So the flux of vertical momentum through either face of the control volume of Figure 7.3.22 remains the 
same, regardless of how far away from the airfoil the face 1s placed. This is true in spite of the fact that the 
vertical velocity goes to zero at large distances. As either face is moved farther from the airfoil, the maximum 
vertical velocity decreases, but the distribution becomes more spread out vertically. This is illustrated in Figure 
7.3.23 by plots of the downwash velocity -v on vertical cuts at different distances upstream and downstream 
of the airfoil. Upstream of the airfoil, the curves are to the left of their respective axes, which, given that we 
are plotting -v, indicates upwash. Downstream of the airfoil, the curves are to the right of their axes, indicating 
downwash. With increasing distance from the airfoil either upstream or downstream, the vertical velocity in the 
middle of the distribution decreases, while the vertical velocity in the “tails” of the distribution, above and be- 
low the level of the airfoil, increases, and the integrated flux of vertical momentum remains the same. 


Figure 7.3.23 An illustration of the distributions of vertical velocity on vertical cuts at different distances 
ahead of an airfoil and behind. The vertical velocity is plotted as -v, so that in the upwash ahead of the airfoil 
the curves appear to the left of their respective axes, and in the downwash behind the airfoil, the curves appear 
to the right of their axes. The 45° dashed lines indicate the approximate dividing lines between upward turning 
and downward turning outside of the nearfield 
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Now it 1s clear how it can be that the upward turning of the flow (the reduction in the downwash velocity) aft 
of the airfoil trailing edge does not reduce the total downward momentum crossing successive vertical planes 
as we move downstream. It is because the upward turning is always offset by downward turning (the increase 
in downwash velocity) that is still taking place far above and below the airfoil. Outside of the near field, the 
boundary between the upward turning aft of the airfoil and the downward turning above and below is approx- 
imately along 45° lines, as indicated by the dashed lines in Figure 7.3.23. Thus along any vertical cut, at any 
distance behind the airfoil, we have both upward turning and downward turning taking place. We can look at 
the upward turning behind the airfoil as just part of the process by which the downward momentum becomes 
more spread out vertically. 

Likewise, the upward turning ahead of the airfoil does not increase the total upward momentum crossing 
successive vertical planes because it is offset by downward turning that is already taking place far above and 
below. Across vertical planes ahead of the airfoil, the same total flux of upward momentum is always there, no 
matter how far ahead of the airfoil we look. 


It seems counterintuitive that the airfoil can cause a fixed total flux of upward momentum at unbounded dis- 
tances upstream. This finite influence on a flow integral at infinite distance upstream is of course an artificiality 
resulting from our implicit assumption that the airfoil has been in motion forever in an infinite atmosphere. If 
we assume instead a finite flight time since the start of the airfoil's motion, there would have to be a net starting 
vorticity in the field behind the airfoil, with opposite sign to that of the bound vorticity, and it would, in the in- 
tegral sense, cancel the flux of upward momentum ahead of the airfoil. Or a finite altitude above a ground plane 
changes things even if the flight time 1s infinite. In that case, the reflection of the bound vorticity in the ground 
plane comes into play, canceling the integrated flux at very large distances. The effects of a ground plane and/ 
or starting vortex are similar in the 3D case, as we'll see in Section 8.5.3. So in the real world, we don't have 
to worry about finite effects at infinite distances. But the dying off of the integrated momentum flux does re- 
quire interference from a ground plane or a starting vortex, and in most situations these are far away. When the 
starting vorticity and the ground plane are both far away compared to the airfoil chord, there is a large domain 
around the airfoil in which the flow is practically indistinguishable from what it would be in an infinite domain 
and in which the total flux of upward momentum upstream of the airfoil is effectively constant. 

So when an airfoil is effectively operating in free air, the increasing upwash in the flow approaching the 
leading edge does not represent a change in integrated momentum flux, just a redistribution of momentum from 
above and below. Anderson and Eberhardt (2001) misunderstand this redistribution process and argue that the 
increasing upwash ahead of the airfoil requires a downward force on the airfoil, which they call the “the addi- 
tional load caused by the upwash” and which they claim offsets some of the lift. This leads them to an incorrect 
explanation for the reduction in induced drag of a wing flying close to the ground, which we'll discuss in Sec- 
tion 8.3.9. From the momentum analysis we've just discussed, we can see that there is no loading due to upwash 
on an airfoil. 

Airfoil flow patterns in most practical flight situations are characterized by attached flow, in which the 
boundary layers on both surfaces are attached all the way to the trailing edge. The boundary layers then tend to 
have only minor effects on the global flow pattern, and the outer inviscid flow closely follows the predictions 
of potential-flow theory that we discussed in Section 7.1. The potential flow around a 2D airfoil is reversible, in 
that a flow that starts as a uniform onset flow from upstream becomes uniform again downstream, and the same 
flow pattern and pressures arise if the flow is run in the reverse direction. In a viscous flow in the attached-flow 
regime, the flow outside the boundary layer and wake still follows the reversible pattern quite closely. In addi- 
tion to the near-reversibility of the general flow pattern, there is also very little permanent vertical displacement 
of streamlines between upstream and downstream. This is consistent with the view of a 2D airfoil flow as a 
uniform flow with a vortex superimposed, as in Figure 7.3.22. 

There is one feature of 2D lifting flow that might appear at first glance not to be reversible, and that is that 
fluid parcels that start together ahead of the airfoil and split apart to flow above and below the airfoil experience 
a permanent longitudinal displacement between them after they have passed the airfoil, as we saw illustrated in 
Figure 7.3.2. This is, of course, a result of the nonzero circulation. But a little reflection should convince you 
that the longitudinal displacement needn't introduce anything irreversible. In the inviscid case, the displacement 
takes place entirely while the parcels are split apart by the airfoil and involves no slipping of adjacent layers of 
air relative to each other, either ahead of the airfoil or behind. The longitudinal displacement is therefore com- 
pletely reversible in the 2D inviscid case. 

Now let's consider further the issue of how lift varies with angle of attack. In Section 7.1, we noted that New- 
ton's “bullet” theory predicts that lift varies as the square of the angle of attack, but that in continuum subsonic 
flow it tends to vary nearly linearly. This linear variation generally holds as long as the flow remains attached 
all the way to the trailing edge on both surfaces, so that the flow-turning action associated with angle of attack 
remains fully effective. The amount of flow turning varies directly with the angle of attack, but, as I stated in 
Section 7.1, angle of attack has very little effect on the amount of fluid per unit time that is subjected to the 


turning or is influenced significantly in any other way by the airfoil. Another way of looking at this 1s that the 
angle of attack affects the magnitude of the pressure difference between the upper and lower surfaces, but that 
the relative rate at which the pressure difference decays away from the surface 1s related primarily to the airfoil 
chord and depends very little on angle of attack. Therefore the vertical depth of the stream that passes through 
the region in which the pressure disturbance is significant in a relative sense (greater than 5% of the maximum, 
say) 1s essentially independent of angle of attack. 

Let's look at some quantitative “data” (computational results) that illustrate this. Figure 7.3.24 shows isobar 
patterns in the inviscid flowfield for a thin symmetrical airfoil at angles of attack of 4° and 8°. Note that the lift 
at 8° is almost exactly twice that at 4°. Far from the airfoil, the pressure disturbance patterns are essentially the 
same, just with the disturbances for 8° being twice as strong as those at 4°. Thus the strength of the pressure 
disturbance is proportional to angle of attack, but the spatial “spreading” of the disturbance is not. What we see 
here is behavior that is nearly linear with angle of attack, something we'll consider further in the next section in 
connection with the linear theory. 


Figure 7.3.24 How the pressure distribution in the field around an airfoil is affected by a change in a, as illus- 
trated by isobar patterns for a thin symmetrical airfoil (NACA 0010) at two angles of attack. Except for some 
differences due to thickness, doubling the angle of attack roughly doubles the strength of the pressure disturb- 
ances but does not much change the degree of “spreading” of the disturbances above and below the airfoil. 
Solutions calculated by the MSES code (Drela, 1993) in the inviscid mode. (a) a 4°, C7 0.4761; Cp contour in- 
terval 0.05. (b) a 8°, C7 0.9497; Cp contour interval 0.10 
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So far we haven't said anything about the drag of an airfoil, nor have we considered in any detail how flow 
separation limits maximum lift. These effects depend not only on viscosity but on the detailed pressure distri- 
bution on the airfoil's surface, and we'll defer their discussion to the next section. 


7.4 Airfoils 


In the previous section, we defined an airfoil as a cross section of a wing of finite span. In the early years of 
the development of aerodynamics, this terminology was not yet standardized, and “airfoil” or “aerofoil” often 


referred to a complete 3D wing. Anderson (1997, pp. 288—289) provides an interesting look at the often-con- 
flicting terminology that was used in the early days. In any case, in the previous section, we spent considerable 
effort just trying to explain how an airfoil produces lift, which is its primary purpose. Here we'll look into the 
many other important aspects of airfoils, including how pressure distributions and integrated forces depend on 
airfoil shape and angle of attack, factors influencing drag, factors influencing maximum lift, and considerations 
in airfoil design, including some specialized classes of airfoils. 


7.4.1 Pressure Distributions and Integrated Forces at Low 
Mach Numbers 


The pressure distribution on an airfoil surface, and the resulting integrated forces, over a wide range of flow 
conditions, can be predicted with good accuracy by computational fluid dynamics (CFD) calculations based on 
the NS equations or a coupled inviscid/viscous method. But for airfoils with nicely streamlined shapes at low 
Mach numbers and in the attached-flow regime, there are some general characteristics of the pressure distribu- 
tion, the lift curve, and the pitching moment that we can predict without resorting to detailed flow solutions, but 
relying instead on a simplified linear theory. We'll begin this section by considering what the linear theory can 
tell us. 

The linear inviscid theory was developed by Munk around 1920 (see the historical sketch by Giacomelli and 
Pistolesi, in Durand, (1967a) and formally applies in the limit of small thickness and angle of attack, but even 
for airfoils with practical amounts of thickness and camber, it is close enough to reality to provide useful in- 
sights. The beauty of the linear theory is that the effects of angle of attack, thickness, and camber can be treated 
separately and superimposed, which is not only computationally convenient, but provides a powerful way of 
thinking about how airfoil shape and angle of attack affect the pressure distribution. 

In the linear inviscid theory, we assume incompressible potential flow and construct solutions by superposi- 
tion of a uniform free stream and disturbances “produced” by elementary singularities. Superposition is allowed 
because the incompressible potential equation 1s linear to start with. To simplify the problem further we linear- 
ize the boundary conditions as well, which involves two things: 

1. The flow-tangency boundary conditions that in higher-fidelity theories would be applied on the airfoil 
upper and lower surfaces as shown in Figure 7.4.1la, are applied instead along the x axis (y = 0), as shown 
in Figure 7.4.1b and 

2. We ignore the longitudinal perturbation velocity u in calculating the velocity slope that 1s matched to 
the airfoil surface slope. 


Figure 7.4.1 Flow tangency boundary conditions in airfoil potential flow, illustrated for the upper surface. (a) 
Full tangency condition applied at airfoil surface. (b) Linearized tangency condition applied on the x axis. 
Note that perturbation u is ignored 
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In keeping with applying the boundary conditions on the x axis instead of the airfoil surface, the sheet of dis- 
tributed source strength and vorticity that “produces” the flow disturbance is also placed along the x axis. This 
sheet of singularity strength “causes” the perturbation velocities u and v to vary along the x axis and to have 
different values above and below the axis, which are taken to represent airfoil upper-surface and lower-surface 
values respectively. The equations that are solved for the distributions of source and vorticity strength are just 
the tangency condition illustrated in Figure 7.4.1b, imposed for both upper and lower surfaces, over the whole 
chord. The source and vortex singularities play complementary roles in influencing the perturbation velocities 
u and v. The averages and differences in the perturbations between the upper and lower surfaces are influenced 
differently, and the influence relationships are most easily illustrated in Table 7.4.1. 


Table 7.4.1 Influence relationships in linear airfoil theory 
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For the analysis problem (solving for the pressures for a given airfoil shape), we can decompose the airfoil 
surface slope into contributions from the angle of attack, the shape of the mean line, or camber line (halfway 
between the upper and lower surfaces), and the shape of the thickness distribution (the difference between the 
upper and lower surfaces), and we can then determine the first-order pressure coefficient Cp = — 2u/Ux for each 
part separately, where u is the streamwise perturbation velocity evaluated on the chordline. These components 
of the geometry and their corresponding contributions to the pressure distribution are shown in Figure 7.4.2a—c 
for a NACA 4410 airfoil. Adding these pressure contributions together, we get a prediction for the complete 
pressure distribution of the airfoil, in Figure 7.4.2d. These plots follow the usual convention for plotting pres- 
sures in aerodynamics, with Cp plotted on an inverted scale so that when an airfoil is lifting upward, the pressure 
on the upper surface appears above the pressure on the lower surface. 





Figure 7.4.2 NACA 4410 airfoil at 2° angle of attack, shown decomposed into a chord line, camber (mean 
line), and thickness. The linear-theory contribution of each to the pressure distribution is also shown. (a) Angle 
of attack. (b) Camber line (NACA 44xx). (c) Thickness shape (NACA xx10). (d) All combined for NACA 
4410 airfoil at 2° o 





The angle of attack and the mean-line shape affect only the average of the upper-and lower-surface slopes 
and thus the average v perturbation, and they therefore affect only the vorticity distribution, according to Table 
7.4.1. As we saw in Section 3.3, the local strength of a vortex sheet is equal to the difference (jump) in tangen- 
tial velocity across the sheet. In this case, the local strength of the vorticity distribution on the chordline is equal 
to the perturbation u and is therefore also proportional to the pressure difference, or the local lift loading. 

The thickness shape affects only the difference in surface slopes and thus the difference in the v perturbations, 
which 1s locally equal to the strength of the source distribution. The source strength, and therefore the thickness 
distribution, affects only the average of the wu perturbations, not the difference, according to Table 7.4.1. 

So we see that according to linear theory, the chordwise distribution of lift 1s affected only by angle of attack 
and camber shape, not by thickness, and that the thickness shape affects only the average velocity over the up- 
per and lower surfaces. This is reflected in the contributions to the pressure distribution shown in Figure 7.4.2. 
The pressure distributions due to angle of attack in part (a) and camber shape in part (b) are pure lift-carrying 
pressure differences (equal and opposite on upper and lower surfaces) with zero average perturbation pressures. 
The pressure distribution due to thickness 1n part (c) has a nonzero average and zero difference. 

Now look in particular at the pressure distribution due to angle of attack, shown in Figure 7.4.2a for an angle 
of attack of 2°. Note that the shape of this pressure distribution is generic and that only the amplitude changes 
with angle of attack. The distribution is singular at the leading edge (ACp is infinite), but it is integrable, and 


the lift and moment at a given a can be calculated by integration. Because the AC» varies linearly with angle 
of attack, the lift slope dC//da is constant, and its value is 27 per radian. The lift is concentrated well forward, 
and the centroid, or the point about which the pitching moment 1s zero, 1s located at the quarter chord for any 
nonzero angle of attack. The quarter chord is thus a convenient reference point for pitching moments: Any 
nonzero pitching moment about the quarter chord can be due only to the camber shape and will not vary with 
a, according to the linear theory. This holds true with reasonable accuracy for airfoils with practical amounts of 
thickness and camber, in the absence of nonlinear compressibility effects, and up to angles of attack at which 
boundary-layer separation begins. 

The pressure distribution due to a typical camber shape at zero a is shown in Figure 7.4.2b, in particular the 
NACA 44xx mean line. This mean line produces an incremental C7 of 0.455 with its centroid at 0.488 x/c, for a 
pitching-moment coefficient Cy of —0.108 about the quarter-chord. 

Figure 7.4.2c shows the pressure distribution for the NACA xx10 thickness shape at zero a, which is, of 
course, the same upper and lower. Combining this with the a and camber contributions, (a) and (b), gives the 
complete linear-theory pressure distribution for the NACA 4410 airfoil at 2° a, shown in part (d). 

Linear theory can also be applied to the “inverse” design problem (solving for the airfoil shape for a given 
pressure distribution). Here we would decompose the pressure distribution into a lift part (difference between 
upper and lower pressures, as shown in Figure 7.4.2 by the sum of (a) and (b), and a thickness part (average of 
upper and lower pressures, as in Figure 7.4.2c). The lift part determines the mean-line shape and the angle of 
attack, and the thickness part determines the thickness shape. Linearized design is almost never used in practice, 
because anyone who really wants to design an airfoil will invariably want to use a higher-fidelity theory. We'll 
discuss airfoil design further in Section 7.4.10. 

So to summarize what the linear theory leads us to expect regarding the pressure distribution and forces on 
an airfoil: 

1. The pressure difference between the upper and lower surfaces, and thus the lift, is due primarily to the 
angle of attack and the shape of the mean line and is much less affected by the thickness distribution. 


2. The pressure disturbance averaged between the upper and lower surfaces is due mainly to the thickness 
distribution and much less affected by angle of attack and camber. 


3. The integrated lift is linear with a, and the lift slope dC//da is close to 27 per radian. 
4. The pitching-moment coefficient Cj, about the quarter-chord is nearly constant with a. 


As we'll see, these expectations hold up reasonably well, at least qualitatively, for airfoils of practical thick- 
ness at low Mach numbers in the attached-flow regime. 

Now let's look at some of the ways “real” airfoil flows differ from the predictions of the linear theory. First, 
even before we add viscous effects, just moving the inviscid-flow boundary conditions from the x axis to the 
actual airfoil surface and taking perturbation u into account in the boundary conditions have effects that increase 
with thickness, camber, and angle of attack. In Figure 7.4.3, pressure distributions from the linear theory are 
compared with inviscid-flow calculations from the MSES code (Drela, 1993) for two airfoils that differ by a 
factor of 2 in thickness, camber, and a: NACA 2405 at 1°, and NACA 4410 at 2° (same case as in Figure 7.4.2). 
In both cases, the linear-theory underpredicts the pressure disturbances compared with the higher fidelity invis- 
cid calculations, but much less so for the thinner airfoil than for the thicker one. The discrepancy seems to grow 
roughly quadratically with thickness, as one might expect for nonlinear effects. Still, the linear theory captures 
the general character of the pressure distributions. 


Figure 7.4.3 Comparison between the linear inviscid theory and incompressible inviscid flow with boundary 
conditions applied at the airfoil surface, calculated by the MSES code (Drela, 1993). Comparisons are made 
for two airfoil cases that differ by a factor of 2 1n camber, thickness, and a, so that linear theory predicts a 
doubling of the pressure disturbances. (a) NACA 2405 at 1° a. (b) NACA 4410 at 2° a 





Figure 7.4.4 compares the linear theory with inviscid MSES calculations for the NACA 4410 in terms of the 
change in the chordwise distribution of lift for a change in a from 0° to 2°. Here we see that the linear theory 
captures the general character of the distribution, except that the real airfoil, with its blunt-nosed thickness dis- 
tribution, has no singularity at the leading edge. 


Figure 7.4.4 The effect on the chordwise distribution of lift of a change in a from 0 to 2°. The linear-theory 
prediction, which is independent of airfoil shape is compared with a calculation fora NACA 4410 airfoil, cal- 
culated by the MSES code (Drela, 1993) in the inviscid mode 
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Viscosity can have significant effects on the pressure distribution, through the boundary-layer displacement 
effect. In Figure 7.4.5a, the inviscid MSES calculation of Figure 7.4.3b is compared with a corresponding vis- 
cous calculation at the same angle of attack for a chord Reynolds number of 10 million, assuming laminar-to- 
turbulent transition close to the leading edge. The boundary-layer displacement effect, which 1s at its strongest 
by far on the upper surface near the trailing edge, reduces the effective angle of attack and aft camber of the 
airfoil, and changes the circulation, affecting pressures over the whole chord and noticeably reducing the lift. If 
the viscous solution 1s run so as to match the inviscid C; instead of the inviscid a, the differences in the pressure 
distributions are reduced, as shown in Figure 7.4.5b. Note that due to the reductions in the effective camber and 
a, the viscous calculation requires a higher a to match the inviscid C7. The effect of the higher a can be seen in 
the higher suction level on the forward upper surface. 


Figure 7.4.5 Effect of viscosity on the pressure distribution of NACA 4410 at 2° a, R= 107, tripped to turbu- 
lent at 5% chord, calculated by the MSES code (Drela, 1993). (a) Inviscid and viscous MSES calculations at 
2° a. (b) Inviscid MSES calculation at 2° a; viscous calculation matches inviscid C7 = 0.7385 (a = 2.541°) 
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The most dramatic local changes in pressure brought about by the boundary-layer displacement effect are 
seen in the neighborhood of the trailing edge. As we saw in Section 3.10, the inviscid flow off of a trailing edge 
with a nonzero wedge angle has a stagnation point, and the exact inviscid solution should thus have stagnation 
pressure (Cp = 1.0) at the trailing edge. The numerical solution plotted here doesn't reach stagnation pressure 
because the pressure change approaching the stagnation 1s too rapid to be resolved by the numerical grid (note 
that when a potential flow turns a concave corner of less than 90°, stagnation pressure is approached with in- 
finite slope, as in Figure 3.10.2c). But this issue is irrelevant in the real world with viscous effects. Real pres- 
sure distributions look very much like the viscous solutions plotted in Figure 7.4.5, in which the boundary-layer 
displacement effect has substantially reduced the pressure recovery near the trailing edge and eliminated the 
stagnation. This alteration of the pressure distribution is a significant contributor to the profile drag, as we'll see 
later. It is easy to be deceived by appearances, however. Although the pressure changes near the trailing edge 
are the most dramatic, the pressure contribution to the profile drag is subtle and depends on changes around the 
entire airfoil contour. 


Now let's look at the effects of nonlinear boundary conditions and boundary-layer displacement on the lift- 
versus-a curve. Figure 7.4.6 shows the lift curve predicted by linear theory for the 44xx mean line, again show- 
ing the C; shift of 0.455 at zero a and a slope dCj/da of 2a per radian. For comparison, MSES predictions 
are shown for the NACA 4410 and NACA 4420 airfoils, for both inviscid flow and all-turbulent flow at R 10 
million. The inviscid MSES calculations show higher lift slopes than linear theory, increasing with increasing 
thickness ratio. The viscous calculations show significant losses of lift relative to inviscid. The calculation for 
10% thickness ratio just happens to agree closely with the linear theory over the linear part of the lift curve, 
while the calculation for 20% thickness ratio is lower. At higher angles of attack, the displacement thickness of 
the boundary layer on the aft upper surface grows rapidly with angle of attack, causing the lift curve to become 
nonlinear and the lift slope to decrease. The maximum lift shown may be close to the maximum lift this airfoil 
could produce, but the MSES solution scheme based on simultaneous boundary-layer/inviscid-flow coupling 
did not in this case produce solutions showing maximum lift and beyond. 


Figure 7.4.6 Lift curves predicted by linear theory and the MSES code (Drela, 1993) with and without viscos- 
ity for the NACA 4410 and 4420 airfoils 
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So far, we've seen two important effects of viscosity on the pressure distributions: The reduction in aft pres- 
sure recovery that contributes to profile drag, which we'll discuss in Section 7.4.2, and a reduction in lift slope 
that increases with increasing thickness. The latter effect explains what Yates (1990) referred to as the “fatness 
paradox,” in which the lift slope in inviscid flow increases with increasing thickness, while the lift slope in vis- 
cous flow decreases. 

In Figure 7.4.4, we saw that a rounded leading edge removes the leading-edge singularity that would gener- 
ally occur in the inviscid solution for the mean line alone. On an airfoil that has no sharp corner other than the 
trailing edge, flow attachment occurs at a stagnation point, as we saw in Section 5.2.1. A stagnation point is, of 
course, a feature that the linear theory cannot model accurately. At moderate angles of attack for most airfoils, 
the stagnation point occurs near the leading edge. Figure 7.4.7 shows the pressure distribution plotted versus arc 
length around the leading edge, calculated for the NACA 4410 at an angle of attack of 2° in inviscid flow and 
at the matching C; condition for viscous flow (same conditions as in Figure 7.4.5b). There is a slight shift in the 


location of the stagnation point, but the shape of the pressure distribution in the neighborhood of the stagnation 
point is affected very little by viscosity. The value of Cp at stagnation is greater than 1 because of numerical 
issues related to running a compressible code at a very low Mach number. 


Figure 7.4.7 Pressure distributions in the neighborhood of the leading-edge stagnation point of a NACA 4410 
airfoil. The inviscid MSES calculation is at 2° a, and the viscous calculation matches the inviscid C7 = 0.7385 
(a = 2.541°, as in Figure 7.4.5b) 
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7.4.2 Profile Drag and the Drag Polar 


In Chapter 6, we looked in considerable detail at drag in general, and we noted that although it is not possible 
to rigorously decompose the drag force according to the flow mechanisms that cause it, an approximate decom- 
position into lift-induced drag, shock drag, and viscous drag is possible with the aid of simplified theoretical 
models. The problem we face here is simpler: A 2D airfoil in 2D flow has no induced drag, and shock drag ap- 
pears only at transonic speeds when regions of supersonic flow appear in the field. We'll defer transonic issues 
to Section 7.4.8 and look now just at the viscous drag in subcritical flow. As we saw in Section 6.1.3, viscous 
drag has both a shear component and a pressure component. In the context of airfoils and wings, the viscous 
drag is often called the profile drag. 

Of course in an ideal 2D inviscid flow without shocks, the drag of an airfoil must be zero. This is an example 
of D'Alembert's paradox, which we discussed in Sections 5.4 and 6.1.6. Intuitively speaking, zero drag is even 
more surprising in the case of a lifting airfoil at an angle of attack than it is in general. Consider the lifting air- 
foil sketched in Figure 7.4.8a. Because of the general downward slope of the airfoil surfaces, due to the angle 
of attack, the pressure difference between the upper and lower surfaces produces a net force that over much of 
the airfoil chord is tilted backward as illustrated by the arrow above the airfoil. If the integrated pressure force 
over so much of the chord is tilted backward, how can the drag be zero? The answer to this lies with something 
that is loosely termed /eading-edge suction. On a lifting airfoil with a rounded leading edge, as in Figure 7.4.8a, 
the pressure around the leading edge is predominantly lower than elsewhere, as indicated by the minus signs, 
just enough to balance the general backward tilt of the lift over the rest of the chord. On an airfoil with zero 
thickness, as in Figure 7.4.8b, there is only one angle of attack at which the flow attaches smoothly to the lead- 
ing edge. At any other angle of attack, the potential flow around the leading edge has a singularity with infinite 


velocity, and produces a singular suction force parallel to the surface tangent at the leading edge, as indicated 
by the arrow. In 2D inviscid flow, this suction force must exactly cancel any net drag force resulting from the 
pressure differences over the rest of the chord. 


Figure 7.4.8 The concept of leading-edge suction. (a) Low pressure around a rounded leading edge. (b) Singu- 
lar suction force at the leading edge of an airfoil with zero thickness 
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In the real world, the viscous drag of a typical streamlined airfoil in attached flow tends to be small, on the 
order of 10 ~ in terms of the drag coefficient. When we want to predict airplane performance or compare can- 
didate designs, we would like to pin this small drag number down to within a small percentage, so we are look- 
ing at trying to predict the profile drag to within something on the order of a drag count (10 *). Computational 
predictions at this level are at or beyond the limit of what is currently practical and are therefore problematic. 
When we are looking at increments on the order of a drag count, we should use only the best methods available, 
and we should apply them with great care. We'll address the CFD issues further in Chapter 10. Experimental 
measurements of drag at this level are likewise difficult. Only particular experimental setups and measurement 
methods are up to the task, and then only marginally so. Experimental uncertainties are often as large as the 
increments we would like to determine. These difficulties, however, don't keep us from drawing some general 
conclusions about the nature of viscous profile drag. 

In Section 5.4 and in Section 6.1.3, we discussed the decomposition of drag into a surface shear stress, or 
skin-friction part and a pressure part, and noted that this decomposition involves none of the theoretical ideal- 
ization that is required in the induced/shock/viscous drag decomposition. In attached flow, typical airfoil pro- 
file drag is mostly integrated skin friction, but pressure also makes a significant contribution, as a result of the 
boundary-layer displacement effect. A viscous-inviscid interaction code can do a reasonable job of predicting 
the boundary-layer displacement thickness, but an NS code is generally required if one is to calculate the effect 
on the pressure accurately enough to allow the viscous pressure drag to be calculated by direct pressure integ- 
ration. In practice, accurate prediction of the pressure drag in an NS code requires a flow solution with an ex- 
cellent degree of grid convergence. In viscous-inviscid interaction codes, the pressure drag cannot generally be 
calculated directly with sufficient accuracy, and the total drag must be inferred from less-direct flowfield mani- 
festations. The wake development can be calculated into the farfield and used in the farfield wake-momentum 
formula, Equation 6.1.7, as is done in the MSES code (Drela, 1993), for example, or trailing-edge boundary- 
layer quantities can be used in the Squire- Young formula, Equation 6.1.8 for incompressible flow, or the com- 
pressible version by Cook (1971). 

Now let's look at typical behavior of skin-friction and pressure drag, using MSES calculations for the NACA 
4410, the airfoil whose pressure distributions we looked at in Figure 7.4.5. First, a typical distribution of Cr 
along the chord is shown in Figure 7.4.9, for the same flow condition as in Figure 7.4.5b. Flat-plate Cr from 


Equations 4.3.4 and 4.3.5 1s shown for comparison. The artificially forced transition from laminar to turbulent 
at 5% chord is clearly seen. It is also clear that pressure gradients and edge velocities different from freestream 
make Cf on the upper and lower surfaces considerably different from that on a flat plate, and that Cf on the 
upper surface is higher than that on the lower surface, except near the trailing edge. In Figure 7.4.10 we look 
at the drag polar behavior, that is, how these things vary with C7. The skin friction calculated by the code's 
boundary-layer equations was integrated directly, and the total drag was calculated from the farfield wake mo- 
mentum. The difference was assumed to be the pressure drag and labeled as such. The total skin-friction drag is 
only slightly greater than that on a two-sided flat plate and varies little with C7. Most of the variation of the drag 
with C7 is seen to come from the pressure drag. The variation is roughly parabolic at lower C;, but it becomes 
strongly nonparabolic at higher C7, as the boundary layer on the aft upper surface approaches separation. 


Figure 7.4.9 Skin-friction distribution for NACA 4410 calculated by the MSES code (Drela, 1993). R = 107, 
C7 = 0.7385 (a = 2.541°). Here Cr is normalized by farfield q, not local edge q. C¢ for a flat plate at the same 


Reynolds number is shown for comparison 
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Figure 7.4.10 Drag predicted by the MSES code (Drela, 1993) for the NACA 4410, R = 10/, trip at 5% chord. 
(a) Drag polar. (b) Form factor 
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In Section 6.2.1, we discussed the concept of a form factor that relates the actual profile drag to the skin- 
friction drag of a flat plate at the same chord Reynolds number and with laminar-to-turbulent transition at the 
same chord location. Handbook methods for drag estimation often use the form factor in estimating the profile 
drag of the airfoils that make up the wing. The flat-plate skin friction is easily calculated by Equation 4.3.2 for 
all-turbulent flow, and an adjustment can be made if a significant run of laminar flow is expected. The form 
factor itself poses more of a problem and is thus the weak point of the form-factor approach. To illustrate why, 
the form factor variation calculated from the MSES predictions of Figure 7.4.10a is plotted in Figure 7.4.10b. 
Like the pressure drag in Figure 7.4.10a, the form factor has a roughly parabolic variation at lower C; and be- 
comes strongly nonparabolic at higher C7. Note that even at the drag minimum the form factor 1s greater than 
1 by more than 20%, reflecting the fact that the viscous drag of an airfoil is typically significantly greater than 
the skin-friction drag of a flat plate. In handbook methods, only the parabolic part can be correlated reasonably 
well with the airfoil maximum thickness and maximum camber. The nonparabolic part depends on airfoil design 


details and is usually ignored. As a result, when it comes to making airplane performance predictions, relying 
just on the parabolic part of the airfoil polar that fits the low-C/ range is bad practice. Usually, the airfoil C; that 
provides the highest airplane lift/drag ratio is determined by the rapid increase in profile drag on the nonpara- 
bolic part of the airfoil polar. Using the parabolic polar that fits the low-C; range will generally lead to a choice 
of operating C, that is too high and will also yield a seriously optimistic drag estimate. 


7.4.3 Maximum Lift and Boundary-Layer Separation on 
Single-Element Airfoils 


We now turn our attention to the problem of the maximum lift an airfoil can produce and how it is affected 
by the airfoil shape and pressure distribution. There are limits to the amount of lift that can be produced even 
in inviscid flow, as discussed by Smith (1975). Either the circulation becomes so large that the attachment and 
separation points merge, and it is assumed that larger circulation would be a mathematical artificiality, or the 
condition on the upper surface approaches vacuum. Smith points out that these limits are moot, however, be- 
cause viscous effects impose much lower limits. As we noted in our basic explanation of lift in Section 7.3.3, 
the maximum lift of an airfoil is limited because “there is an angle of attack above which the flow cannot follow 
the upper surface and breaks away,” as shown in Figure 7.3.16. Now it's time to look in more detail at what that 
means. The breaking away is of course boundary-layer separation, and it changes the basic topology of the flow, 
as we discussed in Sections 5.2 and 5.3. As we saw in Section 4.1.4, the immediate cause of any separation 
from a smooth surface in 2D flow must be an adverse pressure gradient. The angle of attack strongly affects 
the pressure distribution and is therefore a primary variable affecting separation. Separation also depends on the 
Reynolds number and the location of laminar-to-turbulent transition. 

The progression of separation with angle of attack, and the resulting variation of lift with angle of attack, 
can follow a variety of scenarios, depending on the details of the airfoil shape and pressure distribution and 
on whether the boundary layer is laminar or turbulent at the separation point. We'll take a brief look at the in- 
teresting complexities that can accompany laminar separation before we take up the simpler case of turbulent 
separation that dominates at high Reynolds numbers. 

If the Reynolds number is relatively low, or if the airfoil is thin enough, separation can often occur while 
the boundary layer is still laminar, and the progression with angle of attack can be complicated. The shear layer 
downstream of a laminar separation is generally highly unstable, and transition tends to happen within a short 
distance, but of course the distance depends on the Reynolds number. Depending on the length of the transition 
region and the shape of the airfoil, transition may or may not cause reattachment of the boundary layer to the 
surface. A laminar separation with subsequent turbulent reattachment is called a /aminar separation bubble and 
generally has the structure illustrated in Figure 7.4.11. If the displacement-thickness Reynolds number at the 
separation point is above 500 (see Thwaites, 1958), transition is very quick and nearly always results in reat- 
tachment. The resulting bubble is typically only 1—2% of the chord in length and is called a short bubble. At 
lower Reynolds numbers, transition is less vigorous, and reattachment may or may not occur. If a bubble results, 
it is referred to as a long bubble. 


Figure 7.4.11 General structure of a laminar separation bubble with turbulent reattachment. From Thwaites, 
(1958). Used with permission of Dover Publications, Inc. 
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Most airfoil flows are unique, in that there is only one possible flow pattern for a given airfoil shape, farfield 
flow condition, and angle of attack. But in some situations when laminar separation bubbles are involved, there 
can be two possible flow patterns at the same angle of attack, and the lift curve can have a hysteresis loop. Fig- 
ure 7.4.12a,b shows two possible types of hysteresis loop. 


Figure 7.4.12 Lift curves showing different types of stall behavior depending on the progression of upper-sur- 
face boundary-layer separation. (a) Counter-clockwise hysteresis loop in the mid-C; range associated with lam- 
inar separation near mid-chord. (b) Clockwise hysteresis loop at the top of the lift curve associated with the 
bursting of a laminar separation bubble near the leading edge. (c) Gentle turbulent stall: separation moves for- 
ward gradually, and lift increases somewhat after the first appearance of separation. (d) Sudden turbulent stall: 
separation jumps forward suddenly, and lift increases little or not at all after first separation 
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The counterclockwise loop in part (a) 1s associated with a laminar separation bubble near mid chord that first 
appears at an angle of attack below point A. As point A on the lift curve is approached from below, the bound- 
ary layer ceases to reattach, and the lift jumps down to point B on the lower branch. When the angle of attack 
is further increased to point C, transition moves ahead of the former separation point, the separation disappears, 
and the lift jumps up to point D. When the loop is traversed in the other direction, earlier transition continues to 
prevent the laminar separation, and the lift follows the upper branch back to point A. 

A clockwise hysteresis loop like that shown in part (b) of Figure 7.4.12 is more common and is associated 
with laminar separation near the leading edge. Below point A on the lift curve, there is a laminar short bubble 
with reattachment. The laminar bubble thickens the boundary layer considerably, however, and reduces the tur- 
bulent boundary layer's ability to resist separation downstream. (We'll look at the effect of incoming boundary- 
layer thickness on turbulent separation in some detail in connection with turbulent stall below.) At point B, sep- 
aration jumps forward suddenly from the trailing edge to the leading edge, leading to the precipitous drop in lift 
to the lower branch at point C. This is often described as the “bursting” of the laminar bubble. Now that the lift 
is reduced, and with it the circulation around the airfoil, the stagnation point is not as far aft on the lower surface 
as it was when the lift was on the upper branch. The distance around the leading edge from stagnation to separ- 
ation is shorter, the displacement-thickness Reynolds number at separation is lower, and the ability to reattach 
is reduced. As angle of attack is reduced, the lift follows the lower branch to point D before the reattachment is 
reestablished, and the lift jumps back up to point A. 


Obviously, flows with laminar separation bubbles can display complex behavior, and it turns out that they 
are not that easy to predict. Drela's ISES and MSES codes, which are based on viscous/inviscid interaction, and 
in which the integral-boundary-layer portion of the analysis includes a transition-prediction module, can predict 
such flows in some cases. At the current state of the art, NS codes based on the RANS equations with turbulence 
modeling don't routinely predict these effects because they don't typically incorporate any option for transition 
prediction. 

At the high Reynolds numbers typical of large airplanes, laminar separation tends not to arise, and lift is 
limited by turbulent separation. In a “turbulent stall,” lift can increase substantially or very little after separa- 
tion first appears, depending on how the separation moves forward as angle of attack increases. As a result, the 
airfoil's lift curve can vary between the extremes illustrated in Figure 7.4.12c,d. Separation may move gradu- 
ally forward from the trailing edge, producing a gradual reduction in lift slope and a broad maximum in the 
lift curve as 1n part (c), or it may jump forward suddenly, producing a precipitous drop in lift as in part (d). In 
the gradual scenario, the actual sta//, or onset of the decrease in lift, 1s usually preceded by buffeting caused by 
flow unsteadiness related to the separation, and the loss of lift post-stall is gradual. In the sudden scenario, the 
stall comes without warning, and the loss of lift is abrupt, which can cause sudden changes in airplane handling 
characteristics. Obviously, flight safety favors a gradual stall, and we'll see later what pressure-distribution fea- 
tures can be designed into an airfoil to produce it. Even an airplane with a gradual stall, however, is deliberately 
flown into the buffeting regime only in special circumstances, typically flight testing and pilot training, and in 
this sense, the onset of flow separation provides one definition of the maximum usable lift coefficient. For this 
reason, and because it is easier to predict computationally than the actual maximum lift, we'll take the onset of 
flow separation as the threshold for purposes of this discussion. 

By this criterion, the maximum lift depends on how low the average upper-surface Cp can be made before 
the upper-surface boundary layer can no longer make it to the trailing edge without separating. The “run” of 
the upper-surface boundary layer starts at the stagnation point of attachment, which under high-lift conditions is 
usually on the lower surface, back a short distance from the leading edge, so that the boundary layer must flow 
around the leading edge before it starts to traverse the upper surface. Then whether the boundary layer separates 
or not depends on the pressures it is subjected to around the leading edge and along the entire upper surface. 


We saw what a typical upper-surface pressure distribution looks like in Figure 7.4.2: There is a decrease in 
pressure as we move away from the stagnation point of attachment near the leading edge, and a minimum pres- 
sure we'll call the suction peak, which can be very close to the leading edge or farther back, depending on the 
airfoil shape and the angle of attack. The suction peak is followed by an increase in pressure, or pressure recov- 
ery from the suction peak to the trailing edge. For an airfoil with nonzero thickness in attached viscous flow, 
the Cp at the trailing edge typically has a slightly positive value that depends primarily on the average thickness 
and is not very sensitive to the details of the thickness shape, for reasonable shapes. 

The recovery region is where the pressure gradient is adverse, and that's where we'll concentrate our atten- 
tion. In general, a streamlined body with thickness in attached flow must have a recovery region at the rear. In 
Figure 7.4.2c, we saw that a typical airfoil thickness distribution by itself contributes a modest amount to the 
recovery, and in Figure 7.4.2b, we saw that the typical camber distribution also contributes. For determining 
maximum lift, the contribution from angle of attack (Figure 7.4.2a) is crucial because it increases with angle of 
attack, making the pressure gradient over the entire recovery region increasingly adverse. Increasing a eventu- 
ally causes the gradient to exceed what the boundary layer can tolerate, regardless of other details of pressure 
distribution. 

To maximize the lift at the onset of separation, we want to make the average Cp on the upper surface as neg- 
ative as possible, subject to the constraint imposed by the slightly positive C, that we must reach at the trailing 
edge. We can decrease the average Cp either by lowering the minimum pressure or by keeping the pressure low 
over more of the chord, which delays the start of the recovery, or by a combination of both. Lowering the min- 


imum pressure increases the pressure rise the boundary layer must withstand. Delaying the start of the recovery 
shortens the distance in which the recovery must take place and thickens the boundary layer at the start of the 
recovery, both of which tend to bring separation on sooner. The shape of the recovery pressure distribution is 
also an important factor. 

There is no single, simple way to visualize how all of these factors come together to affect maximum lift. A 
conventional Cp plot normalized by the farfield dynamic pressure qo has the advantages of directly showing the 
effects of the pressure distribution on C; and of making it clear that all recovery pressure distributions are es- 
sentially “anchored” to a trailing edge Cp that is fixed within a narrow range. But conventional Cp at the suction 
peak varies widely, making it difficult to visualize how that part of the pressure distribution affects separation. 
The amount of pressure rise, or velocity drop, that the boundary layer can withstand is much more closely re- 
lated to the conditions at the suction peak than to those in the farfield. In his classic Wright Brothers lecture on 
high-lift aerodynamics, Smith (1975) suggested that plotting recovery pressure distributions in what he called 
canonical form, in terms of a pressure coefficient relative to the suction-peak pressure and normalized by the 
peak dynamic pressure, reduces the variation in AC, between the suction peak and separation and makes the 


separation trends easier to see. The canonical pressure coefficient ~! is thus defined and related to conven- 
tional Cp by 
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where the subscript o denotes conditions at the suction peak, which we'll take as the start of the recovery. Note 


that C » at the suction peak is zero by definition, and it takes on only positive values in the recovery. Separation 


of a turbulent boundary layer generally occurs at ~P values between 0.4 and 0.9, a much smaller range than 
we would see in terms of conventional ACp. Of course, to see how the separation trends affect Cy, we will still 
have to return to the conventional pressure distribution. 
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Let's look at some separation trends in terms of canonical pressure distributions. Figure 7.4.13, from Smith 
(1975), shows in a single plot how C at separation is affected by the shape of the recovery pressure distribu- 
tion and the thickness of the boundary layer at the start of the recovery, which in these examples is determined 
by the length xo of an ideal constant-pressure boundary-layer run ahead of the start of the recovery. The longer 
the length xo, the thicker the boundary layer at the start of the recovery. The results shown are for turbulent 
boundary layers and are based on numerical solutions to the boundary-layer equations with the algebraic eddy- 
viscosity model of Cebeci and Smith (1974) and with the given pressure distributions as boundary conditions. 
Because of the limitations of the simple turbulence model, these calculations are generally optimistic regard- 
ing to the amount of pressure recovery that is possible, especially in the cases where the recovery leading to 
separation is short when measured in terms of initial boundary-layer thicknesses, but the qualitative trends are 


realistic. 


Figure 7.4.13 Separation loci for a family of power-law canonical pressure distributions preceded by different 
lengths xo of boundary-layer run ahead of the recovery. In parentheses are values of Ro at x = 0. (From Smith, 
1975; Figure 20.) The airfoil sketch at the top, and the circles and arrows pointing to it, were added to Smith's 
original figure to illustrate how these curves would apply to a hypothetical airfoil for the particular case of xo = 


0.25 ft and m = 1/2 
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Three important trends are clear in Figure 7.4.13: 
1. A concave recovery (a recovery pressure distribution characterized by m < 1, in which the adverse 
pressure gradient starts out strong and eases downstream) leads to separation in a shorter distance than 
does a convex recovery (m> 1), 
2. In spite of the shorter distance to separation, a concave recovery produces greater pressure recovery 
than a convex recovery, and 
3. A thin boundary layer at the start of the recovery (small xo) results in greater pressure recovery, and a 
thick boundary layer results in less. 


Smith (1975) showed that these results are not very sensitive to Reynolds number: Reducing the Reynolds 





number by a factor of 10 reduced the values of ~P at separation generally by less than 0.1. But for a given 


pressure distribution, separation is strongly affected by wing sweep, as we'll see in Section 8.6.2. 

So viewing recovery pressure distributions in canonical form makes it easier to visualize how the initial 
boundary-layer thickness and the shape of the pressure distribution affect separation. But it isn't clear just from 
the canonical distributions what these boundary-layer separation trends imply regarding airfoil maximum lift. 
To visualize the effects on maximum lift, we must convert the canonical distributions back to conventional form 
and view them as candidate upper-surface pressure distributions of a hypothetical airfoil with a rapid flow ac- 
celeration near the leading edge and a region of constant pressure back to the recovery point, followed by the 
pressure recovery. 


The conversion to a conventional pressure distribution goes as follows. We take xo in the canonical plot to 
correspond to the leading edge of our airfoil, and if we place the trailing edge at Xsep in the plot, we have sep- 
aration beginning just at the trailing edge, which corresponds to our criterion for the maximum usable lift. The 
airfoil sketch at the top of Figure 7.4.13, and the circles and arrows pointing to it, were added to Smith's ori- 
ginal figure to illustrate how this would work for a hypothetical airfoil in the particular case of xo = 0.25 ft and 
m = 1/2. In this case the leading edge lines up with x = — 0.25 ft, and the trailing edge lines up with Xsep for 
m = 1/2, Xo = 0.25. For general cases, each of the possible combinations of Xo and Xsep then corresponds to 
particular locations of the leading and trailing edges on the plot, and a particular chordwise location x/c for the 
start of the recovery, often referred to as the recovery point. Then we assume that the conventional Cp at the 


trailing edge always has a slightly positive value, say 0.2, and that this corresponds to ~F at separation for the 
particular pressure distribution we're considering in the plot. Through Equation 7.4.1, this defines uo for each 


case we consider, and thus defines the relationship between ~? and Cp. When we do this for any one of the 
canonical pressure distributions in Figure 7.4.13, and repeat it for various combinations of Xo and Xsep, we get a 
family of conventional pressure distributions with different recovery points and different Cp levels at the suction 
peak. Strictly speaking, each of these pressure distributions corresponds to a different chord Reynolds number, 
because in Figure 7.4.13 it is the “unit” Reynolds number uo/v of the canonical x scale that 1s constant. But 
the Reynolds-number sensitivity is fairly weak, and this discrepancy shouldn't seriously distort the qualitative 
trends we're looking for. 
Results of this process are shown in Figure 7.4.14 for four of the canonical pressure distributions ranging 

from concave to convex. Several important trends are evident in these plots: 

1. In the part of the pressure distribution ahead of the pressure recovery (often called the rooftop), higher 

suction levels (lower pressures) are possible with concave recoveries, 


2. For a given shape of the recovery pressure distribution, the allowable suction level increases as the 
recovery point is moved forward, and 


3. For a given suction level, a concave recovery allows a farther-aft recovery point and a longer rooftop. 


Figure 7.4.14 Families of conventional maximum-lift pressure distributions corresponding to some of the ca- 
nonical pressure distributions of Figure 7.4.13; for each assumed location of the recovery point, the rooftop 
level was determined so that separation is reached at the trailing edge. All-turbulent flow was assumed ahead 
of the recovery 





A higher suction level and a longer rooftop both increase lift, so that in view of items (1) and (3) above, 
concave recoveries should be favorable. This is born out by the plots in Figure 7.4.14, which clearly show that 
concave recoveries produce higher lift, and by the corresponding upper-surface lift coefficients plotted in Figure 
7.4.15. 


Figure 7.4.15 Upper surface C; values for the pressure distributions of Figure 7.4.14, plotted versus the 
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It is also clear that for any given shape of recovery pressure distribution, high lift involves a compromise 
between having low pressure at the suction peak and having the low pressure extend over more of the chord. 
There is therefore an optimum location for the recovery point that yields the highest lift. Figure 7.4.15 shows 
that the optimum recovery point moves aft, and the maximum lift dramatically increases, as the recovery pres- 
sure distribution is made more concave. Remember that chord Reynolds number was not kept constant in these 
examples, and the trends are only qualitative approximations to what would happen for constant Reynolds num- 
ber. 


So in these examples, the highest maximum lift 1s achieved by the most concave recovery pressure distribu- 
tion, or the most “rapid” recovery. Smith (1975) concluded that the highest possible lift that can be achieved 
without separation is achieved by the most rapid of all possible recoveries, the Stratford recovery (Stratford, 
1959a, b), which is shown in canonical form in Figure 7.4.16 (Smith's Figure 21). In a Stratford recovery, an 
extremely strong pressure gradient at the start brings the boundary layer immediately to near-zero Cf, and the 
remainder of the pressure distribution is tailored to maintain the boundary layer in a constant state of incipient 
separation. The boundary-layer velocity profile in a Stratford recovery is nearly self-similar, and it is one of the 
family of equilibrium turbulent-boundary-layer flows that we discussed in Sections 4.3.2 and 4.4.2. 


Figure 7.4.16 Canonical pressure distributions for Stratford's recovery at two different values of unit Reynolds 
number. (From Smith, 1975; Figure 21; including references to his equation 4.11) 
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The potential for high maximum-lift coefficients led Liebeck (1973) to use the Stratford recovery pressure 
distribution as the basis for several low-speed airfoil designs. A drawback to this approach is that above the 
“design” angle of attack, separation jumps forward abruptly from the trailing edge, leading to the problem of 
abrupt stall that we've already discussed. As a result, Stratford-recovery airfoils have not found much applic- 
ation in aeronautical practice. Instead, designers developing airfoils for practical applications purposely back 
away from the Stratford recovery by varying amounts, that is, they make the recovery less concave, depending 
on how gentle they want the stall to be and how much of a penalty in maximum lift they are willing to pay for 
it. 

So far we've assumed that the boundary-layer development 1s all-turbulent from near the leading edge, a situ- 
ation Smith referred to as a turbulent rooftop. Now let's look at the case of a laminar rooftop. A run of laminar 
flow ahead of the start of the recovery, that is, a laminar rooftop, will result in a thinner boundary layer at the 
start of the recovery than would be there if the boundary layer were all turbulent. In terms of the canonical 
pressure distributions in Figure 7.4.13, we saw that a thin boundary layer at the start of the recovery tends to 
delay separation. In terms of the conventional pressure distribution, as in Figure 7.4.14, the thinner boundary 
layer would allow either a higher suction level in the rooftop or a longer rooftop, either of which would increase 
maximum lift. Figure 7.4.17, from Smith (1975), illustrates this effect by comparing conventional maximum-lift 
pressure distributions with laminar and turbulent rooftops. These examples happen to show Stratford recover- 
ies, but the effects of a laminar rooftop are reasonably generic. For a given recovery-point location, a laminar 
rooftop allows a higher suction level, and for a given suction level, a laminar rooftop allows a longer rooftop. 
In view of the potential for very high maximum lift coefficients, Liebeck (1973) assumed a laminar rooftop for 
several of his airfoil-design examples. For practical applications, a drawback to designing an airfoil for a lam- 
inar rooftop at the maximum-lift condition is that if the airfoil surface becomes contaminated, say by rain or 
insects, and the flow trips early to turbulent, the maximum-lift capability will be greatly reduced. 


Figure 7.4.17 Conventional maximum-lift pressure distributions with Stratford recoveries at a chord Reynolds 
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To summarize, we've seen that high maximum lift in a single-element airfoil favors concave and even 
Stratford-recovery pressure distributions, and a laminar rooftop. In airplane applications, however, such features 
are to be avoided for handling characteristics and safety reasons, and besides, single-element airfoil design is 
usually dominated by considerations in other parts of the flight envelope. For many airplanes, maximum lift for 
takeoff and landing is enhanced by high-lift devices with slots, which brings us to our next topic. 


7.4.4 Multielement Airfoils and the Slot Effect 


Maximum-lift capability can be greatly enhanced if the airfoil has multiple elements with favorably configured 
slots between them. Figure 7.4.18 shows how slots are typically arranged on airfoils with slotted trailing-edge 
and leading-edge high-lift devices and between the mainsail/headsail combination often used on sailboats. Note 
that the slot widths (gaps) used in aeronautical applications are typically much smaller, relative to overall chord, 
than those used in cloth sails. As we'll see, however, the physical mechanism for the effectiveness of the slot is 
largely the same in both cases. 


Figure 7.4.18 Sketches of multielement airfoils with slots, not to scale. (a) Single-slotted trailing-edge flap, as 
on many light airplanes. (b) Double-slotted trailing-edge flap and leading-edge slat, as on the Boeing 757 and 
Boeing 737NG. (c) Triple-slotted trailing-edge flap and a flexible (variable camber) leading-edge Krueger 
flap, as on the Boeing 747. (d) A mainsail/ headsail combination 


(a) \ C ® 
y — (b) 


\ 


(c) (d) 


For a slot to be effective in enhancing maximum lift, the general pattern of flow through it must be as shown 
in Figure 7.4.19, in which the flow passes smoothly through the slot, from the lower surface to the upper sur- 
face, and is directed along the upper surface of the aft element. A smoothly contoured slot as shown in part 
(a) naturally facilitates such flow, but even if the slot is preceded by a sharp-cornered cove, which is common 
in practical applications in which the slot opens when a high-lift device is deployed, the flow under high-lift 
conditions usually reattaches to the lower surface ahead of the slot, as shown 1n part (b). In both the smoothly 
contoured slot and the coved slot, much of the air passing through the slot is not in the boundary layers and 
has freestream total-pressure. This part of the flow through the slot and extending downstream in the region 
between the boundary layer on the aft element and the wake of the forward element is often referred to as a 
potential-flow core, or potential core, for short. The potential core can be thought of as ending where confluence 
occurs between the wake of the forward element and either the boundary layer or the wake of the aft element, 
as shown in Figure 7.4.19c,d. Identifying the beginning of confluence 1s a bit like identifying the “edge” of a 
turbulent boundary layer; both are difficult to define precisely in practice. 


Figure 7.4.19 Typical features of the flow through an effective high-lift slot. (a) Smoothly faired slot. (b) Slot 
with a sharp-cornered cove with flow reattachment upstream. (c) Confluence of the wake of the forward ele- 
ment with the boundary layer of the aft element. (d) Confluence of the wake of the forward element with the 
wake of the aft element 
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Historically speaking, explanations for how the s/ot effect enhances maximum lift have evolved considerably. 
Early on, the flow through the slot was typically seen as a wall jet that “energizes” the boundary layer down- 
stream and prevents or delays separation on the downstream element, in the way we discussed in Section 4.5.3 
(see also Prandtl and Tietjens, 1934, section 93). For the small gaps typical of aeronautical applications, this 
view has some merit. After all, the entire viscous layer downstream of the slot, which now includes the bound- 
ary layer on the aft element, what remains of the potential core of the slot flow, and the wake of the forward 
element, is still a boundary layer in the general sense, and the addition of the potential core does constitute a 
sort of “energizing” of the flow, relative to what would be there without the slot. The potential core does have 
higher energy than the rest of the boundary layer, albeit not higher than the freestream. But this view of the slot 
flow turns out to be disappointingly limited in what it can explain. 

Much of what is known about the performance of wall jets in preventing separation applies to the case of an 
active jet in which the total-pressure in the jet core is well above that of the freestream (farfield). Active-wall- 
jet blowing tends to be implemented with a relatively narrow slot, high total-pressure in the jet, and a high jet 
velocity. In this situation, the jet mass flux required to control downstream flow separation tends to be relatively 
small, and momentum flux alone usually does a good job of correlating performance, as we noted in Section 
4.5.3. On the other hand, a passive wall jet, in which the potential core is only at freestream total pressure, is not 
so simple and should not be expected to perform in a way that correlates neatly with simple jet-flow parameters. 
The sorts of relatively simple trends that can be seen in the behavior of ordinary boundary layers, as in Figures 
7.4.13 and 7.4.15, don't apply to passive wall-jet flows either. If we are to really understand the effectiveness of 
passive slots, we must go beyond viewing the slot flow as a simple wall jet, even for cases with small gaps. And 
clearly, physical intuition indicates that the wall-jet model doesn't apply to the large gaps used with sails. 





How the slot effect really works becomes much clearer if we view the boundary layers on the individual air- 
foil elements as ordinary individual boundary layers, a view that is valid as long as the potential core of the slot 
flow persists, and retains some validity even after some confluence has occurred. Looking at the problem in this 
way was first popularized in Smith's (1975) Wright Brothers Lecture. To the extent that this view holds true, it 
is logical to think of each airfoil element as an individual single-element airfoil, as far as its own boundary-layer 
development is concerned, with the only influence of the other airfoil elements coming through their effects on 
the pressure distribution. So our focus has shifted from seeing the slot effect primarily as a viscous-flow phe- 
nomenon (the wall-jet view) to seeing it in terms of a combination of viscous effects and the largely inviscid 
interaction by which the airfoil elements influence the pressure distributions imposed on their neighbors’ bound- 
ary layers. 

We can now look at the problem of how much lift an element of a multielement airfoil can carry in the same 
way that we looked at the maximum lift of a single-element airfoil in Section 7.4.3. There we saw that the main 
limitation on maximum lift is imposed by how much pressure rise (or velocity drop) the boundary layer on the 
suction side can withstand. We also saw that the boundary layer's ability to withstand the pressure rise is in- 
creased if the boundary layer is thin when it starts into the pressure rise. The slot effect works on both of these 
things: the amount of pressure rise imposed on each element, and the thickness of the boundary layer starting 
into the pressure rise. 

So a slotted configuration enhances maximum lift by doing two things: 


1. Starting a fresh upper-surface boundary layer at the leading edge of each element. For elements after 
the first, this means a thinner boundary layer at the start of the pressure rise than would be there if there 
were no slot. 


2. Reducing the pressure rise imposed on the suction-side boundary layer of each element. Slots do this in 
either of two ways, or both simultaneously, depending on the situation of the given airfoil element. When 
there is an element ahead, it can provide leading-edge suction-peak suppression, essentially by providing 
some flow turning ahead of the leading edge of the current element, so that the flow doesn't rush around 
the leading edge as fast as it otherwise would. When there is an element behind, it can provide trailing- 
edge dumping-velocity elevation, effectively by placing the trailing edge of the current element in a high 
velocity region near the leading edge of the trailing element. Reducing the velocity at the leading edge of 
an element and elevating the velocity at the trailing edge both reduce the total velocity drop the boundary 
layer is subjected to. 


These mutual-influence relationships by which the elements favorably affect each others' pressure distribu- 
tions can be seen clearly in the pressure distributions of typical high-lift configurations. An example with a 
leading-edge slat and a double-slotted trailing-edge flap is shown in terms of the conventional pressure coef- 
ficient in Figure 7.4.20a. The slat does not have an element ahead of it to suppress its suction peak, but the 
downward deflection of the slat reduces the peak compared with what would occur on an airfoil without a de- 
flected leading-edge device. The main element and the trailing-edge flaps have their suction peaks suppressed 
by the elements ahead of them. The trailing-edge dumping velocity on the slat is considerably elevated over 
what would occur on an isolated airfoil. Moving aft in the system, dumping velocities decrease until that on the 
aft-most trailing-edge flap reaches a level typical of an isolated single-element airfoil. Successive leading-edge 
peak velocities also decrease systematically. When we put these pressure distributions in canonical form, as in 
Figure 7.4.20, we see that each element withstands a pressure rise comparable to what might occur on a single- 
element airfoil, which tends to support our view of the elements as separate airfoils. 


Figure 7.4.20 Geometry and pressure distribution of a four-element high-lift configuration. (From Smith, 
1975; Figures 17 and 18.). (a) In terms of conventional pressure coefficient. (b) In terms of canonical pressure 
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Looking at the boundary layers on the individual elements in this way provides us with much of our explana- 
tion for the effectiveness of the slots, but of course the flowfield around a multielement system is more complex 
than the flow around individual airfoils, and there are additional issues and limitations. While the boundary lay- 
ers on all elements but the last benefit from elevated trailing-edge dumping velocities, the wakes that leave these 
trailing edges must complete the remainder the pressure recovery, reaching freestream pressure or slightly high- 
er before they leave the system. In other words, the wakes of forward elements must withstand adverse pressure 


gradients as they flow over the upper surfaces of the aft elements. A wake is typically more resistant to flow 
reversal in an adverse pressure gradient than a boundary layer 1s, so that flow reversal in the wake is not a com- 
mon occurrence. But even if a wake doesn't have reverse flow, it can adversely affect the maximum lift of the 
system. Its displacement thickness in an adverse pressure gradient can become quite large, and it can influence 
the pressure distribution and separation behavior of the airfoil element it is flowing over. And unless the slot 
gap upstream is large, the wake will merge to some extent with the boundary layer of the element underneath. 
If too much of this boundary-layer confluence takes place, the separation resistance of the entire viscous layer 
will be reduced. Thus the choice of slot gap is a trade: Greater suction-peak suppression favors a smaller gap, 
but boundary-layer confluence penalizes a smaller gap. 


The wakes of forward elements, and the interaction of the wakes with the separation behavior of aft elements, 
can affect the lift curve of a multielement airfoil in various ways, and it is safe to say that the possibilities are not 
completely understood. For example, there is some evidence from 2D RANS solutions and from experiments 
that some three-element airfoils (slat, main element, and single-slotted aft flap) can have a counterclockwise 
hysteresis loop in the lift curve like what we saw for a single-element airfoil in Figure 7.4.12a. In the multiele- 
ment case, this seems to involve toggling between the two basic flow states illustrated in Figure 7.4.21. In part 
(a), there is mostly attached flow on the aft flap, which imposes a strong pressure gradient on the combined 
wakes of the slat and main element, causing a large displacement thickness and perhaps reverse flow in some 
cases. In part (b), the pressure gradient is less severe, and the forward-element wake remains thinner, because 
the flow separates near the leading edge of the aft flap. A different scenario that doesn't necessarily involve 
a double-valued lift curve can lead to a so-called “inverse-Reynolds-number effect,” in which an increase in 
Reynolds number results in a thinner forward-element wake, which increases the adverse pressure gradient 1m- 
posed on the aft flap, causing the flow on the flap to separate earlier, thus reducing maximum lift. 


Figure 7.4.21 Illustration of flow states in the interaction of forward-element wakes with the separation on aft 
elements. (a) Mostly attached flow on aft flap and thick forward-element wake. (b) Separated flow on aft flap 
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Now let's look again at sail combinations like that shown in Figure 7.4.18d. These appear at first glance to be 
very different from aeronautical high-lift systems, in that the gap between elements is typically so much larger. 
But the mechanisms by which the slot does its job are much the same. This has not always been correctly un- 
derstood by the sailing community. According to older folklore among sailors, the slot directs a “high-velocity 
jet” of air along the lee (suction) side of the main (aft) sail to “energize” the boundary layer. This is reminiscent 
of the aeronautical community's older wall-jet interpretation of the slot effect, which we've seen is not a very 
good explanation in the case of small gaps, and which I've already argued is even less appropriate in the case 
of large gaps. The correct view of the slot effect in the case of typical sail configurations recognizes that the 


velocity through the slot is only modestly elevated above freestream, and that the main benefit the slot provides 
for the mainsail is suppression of the leading-edge suction peak. Gentry (1971) provided the sailing community 
with one of the earliest correct interpretations of the slot effect. 


7.4.5 Cascades 


A cascade is a stacked array of nominally identical airfoils. The common examples are turning-vane arrays, as 
used in wind-tunnel corners, and rows of compressor blades, turbine blades, and stators, as used in turboma- 
chinery. These arrays obviously serve a variety of purposes, but what they have in common is that they all in- 
volve turning the flow. For stationary arrays (turning vanes and stators), flow turning is the primary function, 
while for moving arrays (compressor and turbine blades), flow turning is the means to an end: imparting work 
or extracting work. In the case of moving blades, imparting or extracting work requires a force component in the 
direction of the blades' motion. The cascade must therefore change the component of the stream's momentum in 
that direction, which requires turning the flow. 

Because the blade elements of a cascade are usually not spaced far apart relative to their chords, the turning 
that can be accomplished is typically much greater than would be possible with isolated airfoils. The typical 
wind-tunnel turning-vane array, for example, turns the flow 90°, as shown in Figure 7.4.22. The reason such 
large turning angles are possible can be looked at in two ways. First, a global view 1s that the force generated by 
each blade is directly “responsible” only for turning a streamtube of limited depth, so that the required loading 
on each blade is not too large. A more local view is that in spite of the large camber that such blades typically 
have, loadings are kept low, and stalling is avoided, by the inviscid interference provided by neighboring blades. 





Figure 7.4.22 Wind-tunnel turning vanes for a 90° turn. Profiles sketched from Gelder et al., (1986); published 
by NASA 


The geometry of a cascade array repeats in a spatially periodic pattern, and for theoretical purposes, we can 
usually assume that the resulting flow pattern is also spatially periodic. To solve for the detailed flowfield, we 
need only solve for the flow around a single blade, with periodic boundary conditions representing the effects 


of the other blades, as shown in Figure 7.4.23a. The ISES airfoil code (Drela and Giles, 1987) has options for 
analyzing and designing cascade blades, based on this kind of treatment of a single element in the array. For 
viscous calculations, the inlet flow angle must be specified, which is analogous to specifying the angle of attack 
of a single airfoil. The outlet flow then depends on the solution for the flow around the blades. In inviscid cal- 
culations, either the inlet or outlet flow angle may be specified, but not both. 


Figure 7.4.23 Simplified views of a cascade. (a) Dealing with a single element and imposing periodic bound- 
ary conditions. (b) Illustration of a view of integrated performance in terms of uniform farfield flow, and the 
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For purposes of analyzing the integrated performance of an array, we can simplify things further by taking 
a control-volume view in which the disturbances from individual blades are assumed to have died out, and the 
entry and exit flows are taken to be uniform. Practically speaking, this requires moving only about one spa- 
cing length upstream and downstream of the leading and trailing edges of the elements. The details of the flow 
around the blades can then be ignored, and the cascade can be viewed as a control volume with a steady, uni- 
form farfield flow entering at a particular angle and speed, and a uniform exit flow leaving at a different angle 
and speed, as shown in Figure 7.4.23b. A streamtube passing through the cascade has its cross-sectional area 
changed by an amount that is dictated by the entry and exit angles, as indicated in the diagram. If the blades 
operate in the attached-flow regime, so that viscous losses are small, the flow can be approximated as inviscid. 
If we are in the reference frame in which the blades are stationary, the entry and exit flow conditions must then 
be related by the 1D isentropic flow relations we discussed in Section 3.11.1, and can be determined based on 
the area change only, and thus on the entry and exit angles only. Then given the flow conditions and the flow 
angles, we can apply conservation of momentum in control-volume form to calculate the forces per unit span 
on the blades in both the vertical and horizontal directions. 

In inviscid flow without shocks, this force is the cascade counterpart to the lift of a conventional airfoil. If we 
are in a reference frame in which the blades are moving, the isentropic relations do not apply because we must 
take into account the work done by the blades, and the exit flow conditions and the blade forces are then impli- 
citly related. Otherwise the analysis is the same as in the stationary-blade case. For details of such analyses, see 
Hill and Peterson (1992). 

At the control-volume level, when only the entry and exit flow conditions are known, the blade shape and 
details of the flow are not determined. Determining a blade chord and spacing and a detailed shape that would 
accomplish the specified flow turning is a design problem that usually has multiple solutions. As the chord is 
reduced relative to the spacing, however, the loading on each blade increases, and in the case of viscous flow, 
there is a limit beyond which no blade shape exists that can carry the required load. 


There is thus a minimum chord that the blades must have (relative to the periodic spacing) that depends on 
the required entry and exit flow conditions. The problem of determining the minimum chord is very similar to 
that of determining the maximum lift of a conventional airfoil, and the physics issues are essentially the same. 
In subsonic flow, the lift capability of a blade is limited by boundary-layer separation on the suction side, so 
that much of the discussion in Section 7.4.3 applies. In this regard, the pressure level at the trailing edge is very 
important, just as 1t was in the case of a conventional airfoil. The trailing-edge pressure of a cascade blade is 
closely tied to the exit pressure of the cascade, just as the trailing-edge pressure of a single-element convention- 
al airfoil is closely tied to freestream pressure. 

Because exit pressure plays such an important role, there is a substantial difference between the load-carrying 
capabilities of compressor cascades and turbine cascades. When the exit pressure is higher than the inlet pres- 
sure, aS in a compressor stage, the suction-side boundary layer 1s not able to recover from as low a pressure as it 
would otherwise be able to, and the load-carrying capacity of the blades is reduced. Another way to look at this 
is that the passages between blades are required not just to turn the flow, but to slow it down as well (1.e., to act 
as diffusers). An exit pressure that is lower than the inlet pressure, as in a turbine stage, has the opposite effect 
and enhances the load-carrying capacity of the blades. This is the main reason why turbine blades can sustain 
higher loadings than do compressor blades. In the design of gas-turbine engines, this difference is usually used 
to allow the turbine section to have far fewer stages than the compressor section. 


7.4.6 Low-Drag Airfoils with Laminar Flow 


We've already noted that most of the drag of an airfoil in attached flow consists of the integrated surface shear 
stress, or skin friction. Early in the history of our understanding of boundary layers, 1t became known that lam- 
inar skin friction is lower than turbulent skin friction under the same conditions, and that keeping the boundary 
layer laminar over part of the airfoil chord should lead to lower drag. And laminar-flow stability theory pre- 
dicted that a favorable pressure gradient would delay the growth of small disturbances in the boundary layer 
and thus should lead to a longer run of laminar flow. 

All of this was known by the late 1920s, but it was not applied immediately. The stability theory had not yet 
been tested against experiment, and it was not yet known what it would take to turn it into a practical transition- 
prediction method. Recall from Section 4.4.1 that stability-based e" transition prediction wasn't developed until 
the early 1950s (Smith, 1981). And in the 1920s airfoil design was still a cut-and-try art. 

But long before the em method was available, it had been observed that under the right conditions (a smooth 
airfoil surface and a low freestream turbulence level) transition often did not take place until the favorable pres- 
sure gradient at the front of the airfoil ended, and it was reasonable to expect that extending the favorable pres- 
sure gradient would extend the run of laminar flow. In the 1930s, a group at the NACA, under the direction of 
Eastman Jacobs (see Anderson, 1997), carried out a long program of airfoil development aimed at putting this 
idea into practice. This was one of the earliest practical applications of inverse design (see Section 7.4.10), in 
which an airfoil is purposely designed to produce a particular pressure distribution. The objective of this effort 
was of course to extend the regions of favorable pressure gradient and produce long runs of laminar flow. Ja- 
cobs' group also developed a low-turbulence wind tunnel in which to test the new airfoils. 

In the earlier attempts that resulted in the NACA 2-through-5 series airfoils, the designs were carried out 
by approximate methods. The resulting airfoils demonstrated long runs of laminar flow at some conditions, but 
the approximate design methods did not control the pressure distribution near the leading edge well enough to 
produce good performance over adequate ranges of lift coefficient (see Abbott and von Doenhoff, 1958). 

Then in the late 1930s, Jacobs' group undertook the design of the NACA 6-series airfoils, using a more ac- 
curate design method. The inverse design was actually applied only to a family of thickness distributions. These 


were then combined with various mean lines designed by linear theory to produce uniform load and therefore 
not grossly change the pressure gradients that had been designed into the thickness distributions. Each thickness 
distribution was designed to produce a favorable pressure gradient back to some intended point on the chord 
and to maintain the favorable gradient on the suction side up to a design angle of attack. Thus each thickness 
distribution had a range of angle of attack over which the pressure gradient would be favorable, and over which 
a low-drag bucket could be expected in the airfoil's drag polar. 

In wind-tunnel testing of the 6-series airfoils, the expected runs of laminar flow and resulting low drag levels 
were verified. Figure 7.4.24 shows the shape and calculated inviscid pressure distributions of a 6-series sym- 
metrical airfoil, 632-015. Compared with a conventional airfoil of that time, the nose radius is smaller, and the 
maximum thickness is farther aft. The purposely designed region of favorable pressure gradient is clearly seen 
in the pressure distribution. The low-drag bucket in the measured drag polar can be seen in Figure 7.4.25. The 
width (range of C7) of the low-drag bucket depends on design details but is constrained by airfoil thickness. 
Making the bucket wider generally requires a thicker airfoil, which exacts a penalty in drag level. There is thus a 
trade between the width and the depth of the bucket. Adding camber (a cambered mean line) to the airfoil shape 
moves the low-drag bucket to higher lift coefficients, but because the width of the bucket depends primarily on 
the thickness distribution, the width tends to stay about the same. Drag polars are shown for some cambered 
6-series airfoils in Abbott and von Doenhoff (1958). 


Figure 7.4.24 Shape and calculated inviscid pressure distribution of the symmetrical NACA low-drag laminar- 
flow airfoil, 632-015. From Abbott and von Doenhoff, (1958). Used with permission of Dover Publications, 
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Figure 7.4.25 Measured drag polars for the symmetrical NACA low-drag laminar-flow airfoil, 632-015, show- 
ing the low-drag bucket. From Abbott and von Doenhoff, (1958). Used with permission of Dover Publications, 
Inc. 
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The success of the 6-series airfoils in the wind tunnel led to attempts to exploit their low drag levels on sever- 
al military airplanes during the World War II years, the North American P-51 Mustang being the prime example. 


Achieving the long runs of laminar flow for which the airfoils were designed, however, requires low levels of 
surface waviness and roughness, something that was not routinely achieved with the conventional metal con- 
struction used on wings at the time. In these applications, the 6-series airfoils probably did not reach the low 
drag levels seen in the wind-tunnel data. True laminar-flow drag levels were probably not routinely achieved in 
practice until laminar-flow airfoils were used on high-performance gliders, beginning in the 1950s. 

As we discussed in Section 4.4.1, the verification of the stability theory wasn't widely known until after 
World War II, the e™ transition-prediction method wasn't developed until the early 1950s, and it wasn't widely 
used until the 1970s. This was not much of a hindrance to the early development of laminar-flow airfoils be- 
cause successful airfoils could be developed without detailed stability calculations. On a wing without much 
sweep, over a wide range of practical Reynolds numbers from about a million to more than 10 million, a mod- 
erate favorable pressure gradient generally suffices to enable a long run of laminar flow, with transition at or 
near the end of the region of favorable gradient. When a wing has substantial sweep, however, the situation is 
more complicated and tends to require more analysis, including detailed stability calculations, as we'll discuss 
in Section 8.6.4. 

On airfoils like the 6-series, designed for low-drag laminar flow at moderate lift coefficients, the favorable 
gradient usually disappears at higher angles of attack, and transition moves forward, well before maximum lift 
is reached. Thus premature loss of laminar flow due to poor surface quality or contamination does not usually 
entail a reduction in maximum lift, unlike the Liebeck airfoils designed for maximum lift with laminar flow that 
we discussed in the Section 7.4.3. The small leading-edge radius typical of low-drag laminar airfoils tends to 
produce a pronounced suction peak at the leading edge at high angles of attack, however, and thus can reduce 
the maximum lift dictated by turbulent stall. 

In the years since the 6-series airfoils were designed, the technology of low-drag laminar-flow airfoils has 
become much more sophisticated, especially for high-performance gliders. Part of the focus of these develop- 
ments has been to ensure that transition happens where it is wanted and 1s not delayed so far that laminar sep- 
aration occurs before transition. Features like a “transition ramp,” which is a region of mild adverse pressure 
gradient carefully designed to trigger transition before the start of the hard aft recovery, have been used. Other 
airfoils have been designed to make use of roughness strips to trip the flow (Althaus, 1981), or a “pneumatic 
turbulator” (Horstmann and Quast, 1982), a row of tiny air jets that are supplied by a passive ram-air inlet and 
can be turned on over the range of C; where artificial tripping is desired and can be sealed closed when a longer 
run of laminar flow is desirable. Laminar flow on swept wings poses additional problems that we'll discuss in 
Section 8.6.4. 


7.4.7 Low-Reynolds-Number Airfoils 


Though low-Reynolds-number airfoils cover a wide spectrum, the single issue of laminar separation is pervasive 
across the entire range. Laminar separation 1s dealt with by a variety of strategies, from simply living with it 
to taking purposeful measures to prevent it or to control its effects. At the low end of the spectrum are rubber- 
powered indoor model airplanes that use cambered-membrane airfoils at chord Reynolds numbers in the range 
of 3000-6000. In this regime, the laminar boundary layer on the upper surface typically separates at around 
70% of the chord and does not reattach, and maximum sectional L/D is around 12 at a C; of 0.8—0.9. At the 
high end are airfoils designed for human-powered airplanes, as, for example, the MIT Daedalus (Drela, 1988), 
on which the chord Reynolds number on the inboard wing was around 500 000, with a maximum sectional L/D 
of around 138 at a C; of 1.5. This Reynolds number is still well below those for which conventional low-drag 
airfoils are usually designed, and transition tends not to happen ahead of laminar separation unless artificial 
tripping is used. To avoid having to install tripping devices on the delicate plastic-film covering of the Daedalus 


wing, Drela chose to accept the presence of a laminar separation bubble on the upper surface and to use the 
bubble, in effect, as the tripping mechanism. Drela found that through proper design of the pressure distribution, 
the bubble could be kept short, and the drag penalty associated with the presence of the bubble could be kept 
small. The “signature” of the separation bubble on the upper surface (the kink in the pressure distribution, with 
a short plateau preceding it, and a precipitous pressure rise following) can be seen in the pressure distributions 
in Figure 7.4.26. 


Figure 7.4.26 Shape and pressure distribution of DAE 11 designed by Drela (1988) for the wing center section 
of the Daedalus human-powered airplane, showing the indication of a laminar-separation bubble on the upper 


surface. From Drela, (1988). Used with permission of AIAA. XFOIL calculation by Steven R. Allmaras 
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7.4.8 Airfoils in Transonic Flow 


As Mach number increases from zero, the pressure disturbances produced by an airfoil, when expressed in terms 
of the conventional Cp, increase gradually, but the general character of the pressure distribution does not change. 
Then when the freestream Mach number is high enough that the peak local velocity outside the boundary layer 
approaches sonic, the physics begins to change character. As the local Mach number passes through sonic, the 
local relationship between pressure and streamtube area reverses, as we saw in connection with 1D isentropic 
flow in Section 3.11.1, and the airfoil pressure distribution changes dramatically. 

Figure 7.4.27a shows this progression for the NACA 4412, a conventional airfoil not designed for perform- 
ance at transonic speeds, illustrated in terms of pressure distributions at several Mach numbers at the same mod- 
erate C; of 0.72, as calculated by the MSES code (Drela, 1993). At low freestream Mach numbers, the upper- 
surface suction peak is broad and centered at about 20% chord. As the freestream Mach number increases and 


the local velocity at the suction peak approaches sonic, the suction peak moves aft. Sonic Cp values for the dif- 
ferent Mach numbers are indicated by horizontal dashed lines. The freestream Mach number at which the peak 
local velocity first hits sonic is often referred to as the critical Mach number. As speed increases further, a re- 
gion of supersonic (“supercritical”) flow forms, terminated by a shock, as illustrated in Figure 3.11.la. As the 
strength of the shock increases, the transonic drag rise begins. For low-speed airfoils like this one, the margin 
between the critical Mach number and the beginning of drag rise is small. At still higher Mach numbers than 
those shown in this series of solutions, the shock becomes strong enough to cause separation, dramatically re- 
ducing maximum lift. Note that the high velocities preceding the shock are concentrated over the mid portion 
of the airfoil chord. 


Figure 7.4.27 Progression of airfoil pressure distributions from incompressible to the transonic regime at con- 
stant lift coefficient of 0.72, R = 107, calculated by the MSES code (Drela, 1993). Sonic Cp values for the dif- 
ferent Mach numbers are indicated by horizontal dashed lines. (a) NACA 4412, representative of older con- 


ventional airfoils. (b) RAE 2822, representative of an early generation of modern transonic airfoils 
-2.0 TMSES NACA 4412 





-?.0 TMSES RAE 2822 
V3.0 Mach 
0.300 
-1.5 0.580 
he sc, “nicse haba ca amaes Samet 
CG i * = 0.660 
ee eee Sa 2 ee a ey ee . 
<1 f= = = 5 = Pe ee ee acer Ole 
___ Ne EEE cen 472712 720.720 
| 3 } “0.740 
-0.5 
0.0 
| 
0.5 


1.0 = 





Modern transonic airfoils delay the drag rise to well above the critical Mach number by delaying the rapid 
increase in the local Mach number ahead of the shock. This requires shifting some of the lift from the midsec- 
tion to the front and the rear, which means making the pressure distribution at low Mach numbers “peaky” at 
the front and introducing an increased degree of “aft loading.” Figure 7.4.27b shows the calculated progression 
of the pressure distribution for an airfoil representative of an early generation of this general design approach, 
the RAE 2822 (Cook, McDonald, and Firmin, 1979), an airfoil with the same 12% maximum thickness as the 
NACA 4412. Note that the peak upper-surface velocity remains near the leading edge well into the transonic 
regime rather than being located near mid-chord, as it was on the NACA 4412. The shock first appears well 
forward and then moves aft rather than first appearing near mid chord and not moving much, as was the case 
with the NACA 4412. 

Richard Whitcomb of NASA developed some of the earliest airfoils of this type. His first such airfoil was 
designed to operate with the shock quite far aft and incorporated a slot in a manner similar to a high-lift slotted 
flap, but smoothly faired as in Figure 7.4.19a. Subsequent airfoils aimed for a similar overall pressure-distri- 
bution architecture, but were unslotted (Whitcomb, 1974). For this general design approach, Whitcomb coined 
the term “supercritical airfoil,” referring to the fact that the drag rise was delayed well past the critical Mach 
number. Whitcomb's supercritical airfoils were more aggressively aft-loaded than the RAE 2822 example we 
looked at in Figure 7.4.27b. 

The formation of a shock in the flow field causes drag to increase through two mechanisms: 

1. There is the dissipation in the shock itself, which makes itself felt as an increase in pressure drag at the 
airfoil surface. 


2. The increase in pressure through the shock, even when it does not cause separation, thickens the bound- 
ary layer as it passes through the shock, which also increases its rate of growth from the shock aft, and 
increases the pressure drag due to the boundary-layer displacement effect. 


The results of this can be seen in Figure 7.4.28, which shows the drag-rise curves of the NACA 4412 and 
the RAE 2822. At a constant C; of 0.72 for which we compared the pressure distributions in Figure 7.4.27, both 
airfoils maintain a reasonably low and nearly constant drag level across the low-Mach range, until a point 1s 
reached for each airfoil where the drag rise begins, gradually at first, and then quickly becoming quite steep. 
For the NACA 4412, the drag rise begins at a Mach number of about 0.60, while for the RAE 2822 it is delayed 
to about 0.68. Over the first few tens of counts of drag rise, shock drag typically accounts for roughly two thirds 
of the increase over the low-Mach drag level. The other roughly one third is accounted for by an increase 1n vis- 
cous drag associated with the increasing punishment imposed on the boundary layer as the shock gets stronger. 


Figure 7.4.28 Drag-rise curves for NACA 4412 and RAE 2822 airfoils at C7 of 0.72, calculated by the MSES 
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We've seen how transonic flow brings with it a huge change in the pressure distribution on an airfoil. It also 
brings a dramatic increase in the sensitivity of the pressure distribution to small changes in the effective shape of 
the airfoil, and thus much more sensitivity to the boundary-layer displacement effect that we discussed in Sec- 
tion 4.1.3 and that we saw illustrated for subcritical airfoil flow in Figure 7.4.5. We make a similar comparison 
for transonic flow in Figure 7.4.29. The boundary layer displacement thickness is largest on the upper surface 
near the trailing edge, where it affects the circulation and the overall lift, and it does so much more strongly than 
it did in the subcritical case in Figure 7.4.5. 


Figure 7.4.29 Effect vs boundary-layer displacement on the pressure distribution of the RAE 2822 airfoil in 


transonic flow, R = 10/ ,M =0.74, a = 2.221°, calculated by the MSES code 
Pressure jumps if shocks were normal 





The interaction between the shock and the boundary layer plays an important role in the physics of transonic 
airfoil flows. We can see some of the effects of this interaction in the pressure distributions through the shocks 


in Figure 7.4.29: 

1. The pressure rise through the shock is “smeared” over some distance along the airfoil surface. The 
pressure rise thickens the boundary layer, as we've already mentioned, and the thickening of the bound- 
ary layer acts like adding a curved “ramp” or “wedge” to the effective shape of the airfoil, which has the 
effect of making the shock “fan out” at its base into compression waves (Figure 7.4.30) and smearing the 
pressure rise over a distance roughly proportional to the incoming boundary-layer thickness. This smear- 
ing of the shock pressure rise is clearly visible in the surface pressure distribution in the viscous case in 
Figure 7.4.29. 

2. The pressure downstream of the shock is not as high as that which a normal shock would produce at 
the same upstream Mach number. This is another result of the effective viscous “ramp” or “wedge” asso- 
ciated with the thickening of the boundary layer. In addition to causing the shock to fan out; the “wedge”’ 


causes it to become oblique, so that the normal-shock relations don't apply, and the post-shock pressure 
1 & 


is moved closer to the sonic level, ~ P 


, aS can be seen in the viscous case in Figure 7.4.29. 


Figure 7.4.30 Sketch of transonic shock/boundary-layer interaction. From Sirieix, Delery, and Stanewsky, 
(1980). Used with permission of Springer Science + Business Media 


M,>1.0 







Displaced B.L.Edge 





—_ = ja 


Sonic line 


Viscous sublayer 


F SFL, eee ee ae oe 
ere ir Sess Highs Sait MTree ai 


Other interesting aspects of transonic airfoil shocks can be seen in Figure 7.4.29. The shock pressure jumps 
that would have occurred, had the shocks been normal shocks with the upstream Mach numbers Mj, consist- 
ent with the pressure distributions, are indicated by arrows. In the viscous case, the pressure rise falls far short 
of the normal-shock value because of the “wedge” effect we just discussed. In the inviscid case, the pressure 
rise falls a little bit short, but for a different reason. A shock impinging on a smooth surface in inviscid flow 
must impinge normal to the surface, and the pressure rise should be the normal-shock pressure rise. But when 
the surface is curved, there is a singularity at the point where the shock impinges (the Zierep singularity, see 
Oswatitsch and Zierep, 1960), and the flow downstream of the shock very quickly reexpands to a higher Mach 
number. The numerical solution for the inviscid case in Figure 7.4.29 didn't have enough resolution to show the 
singular dip to the normal-shock value. There is also some numerical smearing of the shock in the inviscid case. 

In the viscous shock/boundary-layer interaction, the downstream pressure varies somewhat as a function of 
the strength of the shock. By examining experimental data, Jou and Murman (1980) concluded that the down- 
stream pressure corresponds closely to that of an oblique shock on a wedge with the maximum deflection angle 
for an attached shock (see Section 3.11.2 for a discussion of oblique shocks). Lynch and Johnson (1987) pro- 
posed a similar kind of correlation for the downstream pressure, with a slightly different result. The two are 
compared in Figure 7.4.31 in terms of the inviscid Mach number M2 downstream of the shock inferred from 
measured surface pressures. It is clear that transonic airfoils don't “shock down” to anywhere near the normal- 


shock level, and for the weak-shock range that is of interest in cruise (M1 < 1.20) the downstream Mach number 
M2 will be between 0.95 and 1.00. 


Figure 7.4.31 Correlations of downstream Mach number in transonic shock/boundary-layer interactions. 


(From Jou and Murman, (1980 (“Max a a and —__ and Johnson, 1987) 
1.0 





Lynch and Johnson 


0.95 


0.9 
3 
= 
0.85 
0.8 
0.75 
1 1.05 1.1 1.15 1.2 1.25 1.3 
M, 


As either Mach number or angle of attack increases, the shock generally becomes stronger, and when it is 
strong enough it causes immediate separation of the boundary layer. Because the pressure rise through the shock 
is smeared over a distance that scales roughly with the thickness of the incoming boundary layer, the severity of 
the pressure gradient through the shock varies so as to compensate for the effects of Reynolds number, with the 
result that shock-induced separation has practically no Reynolds-number dependence. Separation first appears 
at an upstream Mach number Mj very close to 1.30, practically independent of Reynolds number, and with only 
a slight dependence on the shape factor of the incoming boundary layer. For M1 between 1.30 and 1.40, the 
boundary layer reattaches to the airfoil surface, and the separation is thus limited to a local bubble beginning at 
the base of the shock, as illustrated in Figure 7.4.32. For Mj above 1.40, there is usually no reattachment, and 
the final flow pattern in Figure 7.4.32 is generally associated with severe buffeting. A more detailed discussion 
of transonic shock/boundary-layer interaction can be found in Sirie1x, Delery, and Stanewsky (1980). 


Figure 7.4.32 Illustration of the progression of separation at the foot of the shock on an airfoil. (a)M1 <= 
1.30: Attached flow. (b) 1.30 <M, <= 1.40: Separation bubble at foot of shock. (c) M1 > 1.40: Separation 
without reattachment 





(a) 


The progression of separation at the foot of the shock described above can be complicated by interaction 
with separation starting at the trailing edge and moving forward. Because these two types of separation can pro- 
gress either separately or simultaneously, the number of possible scenarios is large. Which one actually happens 
depends on the details of the particular airfoil's pressure distribution. Pearcey, Osborne, and Haines (1968) ex- 
haustively described the possibilities and divided them into those with separation at the foot of the shock only 
(Model A) and those with aft separations or combinations (Model B). Their diagram of the possibilities is shown 


in Figure 7.4.33. 





Figure 7.4.33 Possible sequences in the development of separation at the foot of the shock and separation 
moving forward from the trailing edge, shown as successive stages as either Mach number or angle of attack 
increases. Turbulent boundary layers are assumed. From Pearcey, Osborne, and Haines, (1968). Published by 
AGARD 
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The problem of designing an airfoil for good performance under transonic conditions, for example, achieving 
a high sectional L/D at a high Mach number, is effectively that of designing for the highest C; at which a reas- 
onably low drag level can be had. Thus it is similar to the problem of designing a single-element airfoil for 
maximum lift in subcritical flow that we discussed in Section 7.4.3, where we saw that an optimum design rep- 
resents a tradeoff between increasing the magnitude of the negative Cp in the “rooftop” on the forward part of 
the chord and delaying the start of the pressure recovery. The limiting constraint in the general high-lift case 
was delaying boundary-layer separation to the trailing edge. In the transonic case, we have the same boundary- 


layer separation constraint, but we also have the requirement for “reasonably low” drag, which is effectively an 
active constraint as well. In particular, that means keeping shock drag under control over the required range of 
operating conditions while trying to maximize usable lift. 

It is possible to design an airfoil with the required region of supersonic flow and a shock-free recovery at one 
operating condition. But shock-free design has not proven to be practical because it provides low drag over too 
narrow a range of Mach and Cj. At conditions other than the design condition, a shock-free airfoil tends to have 
a stronger shock and significantly higher drag than a conventional transonic airfoil would. The conventional 
compromise is to design for an upper-surface pressure distribution with the general features illustrated in Figure 
7.4.34, including a weak shock, over the desired transonic part of the operating envelope. Shock drag increases 
rapidly for upstream Mach numbers above about 1.15—1.20, which effectively limits the value of negative Cp 
that can be tolerated just ahead of the shock. Note that a shock of this strength is not even close to causing sep- 
aration (recall Figure 7.4.32a). 


Figure 7.4.34 Illustration of desired features of an upper-surface pressure distribution for efficient transonic 
cruise 


Slightly sloping rooftop 


Weak shock 


slight re-expansion 


| Convex recovery 
> -0.5 | 








x/c 


Downstream of the shock, 1t seems to be advantageous to provide a slight re-expansion before the final aft 
pressure recovery begins. This increases lift and is probably also favorable to the boundary layer. Although the 
shock does not cause separation, it is sufficiently traumatic to the boundary layer that it seems to be helpful to 
provide the re-expansion to allow the boundary layer to “relax” before the start of the final aft pressure recov- 
ery. Downstream of the re-expansion, a slightly convex recovery is favored because it also increases lift. This 
differs from what we saw in Section 7.4.3, where maximum lift in subcritical flow favored a concave recovery. 
In that case, a concave recovery allowed more negative Cp at the start of the recovery, while in the transonic 
case, negative Cp is limited by shock drag. 

It is also favorable to design the supersonic rooftop with a slight built-in deceleration or adverse pressure 
gradient. This increases lift by allowing more-negative Cp near the leading edge, and it facilitates a benign pro- 
gression of shock position and strength with Mach and C7. When all these considerations are taken into account, 
it seems to be optimum to position the shock just aft of mid chord at the C; at which the maximum sectional 
L/D 1s desired at the targeted cruise Mach number. 


Practical considerations for design are discussed in Section 7.4.10. 


7.4.9 Airfoils in Ground Effect 


The presence of a ground plane below an airfoil alters the flow relative to free-air conditions at the same angle 
of attack. The effects generally become stronger with decreasing height above the ground plane, and the height 
must typically be less than about one chord for the effects to be large. 

In the simplest model, which should be valid for large h/c, the lift is represented by a vortex at the airfoil 
quarter-chord, and the effect of the ground is represented by an image vortex below the ground. The presence 
of the image reduces the effective freestream velocity seen by the airfoil from Us to a lower value Uegf and thus 
reduces the lift at a given angle of attack. If we ignore any change to the effective camber of the airfoil and 
assume that the airfoil shape and angle of attack determine I’/Ueff, we obtain the following expression for the 
lift relative to its free-air value at the same angle of attack: 

7.42 Ct! Choo = 1/[1 + (Cro) /(8th)]*. 

Note that the C7 ratio depends on both Cj and h/c, and that the effect disappears (the ratio goes to one) as either 
h/c — © or as Cja — 0. The predicted reduction in lift is plotted as the solid curve in Figure 7.4.35 for Cho = 


1.30. 


Figure 7.4.35 Effect of a ground plane on the lift of an airfoil at a fixed angle of attack. Comparison of the 
simple model of Equation 7.4.2 with MSES calculations for the NACA 4412 airfoil at a = 7.622° (C7 = 1.30 
out of ground effect), R = 10 million, M = 0.10, with viscous effects included on the airfoil surface but not on 
the ground plane 
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Of course, a real airfoil has bound vorticity distributed along the chord and has thickness, and thus the ground 
alters the effective camber and angle of attack as well as Uert. These effects will be more complicated and will 
not generally disappear as Cj~o — 0. As an illustration of what these more-realistic effects can look like, Figure 
7.4.35 also shows results calculated by the MSES code for the NACA 4412 airfoil, with viscous effects included 
on the airfoil surface but not on the ground plane. At the larger heights, the lift loss is roughly the same as that 
predicted by the simplest model, Equation 7.4.2. At the lower heights, the lift reduction reverses, and lift in- 
creases, presumably because the ground increases the effective camber or angle of attack. Such deviations from 
the simple model will of course depend on the chordwise lift distribution. And for small heights, the boundary 
layer on the ground plane could alter these results significantly. (Note that a local unsteady boundary layer will 


develop even on a stationary ground plane in zero wind, because of the local unsteady velocities produced as 
the airfoil passes by.) 

There is a widespread notion that ground effect produces an “air cushion” of high pressure under the airfoil, 
which increases lift. In the NACA 4412 example in Figure 7.4.35, lift was indeed increased for h/c less than 
about 0.3. Pressure distributions for three of these cases are plotted in Figure 7.4.36. At h/c = 1.95, there is a 
decrease in lift, coming from a pressure increase that is greater on the upper surface than on the lower surface, 
consistent with the simplified model that explains the effect in terms of a reduction in effective freestream velo- 
city. At the lowest h/c of 0.26, the pressure increases have grown larger on both surfaces, but the increase on the 
lower surface is now larger than that on the upper surface, so that there is a small lift increase. The change in 
loading is not concentrated toward the leading edge, and thus is consistent with a change more in the effective 
camber rather than in the angle of attack. From what we see in this case, the “air cushion” image isn't a bad one, 
as long as you keep in mind that much of the effect on lift from the lower surface is offset by a pressure increase 
on the upper surface that is almost as large as that on the lower surface. 


Figure 7.4.36 Effect of a ground plane on the pressure distribution on an NACA 4412 airfoil for three of the 


cases in Figure 7.4.35 
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In the case of a 3D wing, the 3D effects of the ground plane often overwhelm the 2D effects we've just dis- 
cussed. We'll look at the 3D effects on lift in Section 8.2.4 and the 3D effects on induced drag in Section 8.3.9. 


7.4.10 Airfoil Design 


Different mission performance requirements place different amounts of importance on aerodynamic perform- 
ance measures such as high maximum lift, low drag, and high-Mach-number capability. In our discussions so 
far, we've considered these aerodynamic objectives separately, looking at some of the purely aerodynamic con- 
siderations that apply to designing a low-speed airfoil for maximum lift, a low-drag airfoil with laminar flow, 
or an airfoil for high L/D at high subsonic Mach numbers. Of course, practical airfoil design should take into 
account much more than just these single objectives. An airfoil on a practical flight vehicle should perform well 
not just at one flight condition, but should provide favorable performance over the full range of conditions en- 
countered during the vehicle's mission. The aerodynamic design should therefore take into account multiple op- 
erating conditions, including those in which high-lift devices may be deployed. Then there are nonaerodynamic 


requirements. In addition to providing aerodynamic performance, an airfoil must enclose wing structure, fuel, 
and mechanical systems. Sometimes an airfoil optimized for purely aerodynamic considerations will already 
meet these other requirements, but usually not. In the usual case, the best shape is a compromise between aero- 
dynamic and nonaerodynamic objectives and constraints. 

In the early era from the 1800s to the 1930s, airfoil design was a cut-and-try affair, with the airfoil shape 
as the input, and the performance, as determined by wind-tunnel testing and flight experience, as the output. 
Progress was slow, and the performance levels that were achieved were not high by today's standards. 

We've seen that the lift and drag performance of an airfoil is largely determined by the behavior of the bound- 
ary layer, which means that the pressure distribution is hugely important. In the second era of airfoil design, 
from the 1930s to the 1990s, controlling the pressure distribution through inverse design (solving for the shape 
that yields a specified pressure distribution) led to major advances in the ability to target particular kinds of 
performance. An early example we already considered is the NACA 6-series airfoils designed for low-drag lam- 
inar flow. In the early years, the state of computational methods was such that inverse design was limited to 
incompressible inviscid flow, but this sufficed for low-Mach-number applications. It was not until the 1970s 
that methods became available for routinely computing inviscid transonic flows. Turning this capability into 
a practical design tool required adding viscous effects, through boundary-layer coupling, because the pressure 
distribution in transonic flow is very sensitive to small changes in the effective shape of the airfoil, as we saw 
in Figure 7.4.29. The application of viscous transonic inverse-design methods led to dramatic improvements in 
transonic airfoil performance. 

The inverse-design method in general entails some interesting difficulties. One has to do with the nature of 
the inverse problem itself, and that is that only particular target pressure distributions lead to realizable airfoil 
shapes that are not either open or crossed over at the trailing edge. One way of dealing with this problem is to 
allow the solution algorithm to alter the target pressures in some arbitrary way, for example, by adding a lin- 
ear ramp or some other distribution to the pressures, and to solve for how much of the alteration is needed to 
make the resulting geometry realizable. How close the resulting design comes to achieving the desired pressure 
distribution then depends on how close to realizable the target was. Another approach is called modal inverse 
design, which starts with a “seed” airfoil shape that is then modified by the addition of particular shape func- 
tions, or “modes,” whose amplitudes are solved for by optimization, so as to minimize some measure of the de- 
viation between the actual pressure distribution and the target pressure distribution. The shape-changing modes 
are purposely defined so as not to be able to “open” the trailing edge, and the resulting airfoil automatically has 
a realizable shape. Both of these methods of dealing with the realizability problem are available as options for 
doing inverse design in the ISES and MSES codes (Drela, 1993). 

Since the 1990s, CFD-based optimization has replaced inverse design as the method of choice for designing 
an airfoil, or at least for the final refinement of a design. The objective function that 1s minimized can include 
almost any weighted combination of aerodynamic performance measures and airfoil shape characteristics. The 
aerodynamic performance objective almost always includes multiple operating conditions, since airfoils optim- 
ized for only one condition usually perform poorly at “off-design” conditions. Constraints can also be imposed 
on the aerodynamic characteristics (pitching moment, for example) or on the shape (maximum thickness or 
some other measure of the structural fitness of the shape, for example). Often the things we might ordinarily 
think of as constraints can be handled equally well as part of the objective instead. 

With the advent of optimization, it might seem that airfoil design has become a “pushbutton” activity: Define 
the objective and the constraints, and turn the code loose to produce the best possible airfoil. But airfoil optim- 
ization is not usually that simple in practice. Numerical optimizers have an uncanny ability to find and exploit 
weaknesses in the formulation of the problem, often with the result that the optimized solution 1s practically 
useless. The weakness can be in the physical modeling, as in some flaw in the physical fidelity of the CFD 


method, or it can be flaw in the choice of objectives or constraints. It usually takes considerable skill to use 
airfoil optimization in a way that produces an excellent design for the intended purpose. 

Regardless of whether inverse design or optimization is used to generate a design, there are lessons that crop 
up repeatedly and seem to be universal. One is that the pressure distribution on the upper surface has much 
greater leverage over the performance (drag or maximum lift) than the pressure distribution on the lower surface 
does. In the case of drag, this is presumably because of the higher average velocity over the upper surface un- 
der lifting conditions. In the case of maximum lift, it is because boundary-layer separation on the upper surface 
is determined by the upper-surface pressures. Thus for either low drag or high lift, a deviation from the aero- 
dynamic optimum pressure distribution on the upper surface carries a heavier penalty than a similar deviation 
on the lower surface. The upper surface pressure distribution is therefore more or less “inviolable,” while the 
lower-surface pressure distribution 1s more “negotiable” and can be compromised, for structural or other non- 
aerodynamic reasons, with only minimal aerodynamic performance penalty. 


7.4.11 Issues that Arise in Defining Airfoil Shapes 


Defining the shape of an airfoil would seem to be a simple proposition: Either use analytic functions, as in the 
NACA 4- and 5-digit series, or use a string of defining points combined with some method of interpolating 
smoothly between them. But interesting issues can still arise that are worth discussing here. 

Decomposing an airfoil shape into a camber line, or mean line, and a thickness shape was common in much 
of the early work on airfoils, including the various NACA series. As we saw in Section 7.4.1, the aerodynamic 
effects of camber and thickness (and angle of attack) are exactly separable only in the linear limit of small sur- 
face slope. Still, 11 was common practice to design camber lines and thickness shapes separately as late as the 
NACA 6 series of low-drag laminar airfoils. 

The traditional way of lofting an airfoil by combining a camber line and a thickness shape is described in 
Abbott and von Doenhoff (1958). The leading and trailing edges are defined as the ends of the camber line, 
and the chord line is defined as the straight line joining the leading and trailing edges. The thickness shape is 
added normal to the camber line, as shown in Figure 7.4.37. The circle defined by the leading-edge radius has 
its center not on the chord line, but on a line drawn tangent to the camber line at the leading edge. Thus if the 
slope of the camber line at the leading edge is nonzero, the center of the circle will be displaced vertically from 
the chord line, and the nose of the airfoil will protrude slightly ahead of the “leading edge.” 


Figure 7.4.37 Traditional way of combining a camber line and a thickness shape. From Abbott and von Doen- 


hoff, (1958). Used with permission of Dover Publications, Inc. 
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There are two formal problems with this procedure: 


1. Some camber lines have infinite slope at the leading edge, in which case the procedure has to be 
“fudged” as explained in Abbott and von Doenhoff (1958), and 


2. An airfoil surface defined in this way can “double back” on itself if the half-thickness exceeds the local 
radius of curvature of the camber line, but this is something that practically never occurs. 


A more basic objection to the traditional recipe is that there's no real reason to use such a complicated pro- 
cess. It certainly isn't justified by linear theory, since for the small surface slopes assumed in the theory there's 
no significant difference between adding the thickness normal to the camber and simply adding the y coordin- 
ates. And for airfoils with practical thickness and camber, the difference is typically significant only in the 
neighborhood of the leading edge, where linear theory doesn't apply anyway. So it makes just as much sense 
theoretically, and much more sense practically, just to add the y coordinates. 

Now that most new airfoils are designed by CFD calculations, either by inverse design or optimization, as 
we discussed in Section 7.4.10, we tend not to deal with separate camber lines and thickness shapes anyway. If 
we forego the decomposition into camber and thickness, defining an airfoil shape 1s just defining a function y(x) 
that is double valued so as to have upper and lower surfaces. We generally like an airfoil to have a well behaved 
pressure distribution in subsonic flow, which dictates some desirable properties for y(x). In potential flow, a dis- 
continuity in either slope or curvature produces a singularity (infinite slope) in the pressure distribution (This 1s 
discussed in Section 9.2.1, and two cases are sketched in Figure 3.10.2b.,c). In real, viscous flow, a discontinuity 
in slope or curvature doesn't produce singular behavior, but it can still produce a large local pressure gradient, 
which is best avoided. It is thus prudent to require that the surface be continuous to the second derivative (C2). 
This is not a difficult requirement to meet. 

What can cause some difficulty, however, is handling the leading-edge region. The usual blunt leading edge 
has a combination of concentrated high curvature and a leading edge point where the slope 1s infinite, which is 
not that easy to represent with simple conventional curve-fitting procedures. 

A general method of representing body shapes has been proposed by Kulfan (2008) that provides one way 
of dealing with the leading-edge issue. The function y(x) defining the cross section of a 2D body like an airfoil 
is defined as the product of a class function that gives the body its basic character, and a shape function that 
defines the specific shape. The idea is that with the class function providing the basic blunt leading-edge shape, 
the shape function should be a function that is easier to curve-fit. A simple class function for an airfoil with a 
1 —x) 


blunt leading edge and a wedge trailing edge is v xt , where x goes from 0 to 1 from the leading edge 


to the trailing edge. The Vx provides a blunt leading edge with a finite leading-edge radius, and the (1 — x) 
provides a wedge trailing edge, as shown in Figure 7.4.38. The airfoil shape is then given by 


7.4.3 ¥ fi/x(1 = x} aa VreX; 


with separate shape functions fy and f7 for the upper and lower surfaces. Separate values of yTE for the upper 
and lower surfaces define the thickness of the trailing edge. The value f(1) determines the slope at the trailing 
edge, and f(0) determines the leading-edge radius. Thus to have continuity of curvature at the leading edge re- 
quires fy(0) = — f7 (0). Figure 7.4.39 shows the airfoil shapes and the functions f(x) defined by them by Equation 
7.4.3 for the symmetrical NACA 0012 and cambered NACA 4412 (lofted by adding y coordinates, not by the 
more elaborate procedure). For the symmetrical case, f(x) 1s well behaved and would be easy to curve-fit to high 
accuracy. In the cambered case, however, things are not so well behaved. Note that f(0) 1s the same regardless of 
camber, since the leading-edge radius is determined only by the thickness shape. But the NACA 44xx camber 
line has nonzero slope at the leading edge, which causes f(x) to have infinite slope there, though f(x) 1s reason- 
ably well behaved everywhere else. So when the camber line at the leading edge has nonzero slope, Equation 
7.4.3 doesn't completely eliminate the difficulty of fitting the leading edge, and special measures must be taken 
to handle the infinite slope there. 


Figure 7.4.38 The class function vx(1 — x) for fitting airfoil shapes according to Equation 7.4.3 
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Figure 7.4.39 Airfoil shapes for NACA 0012 and NACA 4412 and corresponding shape functions f(x) defined 


by Equation 7.4.3 
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Chapter 8 


Lift and Wings 1n 3D at Subsonic Speeds 


In this chapter, we extend the discussion of lift from 2D to 3D take up the topics of the flow around a 3D wing, 
the lift distribution on a 3D wing, the induced drag, wingtip devices, and the manifestations of lift in the extended 
flowfield. Finally, we'll delve into some of the interesting issues that arise when wings are swept. 


8.1 The Flowfield around a 3D Wing 


The flow around a 3D wing must differ in some basic ways from the flow around a 2D airfoil, simply because of 
the finite span and the resulting flow gradients in the spanwise direction. In this section, we'll first describe the 
general features of 3D wing flowfields, and then we'll look at how they can be explained. The classical approach 
(starting with the early work of Prandtl and others, see historical sketch by Giacomelli and Pistolisi, in Durand, 
1967a) looks at the distribution of bound vorticity and the vorticity in the wake and deduces the velocity field 
everywhere else using the Biot-Savart law. Though this yields correct results if the correct vorticity distribution 
is used, it is not a real physical explanation in the cause-and-effect sense, as we argued in Sections 3.3.9 and 7.2. 
So we'll also seek an explanation based on the local balance of force and acceleration, that is, the interaction of 
the pressure and velocity fields. We constructed explanations of this type for the generic flow around an obstacle 
in Section 5.1 and the flow around a 2D lifting airfoil in Section 7.3.3. Even in those relatively simple flow situ- 
ations, however, we found that qualitative arguments alone did not enable us to predict the flow a priori. Instead, 
we had to settle for explaining things “after the fact,” based on prior knowledge of what the pressure and velocity 
fields look like. In the case of a 3D wing, we will also have to rely heavily on prior knowledge of the flow struc- 
ture. 


§.1.1 General Characteristics of the Velocity Field 


The flow around a 3D wing 1s similar in some ways to the flow around a 2D airfoil, so to start the discussion, 
let's review the relevant features of the flow in the 2D case. In the flow over a lifting 2D airfoil, the velocity 
disturbances produced by the airfoil die out rapidly in all directions, including downstream. Downstream of the 
airfoil, the only significant velocity “signature” of the lift production 1s the downwash field, which carries a flux 
of downward momentum across any vertical plane, corresponding to half of the lift. With increasing distance 
downstream, this downwash spreads out rapidly in the vertical direction and becomes very diffuse, but the flux 
of downward momentum remains constant. As we discussed near the end of Section 7.3.4, the flow around a 2D 
airfoil in the inviscid case is reversible, in that an onset flow that is uniform in the limit far upstream becomes 
uniform again in the limit far downstream, and the same flow pattern and pressures would arise if the flow were 
run in the reverse direction. In a viscous flow in the attached-flow regime, the flow outside the boundary lay- 


er and wake still follows the reversible pattern quite closely. In addition to the near-reversibility of the gener- 
al flow pattern, there is also very little permanent vertical displacement of streamlines between upstream and 
downstream (none in the inviscid, shock-free case). And, of course, all these flow features are by definition uni- 
form in the spanwise direction. 

Now consider a 3D wing of finite span, with moderate-to-high aspect ratio, operating in the attached-flow 
regime. At any station along the span of the wing other than very close to the tips, the chordwise distributions of 
pressure and boundary-layer development are not much different from those of a 2D flow, and the streamlines 
projected in a longitudinal plane look qualitatively like the flow around a 2D airfoil, as illustrated in Figure 
8.1.1. But this projected view misses an important aspect of the 3D flow; that is, that the streamlines of the 3D 
flow don't generally lie in planes as those of a 2D flow do. A key part of what distinguishes the 3D flow from 
the flow around a 2D airfoil is a significant out-of-plane component to the motion. This out-of-plane component 
would be difficult to discern if it was shown in a perspective view like Figure 8.1.1, but it 1s important nonethe- 
less. 


Figure 8.1.1 Flow around a 3D wing viewed in terms of streamlines projected in a plane perpendicular to the 
span is similar to flow around a 2D airfoil 





We can visualize the 3D flowfield more clearly in terms of velocities projected in planes perpendicular to the 
freestream, as illustrated in Figure 8.1.2. This cross-stream velocity field develops in conjunction with a pres- 
sure field that is nonuniform in the spanwise direction. The general pattern is characterized by downward flow 
in the area between the wingtips, upward flow outboard of the tips, outboard flow below the wing, and inboard 
flow above the wing. Note that these lift-induced velocities are not concentrated closely just around the wing 
itself or the wingtips, but are spread fairly diffusely over a wide area of the flowfield. 


Figure 8.1.2 Flowfield around a lifting wing illustrated by velocity vectors 1n a cross-flow plane. This general 


flow pattern is well established around the wing itself and persists for long distances downstream 
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The streamwise development of the cross-stream velocity field in the 3D flow is quite different from anything 
in the development of a 2D airfoil flow. In the flow more than about one wingspan ahead of the wing, the velo- 
city disturbances are small and are distributed diffusely, as they are in the 2D case. As we approach closer to the 
wing, a pronounced upwash appears ahead of the leading edge, as in the 2D case. As we pass behind the leading 
edge and over the wing itself, the general flow pattern shown in Figure 8.1.2 becomes well established. Behind 
the wing, the flow pattern continues to evolve, with velocities increasing in parts of the field and decreasing in 
others, but continuing to look qualitatively like Figure 8.1.2. At a distance on the order of a wingspan behind 
the wing, the flow will have settled into an asymptotic pattern, and then it changes only very slowly over long 
distances downstream. This is a key distinction between 3D and 2D: While the velocity disturbances in the 2D 
case begin to decrease immediately behind the airfoil and become very small and very diffuse far downstream, 
the cross-stream velocity field that develops in 3D persists for very long distances downstream. Another way 
to look at this distinction is in terms of reversibility: Both the 2D and 3D cases start with a uniform onset flow 
upstream, but while the 2D flow becomes uniform again downstream, the 3D flow becomes nonuniform, with 
persistent transverse velocities even at very large distances downstream. It is therefore not even close to being 
reversible like the flow around a 2D airfoil. 

At the location of the wing itself, we have a well-established flow pattern in which the wing is flying through 
air that is already moving generally downward between the wingtips. Thus the wing can be thought of as flying 
in a downdraft, or downwash, of its own making. At the location of the wing itself, the downwash can in general 
vary considerably, both spanwise and chordwise. But on a high-aspect-ratio wing, we can simplify the picture: 
We can think of the downwash at the location of the wing as consisting of two parts, a 2D part that would be 
there if the local airfoil section were in a 2D flow at the same sectional lift (not the same geometric angle of 
attack), and a 3D part that is a result of finite span. In the limit of high aspect ratio, the 3D part of the downwash 
is constant along the chord at a given span station. The 3D downwash can thus be seen as a downward shift in 
the apparent angle of attack of each airfoil section along the wing, often called the induced angle of attack. For 
positive total lift, the effect of the induced angle of attack, integrated over the span, always corresponds to a 
reduction in the apparent angle of attack of the wing. 

One consequence of the apparent downdraft in which the wing is flying 1s that a 3D wing generally requires 
a higher geometric angle of attack to achieve the same lift coefficient as a corresponding 2D airfoil, a fact we'll 
make use of when we attempt to explain features of the 3D flowfield. And because the downwash increases 
with angle of attack and thus subtracts progressively more from the lift, the lift slope of a 3D wing is generally 
less than that of a 2D airfoil section. 


The other important result of the downwash 1s that the total apparent lift vector is tilted backward slightly. 
This backward component of the apparent lift 1s called induced drag, and the work done against it is reflected 
in the kinetic energy of the large-scale flow pattern. We'll discuss the physics and the theory of induced drag 
further in Section 8.3. 

In Figure 8.1.2, we saw that the spanwise velocity components behind the wing are in the outboard direction 
below the wing and in the inboard direction above the wing. There is thus a mismatch, or jump, in the spanwise 
velocity, and this jump constitutes a vortex sheet that 1s shed from the trailing edge and convected downstream. 
The development of this vortex wake is our next topic. The induced drag and the presence of vorticity that is 
convected downstream are both earmarks of the general irreversibility of the 3D flow pattern. 


8.1.2 The Vortex Wake 


The trailing vortex wake 1s a distinctive feature of the lift-induced flowfield, and it plays a prominent role in 
discussions of induced drag and in the quantitative theory. The nature of the vortex wake and its role in induced 
drag have been a source of some serious misunderstandings, so we'll take extra care in the following discussion 
to point out the common misconceptions, to help ensure that we develop a correct understanding. 

As we noted above, the vortex wake starts as a vortex sheet shed from the trailing edge of the wing as a 
byproduct of the establishment of the flow pattern shown in Figure 8.1.2. It is a necessary part of the flowfield 
because the wing cannot produce the general flow pattern of Figure 8.1.2 without also producing the jump in 
spanwise velocity between the streams that pass above and below the wing. Even if we model the flow around 
the wing as inviscid, a vortex sheet must be shed if the lift is nonzero. Milne-Thomson (1966, Section 3.3.1) de- 
scribes the shedding of a vortex sheet from a body in 3D inviscid flow as the “bringing together of layers of air 
which were previously separated, and which are moving with different velocities.” Of course, if a shed vortex 
sheet is wetted on both sides by air that has come from the freestream without any change in stagnation pressure 
or stagnation temperature, the velocity magnitude on both sides must be the same. So by “different velocities” 
Milne-Thomson means different flow directions. Farther along in the discussion, we'll attempt to explain how 
those different flow directions arise in the case of a lifting 3D wing. 

Milne-Thomson's quote above provides interesting food for thought and merits a little digression. He's given 
us an intuitively appealing way to think of what's happening when the vortex sheet leaves the trailing edge, but 
“previously separated” in this context is ambiguous. If all it means is that the layers of air were not together 
prior to being joined, then there's no problem. But it could also be taken to imply that the layers of air coming 
together at the trailing edge were separated from each other at some location upstream, presumably where they 
attached to the wing at the attachment line. This more specific meaning wouldn't be precisely correct. On a wing 
of finite span, a layer of air can come only very close to attaching to the surface, but can't actually attach in 
a rigorous sense. Recall from our discussion in Section 5.2.2 that a finite attachment line is at best a band of 
approximate attachment, and that only discrete filaments of flow can actually attach to the surface at singular 
points. Often there is only one point of attachment, like the nodal point marked “N” on the nose of the fuselage 
of the simple wing-body combination sketched in Figure 5.2.4. In this case, strictly speaking, the layers of air 
that join at the trailing edge originated from the filament that attached at the nose of the fuselage, not from lay- 
ers that were split apart by the wing. At any short distance above and below the trailing edge, however, we find 
layers of air that were similarly close together when they passed near the leading edge. When such layers come 
close to “joining” at the trailing edge, they generally will have experienced considerable spanwise displacement 
relative to each other, in addition to the longitudinal displacement that we discussed in connection with 2D flow 
in Section 7.3.1. 


And now to return to our main topic. We've seen that the vortex wake starts its life as a free vortex sheet 
that seems to originate from the trailing edge of the wing. But the trailing edge cannot be the actual origin of 
the vorticity in the wake. Because vortex lines cannot end at a solid surface with a no-slip condition (except at 
singular points of attachment or separation, as we saw in Section 3.3.7), the vorticity in the wake must originate 
in the viscous or turbulent boundary layers on the upper and lower surfaces of the wing. Where does this lead? 
If we look at all of the vorticity present in the 3D flow in the boundary layers and the wake, we see a very 
complicated picture, but it can be simplified greatly if we boil 1t down to its essentials. With regard to the global 
flowfield, what really matters is the net vorticity at any station on the wing or wake, as seen 1n a local plan view. 
At a station on the wing planform, the net vorticity would thus be defined by integration through both the upper 
and lower surface boundary layers; and at a station on the wake, it would be defined by integration through the 
entire viscous layer. The complicated distribution of vorticity through the viscous layer at a given station is thus 
replaced by a single vector value that is much easier to visualize. 

Viewed thusly in terms of net vorticity, the shed vortex sheet is actually a continuation of the bound vorticity 
associated with the lift of the wing, which we discussed in Section 7.2 in connection with lift in 2D. This view 
in terms of net vorticity was option 3 of the ways of looking at bound vorticity that we identified in that dis- 
cussion, illustrated in Figure 7.2.1c. In the 3D case, as the lift per unit span decreases in the outboard direction 
along the span, the circulation and total bound vorticity flux must also decrease. The vorticity representing this 
loss in total strength cannot just disappear and is shed from the trailing edge into the flowfield, supplying the 
vorticity that forms the vortex wake. 

Now imagine the net vorticity on the wing and in the wake as an array of vortex filaments. These filaments 
take on a general horseshoe shape, as shown in Figure 8.1.3. Since each filament of this system forms a horse- 
shoe, if we take a cut through the wake anywhere downstream of the wing, at the plane marked A, for example, 
the filament will be cut in two places, and the flux of vorticity passing through the cut will be equal and opposite 
in the two places. (Recall from Section 3.3.7 that a vortex filament is a construct that carries the same flux of 
vorticity along its entire length.) Thus it is clear from the horseshoe configuration of the vortex system that the 
total vorticity flux passing through a cut through the whole wake is zero. We could arrive at the same conclusion 
by placing a closed contour in the cut such that it encloses the entire wake, and invoking Stokes's theorem on a 
capping surface that bulges out ahead of the wing and thus cuts none of the vorticity. The circulation around the 
closed contour must then be zero, and there can then be no net vorticity flux through any cut across the entire 
wake. 


Figure 8.1.3 Bound and trailing vorticity of a lifting wing viewed as vortex filaments. The plane marked “A” 
illustrates how these filaments would be cut by a transverse plane behind the wing, and the plane marked “B” 
illustrates how they would be cut by a longitudinal plane through the wing 





Another conclusion that follows from the horseshoe configuration of the system is that if we take a cut 
through the wing anywhere along the span, at the plane marked B in Figure 8.1.3, for example, the total fluxes 
of vorticity shed from the trailing edge on opposite sides of the cut will be equal and opposite, and their mag- 
nitudes will equal the flux of bound vorticity at that span station. Then, as a special case, we can say that when 
the lift distribution is laterally symmetrical, the total vorticity flux of the sheet shed from each side must equal 
the flux of bound vorticity at the center. 

Like the boundary layers in which it originated, the wake shear layer is a real physical shear layer filled with 
small-scale turbulent motions. The idealized inviscid theories model the shed vortex wake as a thin vortex sheet 
of the kind we discussed in Sections 3.3.7 and 3.3.8, and illustrations often show it that way for simplicity. In all 
of the discussion that follows, “vortex sheet” can be thought of as referring to either a real physical shear layer 
or to an idealized thin sheet. 

The development of the vortex sheet after it leaves the trailing edge is illustrated in Figure 8.1.4. The vortex 
lines in the sheet leave the trailing edge and follow the general direction of the flow downstream. In the case of 
the ideal thin vortex sheet, the vortex lines are aligned with the mean of the velocity vectors above and below 
the sheet, as in Figure 3.3.8c. Within the first couple of wingspans downstream, the sheet generally rolls up 
toward its outer edges to form two distinct vortex cores. (This is the general pattern for a wing in the “clean” 
condition, flaps up. The flaps-down pattern is more complicated, with cores forming behind flap edges as well 
behind the wingtips.) Although the vortex cores are distinct, they are not as concentrated as they are sometimes 
portrayed, because a considerable amount of air that was initially nonvortical is entrained between the “coils” 
of the spiral formed by the sheet during rollup. 





Figure 8.1.4 Development of the vortex wake downstream of a lifting wing. The lines drawn on the sheet are 
the vortex lines of a continuous distribution of vorticity in the sheet 





In simplified theoretical models, such as the Trefftz-plane theory we'll discuss in the next section, the vortex 
lines are assumed to stream straight back from the trailing edge in the direction of the freestream, and the de- 
formation and rollup of the wake are not represented. The classic argument 1s that the assumption of a nondis- 
torting wake is valid in the limit of zero lift on the wing. For finite lift, however, it is kinematically impossible 
for the wake sheet to remain undistorted. Obviously a nonuniform downwash field behind the wing will distort 
the wake, and downwash fields behind lifting wings are generally nonuniform. Even in the case of an elliptic 
spanwise load distribution, which we'll see in the next section ideally produces a uniform downwash “contri- 
bution” from the trailing vortex wake, the “contribution” from the bound vortex is nonuniform, and the sheet 
must still distort and ultimately roll up. Even if we could find a situation in which the downwash was uniform, 
however, it would still be impossible for the wake to remain undistorted. This is sometimes attributed to an “in- 
stability” (Milne-Thomson, 1966, for example, in Section 10.4 refers to the wake sheet as “unstable” but does 
not provide a detailed explanation, and in Section 3.31 hints that viscosity might play a role). Spalart (1998), 
however, has shown that it 1s not an instability, but a result of the basic kinematics of the sheet, associated with 
the singularity in its strength at the edge, due to the usual infinite slope of the loading at the tip. In the real 
world, of course, there is no singularity, but there still tends to be a high concentration of vortex strength, and 
real wake sheets still roll up at their edges. 

Note in Figure 8.1.4 that in the early phase of wake rollup, the vortex lines are swept outboard toward the 
rolling-up edges of the vortex sheet. As the sheet rolls up into coils, the vortex lines become helical, and when 
rollup is complete, the vortex lines in the outer part of the core appear as illustrated in Figure 8.1.5. In the ideal- 
ized inviscid world, however, the vortex cores would appear as tightly wound spirals, continually stretching and 
tightening. In the real world, the coils of the spiral merge by turbulent diffusion, and diffuse vortical cores are 
formed in which the vortex lines are still helical. The helical vortex lines are very closely aligned with the helic- 
al streamlines. This is consistent with Crocco's theorem (Equation 3.8.8) and with the fact that the total-pressure 
loss associated with the viscous drag has diffused throughout the wake, so that the local total-pressure deficit is 
relatively small. 


Figure 8.1.5 Helical vortex lines and the associated velocities in the rolled-up vortex cores. The helical vortex 
lines line up closely with the helical streamlines 
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To get an idea of how the vorticity should be distributed within the cores, consider the initial distribution 
of vortex strength in the sheet that leaves the trailing edge and is eventually “wound up” into the cores. Figure 
8.1.6 illustrates the distributions of bound and shed vorticity for a typical wing, showing that the shed vorticity 
is most intense at the tips and is much weaker inboard. Based on this, we should expect intense vorticity in the 
center of the rolled-up core and much weaker vorticity in the outer part. 


Figure 8.1.6 Sketch of typical distributions of the fluxes of bound and shed (trailing) vorticity, showing that 
shed vorticity is heavily concentrated near the tip 
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Now let's look at the details of the velocity field in the mature wake far from the wing. The first feature we 
must note is that the pair of vortex cores descends slowly relative to the flight path of the airplane. This is often 
attributed to “mutual induction,” but it 1s better to think of it simply as convection by the downwash behind the 
wing, which persists far downstream because there is practically nothing acting to stop it. We'll consider the 
cause-and-effect issues further in Section 8.1.4. As the cores descend, they carry with them a “descending oval” 
of fluid, as illustrated in Figure 8.1.7. 


Figure 8.1.7 Sketch of the descending “oval” associated with the mature, rolled-up vortex wake, in terms of 
streamlines in the reference frame descending with the oval 








The flow within much of the vortex core in each half of the descending oval is nearly axisymmetric. Figure 
8.1.8 shows how the vertical velocity, the vorticity, and the pressure are distributed spanwise along a horizontal 
line through the centers of the cores (based on rollup calculations by Spalart and consistent with measurements 
by Widnall; see Spalart, 1998). Only the right half of the field is shown, with a symmetry plane assumed at y/bo 
= (0. For comparison, dashed curves show what these distributions would be if all of the vorticity were concen- 
trated in a pair of line vortices, and the flow everywhere else were irrotational (potential flow). Another dashed 
line shows what the distributions would look like for a Rankine core, with the core in solid-body rotation with 
constant vorticity, a model we'll look at again in Section 8.3.3 in connection with induced drag. 


Figure 8.1.8 Details of the flow field of the mature vortex wake, shown as distributions along a horizontal cut 
through the middle of the descending oval. (a) Vertical velocity w. (After Spalart (1998).) (b) Vorticity, estim- 


ated from w, assuming axisymmetric flow in core. (c) Pressure, estimated from w, assuming axisymmetric 
flow in core. (From Spalart, 1998) 
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In the real flow (solid curves), the peak circumferential velocity (Figure 8.1.8a) is quite high and occurs at 
a small radius r1 from the center of the core. Significant vorticity persists out to a much larger radius r2, which 
extends almost to the center plane and to the boundary of the oval. This persistence of the vorticity is clearly 
seen in the fact that the velocity profile does not fair in to the potential-flow curve until it reaches r2, but it 1s 
hard to see in the plot of the vorticity distribution (Figure 8.1.8b) because the scale was chosen to show the 
very high vorticity at the center of the core. The intense vorticity concentrated in the central peak and the lower 
levels in the rest of the core are consistent with the vorticity distribution in the initial sheet that feeds the core, 
as we expected from our earlier discussion. The plot of the pressure distribution (Figure 8.1.8c) shows that very 
low pressures are concentrated only in the intense central core. 


The low pressure in a vortex core is accompanied by low temperature, which often causes condensation of 
water vapor, making the core visible. How much of the core is visible under such conditions depends on the 
situation. In a newly forming core just downstream of a wingtip or flap end, usually only a central portion of 
the core is marked by condensation, making the core appear more compact than it really is. In the farfield, the 
situation is more complicated. Often, engine exhaust has been rolled up into the cores, carrying with it water 
vapor and condensation nuclei (soot) from the engines into the outer parts of the cores. Under such conditions, 
nearly the entire turbulent wake of the airplane may be visible. But the picture can change over time, as the 
condensation evaporates, as it appears to be doing in the photos in Figure 8.1.10. 

The vortex cores are often referred to as “wingtip vortices,” though we can see from the foregoing that this is 
a bit of a misnomer. While it is true that the cores line up not very far inboard of the wingtips, the term “wingtip 
vortices” implies that the wingtips are the sources of all of the vorticity. Actually, as we saw in Figure 8.1.4, 
the vorticity that feeds into the cores generally comes from the entire span of the trailing edge, not just from the 
wingtips. Though it is difficult to tell from the curve in Figure 8.1.8b, the concentrated peak of high vorticity 
inside of rj in Figure 8.1.8b accounts for only about 30% of the total vorticity in the core. 

Figure 8.1.9 illustrates another feature of the velocity field associated with the rolled-up wake. In the direc- 
tion parallel to the core axes, there is usually an axial “jet” in the downstream direction, away from the wing. 


Figure 8.1.9 Sketch of a “jet” of axial velocity in a vortex core 
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Progression of vortex breakup initiated by the Crow instability, made visible by condensation. 
Wake of a B-47 photographed at 15-second intervals. (From Crow, 1970) 





The decay of the trailing vortex cores, if it were by viscosity alone, would be extremely slow, and even 
though the cores are turbulent at all but the lowest Reynolds numbers, turbulent transport 1s suppressed by flow 
curvature, and the decay of the vortices 1s still very slow. At the scale of a large airplane, the vortices would 
persist for hundreds of miles behind the wing if viscosity and small-scale turbulent diffusion were the only dis- 
sipation mechanisms. In actuality, the vortices typically persist for something more on the order of 10 miles, and 
the eventual breakup of the wake is not by small-scale turbulence but by large-scale motions and distortions of 
the vortices, resulting from slow-growing instabilities such as the Crow instability (Crow, 1970). 
shows a progression of breakup initiated by the Crow instability, made visible by water condensation. Spalart 
(1998) refers to this process as collapse, to contrast it with the much slower process of decay. We'll consider the 
breakup process further and speculate on what the whole flowfield looks like in a time-averaged sense on a very 
large scale, in Section 8.5.5. 


§.1.3 The Pressure Field around a 3D Wing 


If we take longitudinal vertical cuts through the flowfield as we did to look at streamlines in | ) , but 
look at the pressure field instead, what we see looks qualitatively like what we saw for a 2D airfoil in 
and = . This is a view 1n which 3D effects are difficult to discern. For purposes of understanding the 


3D flow, the view in cross-stream planes is more informative. Consider pressure distributions 1n a succession 


of cross-stream planes: one a short distance upstream of the wing, one through the middle of the wing, and one 
immediately downstream of the wing. These are illustrated in Figure 8.1.11 for both the 2D and 3D cases. The 
cuts shown for the 2D case are just cross-sections of the generic 2D lifting pressure field we considered in Fig- 
ure 7.3.11. In making these sketches, I've assumed that the maximum chord and load per unit span at the center 
plane of the 3D wing are the same as for the 2D airfoil, so that the center section of the 3D wing matches the 
lift coefficient of the 2D airfoil, not the angle of attack. Because of the downwash in the 3D case (which is one 
of the things we'll be seeking to explain in Section 8.1.4), the 3D wing will need a higher angle of attack than 
the 2D airfoil, and eventually we'll come back around to seeing this reflected in our explanation of the velocity 
field. 


Figure 8.1.11 Gross features of pressure distributions in cross-planes in lifting flows. (a) Upstream. (b) Cut 
through the wing. (c) Downstream 
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Now note in Figure 8.1.11 that the pressure distributions in the 3D case show distinct effects of finite span. 
As the lift decreases outboard of the center section, a combination of the intensity and vertical extent of the 
pressure distribution must decrease, depending on the planform and lift distribution of the wing. In this case, we 
show the vertical extent decreasing, as would be the case if the reduction in local lift load were due mostly to a 
reduction in chord. Note that there 1s also a kind of “3D relief’ effect, in which the pressure disturbances off the 
surface inboard are “dragged down” closer to the smaller disturbances outboard. As a result, the vertical extent 
of the pressure distributions in 3D is smaller everywhere along the span, even at the center section, than it is for 


the 2D airfoil. This more rapid “dying off’ of the pressure disturbances away from the wing in 3D is seen at all 
stations: ahead of the wing, at the wing, and behind the wing. 


8.1.4 Explanations for the Flowfield 


Now that we have a qualitative description of the flowfield around a 3D lifting wing, we'd like to explain phys- 
ically how the flow does what it does. One of the main things we'll want to explain is why the velocity disturb- 
ance downstream, which dies off rapidly in the case of a 2D airfoil, persists over long distances in the case of 
a 3D wing. As we've already noted, the classical approach to this is to describe the distribution of the vorticity, 
both the bound vorticity and the vorticity in the wake, and to use the Biot-Savart law to infer what the velocity 
field does. Of course the Biot-Savart law is applicable, and all of the features of the cross-flow velocity field 
near the wing that we saw in Figure 8.1.2 are “explainable” as being “induced” by the bound vorticity and the 
shed vortex sheet, mostly the part in the near field downstream of the trailing edge. Likewise, the velocity field 
that persists far downstream, and differs from Figure 8.1.2 only in some details, is consistent with “induction” 
by the rolled-up vortex wake shown in Figure 8.1.8. An apparent strength of the vorticity-based approach 1s that 
the convection of a somewhat compact vortex wake downstream provides an “explanation,” of sorts, for the 
persistence of the velocity disturbance. A weakness is that the “explanation” that it provides is incomplete in 
that we had to know or assume a priori how the vorticity is distributed. And we must also remember that even 
if we know the vorticity distribution, appealing to Biot-Savart only gives a correct description of the flowfield, 
and that it does not explain it directly in terms of physical cause and effect. The description using Biot-Savart 
is convenient as a mental crutch and for quantitative purposes, but it is prone to misinterpretation in terms of 
the induction fallacy, as discussed in Section 3.3.9. The correct view is that the vorticity 1s not the cause of the 
flowfield but is more of a passive result of other things that are happening in the flowfield. 


As we noted in the introduction to Section 8.1, a real physical explanation must involve the interaction of the 
pressure and velocity fields. In the case of a 2D airfoil in Section 7.3.3, the vertical component of the velocity 
and its interaction with the pressure field played a prominent role in our explanation of the flow. Now let's see 
how far this kind of thinking can take us in the 3D case, in explaining the cross-flow velocity components illus- 
trated in Figure 8.1.2. Of course, now we'll have to explain the evolution of the spanwise velocity component, 
in addition to that of the vertical component. 

We'll approach the problem by thinking in terms of individual fluid parcels passing through different parts 
of the pressure field sketched in Figure 8.1.11, and of the pressure gradients the parcels are subjected to dur- 
ing their passage. The most obvious conclusions we can draw have to do with those major portions of the field 
where one component of the pressure gradient maintains the same sign throughout a parcel's passage through 
the region. In these situations, the corresponding velocity component is set in motion and not stopped, and we 
should expect that part of the motion to persist downstream. By inspection of the pressure distributions in Fig- 
ure 8.1.11, we can see that this mechanism is consistent with the spanwise velocities in the outward direction 
below the wing and in the inward direction above the wing, and with the upwash outboard of the tips. So these 
features of the flowfield of Figure 8.1.2, which persist far downstream and have no counterparts in the 2D case, 
seem to be explainable in terms of simple gross features of the 3D pressure field associated with the lift. 

The downwash between the tips is the only major feature of the pre-rollup velocity field not yet explained, 
and it is more complicated. The vertical component of the pressure gradient, which drives this part of the mo- 
tion, reverses sign twice for fluid parcels passing above or below the wing, just as it does in the 2D case. In our 
explanation of the 2D case in Section 7.3.4, we saw that the pressure field participates in a delicate balancing 
act that results in downwash that decays to zero far downstream of the airfoil. In the 3D case, on the other hand, 
we know the downwash persists over long distances downstream. With pressure fields that are qualitatively so 


similar, that is, with two reversals of the gradient in both cases, how do we account for the dramatic difference 
in the resulting downwash fields? To answer this question, we have to look at the interactions in both cases in 
more detail. 


In the 2D case, there is both upward and downward turning taking place in the flowfield ahead of the airfoil. 
In connection with Figure 7.3.23, we noted that vertical cuts through the field ahead of the airfoil see the same 
net flux of vertical momentum across them, corresponding to half the lift, which doesn't change from one cut to 
another. However, if we limit our attention to a streamtube that passes close to the airfoil above and below, we 
see that the pressure gradient ahead of the airfoil turns the flow upward, then the gradients above and below the 
airfoil turn the flow downward, and finally the gradient behind the airfoil turns the flow upward again, canceling 
the local downwash velocities in an asymptotic sense far away from the airfoil. The upward turnings ahead of 
the airfoil and behind are just enough to cancel the downward turning that takes place as the flow passes close 
to the airfoil surfaces. 

In the 3D case, the downward turning immediately above and below the wing 1s stronger than it 1s in the 2D 
case, for the same lift. This is because the more rapid dying off of the pressures above and below the airfoil 
means the vertical pressure gradient near the wing surface is stronger than in 2D. More rapid downward turning, 
resulting in larger downwash by the time the trailing edge is reached, is also consistent with the fact that the 
3D wing requires a higher angle of attack to achieve the same lift. The airfoil pressure field also dies out more 
rapidly ahead of the airfoil and behind, which results in less upward turning of the flow in those regions. So in 
the 3D case, we have more downward turning above and below the wing, and less upward turning ahead and 
behind, with the result that some downwash persists in a central portion of the field behind the wing, that 1s, 
between the wake-vortex cores. 

To complete this explanation, we must point out that all of these effects of the pressure gradients on the cross- 
flow velocities constitute only one side of the interaction. Remember that cause-and-effect is a two-way street 
and that the velocity changes, or accelerations of the flow, are both caused by the pressure gradients and also 
serve to sustain the pressure gradients. This is the same point that we made a major issue of in our explanation 
of 2D airfoil flow in Section 7.3.3. There we talked about “confinement” of the “clouds” of high and low pres- 
sure and how vertical and longitudinal accelerations of the flow provided that confinement. That description of 
vertical and longitudinal confinement also applies in the 3D case, but the spanwise component of acceleration 
also comes into play: The outboard acceleration of the flow beneath the wing and the inboard acceleration above 
the wing provide spanwise confinement of the pressure differences around a 3D wing. 

In the 3D wing flow, the vertical pressure gradients above and below the airfoil are sustained by the down- 
ward turning of the flow, just as we noted that they are in 2D. However, a feature of the 3D pressure field that 
is not so easy to explain in simple qualitative terms is the “3D relief effect” that we described in Section 8.1.3, 
in which the vertical extent of the pressure distribution in 3D 1s lower than in 2D for the same chord and lift 
per unit span. Reducing the vertical extent of the pressure distribution means an increase in the vertical pressure 
gradient close to the wing surface and a reduction farther from the surface. It is a result of the 3D flow's freedom 
to accelerate spanwise, but not a simple result to explain. 


It 1s also interesting to note how the pattern of horizontal and vertical velocities that we've just explained 
fits together in terms of conservation of mass. Referring to Figure 8.1.2, note that the horizontal velocities con- 
verge toward the center plane above the wing and diverge from the center plane below the wing. The downwash 
between the tips thus “exhausts” the converging flow above and “feeds” the diverging flow below. Around the 
tips, the opposite occurs: divergence above and convergence below, which are “relieved” by the upwash out- 
board of the tips. So we see that for the horizontal velocities that were set in motion by the wing to persist far 
downstream, they must be accompanied by downwash between the tips and upwash outboard, and must there- 
fore be part of a general circulatory pattern behind each half of the wing. And, of course, each of these circulat- 
ory regions must have vorticity (half of the vortex wake) somewhere inside it. 


The final features needing an explanation are the axial jets in the rolled-up vortex cores that we saw in Figure 
8.1.9. It has been shown that far downstream of the wing the component of the velocity disturbance parallel to 
the core axes is nonzero only within the vortical cores (Spalart, 2008). Within the cores, it is clear from Figure 
8.1.9 that the axial velocity disturbance is “explainable” as being “induced” by the circumferential component 
of vorticity associated with the helical configuration of the vortex lines, which, as we've already noted, lines up 
closely with the helical configuration of the streamlines. A direct physical explanation starts with the observa- 
tion that balancing the centrifugal forces associated with the circumferential velocities requires a radial pressure 
gradient, and therefore substantially lower than ambient pressure within the cores, as we saw in Figure 8.1.8c. 
The air in the cores started at ambient pressure upstream of the wing and, having entered the low-pressure region 
within the cores, has experienced a net acceleration in the axial direction. And again, our caveat regarding one- 
way cause and effect applies, and we must remember that the accelerations and the pressure gradients share a 
mutual interaction. 


6.1.5 Vortex Shedding from Edges Other Than the Trailing 
Edge 


So far we've considered the flow over wings of moderate-to-high aspect ratio, assuming that all of the significant 
vorticity shedding is from the trailing edge. But vorticity shedding is not always confined to trailing edges. On 
many wings, especially those with squared-off or nearly squared-off tips, the shedding at the tip starts well for- 
ward of the trailing edge, on the nearly streamwise “edge” at the tip. In this case, the vorticity can roll up over 
the wing upper surface, as shown in Figure 8.1.12a. A similar pattern is common on the squared-off outboard 
edges of deployed trailing-edge flaps. On high-aspect-ratio surfaces, such details near the tips have only small 
effects on the overall development of the vortex wake and the global flowfield. However, on a low-aspect-ra- 
tio wing, shedding from edges other than the trailing edge can dominate the development of the flow. For ex- 
ample, on low-aspect-ratio delta wings at moderate-to-high angles of attack, most of the vorticity 1s shed from 
the highly swept leading edge, and large, partly-rolled-up vortex coils occupy much of the area over the wing 
upper surface, as shown in Figure 8.1.12b. 


Figure 8.1.12 Vorticity shedding from edges other than the trailing edge. (a) The nearly streamwise tip “edge”’ 
of a rectangular wing (dye visualization in a water tunnel by Werle, 1974). Photo by Werle, (1974). Courtesy 
of ONERA. (b) The highly-swept leading edge of a low aspect-ratio delta wing (dye visualization in a water 
tunnel by Werle, 1963) Photo by Werle, (1963). Courtesy of ONERA 





8.2 Distribution of Lift on a 3D Wing 


In the steady attached-flow regime, the lift distribution on a 3D wing can generally be predicted reasonably 
accurately by high-fidelity computational fluid dynamics (CFD) with turbulence modeling. The importance of 
viscous effects in these predictions varies greatly, depending on the conditions. Under transonic conditions, the 
displacement effect of the boundary layer is very important, and the accuracy of predictions is often limited by 
our inability to model the turbulent boundary layer sufficiently accurately. At low Mach number and high Reyn- 
olds number, the displacement effect of the boundary layer has a smaller effect on the pressure distribution, and 
an inviscid solution can provide a reasonable prediction of the lift distribution. Here, we'll consider what we can 
learn with help from simplified inviscid theories. 


8.2.1 Basic and Additional Spanloads 


If the aspect ratio of a 3D wing is reasonably high, and the lift coefficient isn't too high, the spanload can usually 
be decomposed into two parts: 
1. The basic spanload at zero total lift, which depends on the planform, the shapes of the local airfoil 
sections, and the twist distribution of the wing, which we'll define below, and 


2. The additional spanload due to angle of attack, which depends only on the planform and is proportional 
to the angle of attack relative to the angle for zero total lift. 


A formal justification for this decomposition could be derived from either linearized lifting-surface theory or 
lifting-line theory, which are both discussed briefly below. Less formally, it should hold provided that 
1. The airfoil sections all along the span, except near the tips, behave like 2D airfoil sections that feel the 
effects of finite span only through changes in their effective angles of attack, due to the local 3D down- 
wash, which is where our assumption of high aspect ratio and low loading comes 1n; 


2. The sectional lift curves are linear, which we found in Section 7.4 to be approximately true for 2D air- 
foils in the attached-flow regime in the absence of transonic effects; and 


3. Nonlinear effects, such as movement of the vortex wake with angle of attack are negligible. Note that 
we haven't had to assume any particular shape for the vortex wake, only that any effects of movement of 
the wake are negligible. 


Note also that it should be permissible for the wing to be nonplanar, that is for it to have dihedral or nonplanar 
tip devices. We define the “twist distribution” as the distribution along the span of the orientations of the zero- 
lift lines of the sections, though sometimes in other contexts the term is used to describe the incidences of the 
sectional chord lines. Thus if the wing is shaped so that the sectional zero-lift lines of all of the airfoil sections 
are parallel, the wing is considered to be untwisted, and the basic spanload at zero total lift will be zero all along 
the span. If the orientations of the sectional zero-lift lines vary along the span, the wing is said to be twisted, 
and there will be positive and negative loads on different parts of the span when the total lift is zero, which 
constitutes a nonzero basic spanload, and there will be nonzero vorticity shed into the wake. Now as the angle 
of attack 1s changed from the zero-lift value, assumptions (2) and (3) above guarantee that both the additional 
wake vortex strengths and the additional sectional loadings all along the span will vary linearly. Because the 
local 3D downwash changes with angle of attack, the sectional lift slope at each station along the span will be 
different from what it would be for that airfoil section in 2D. And as we saw in Section 8.1, the overall lift slope 
of the 3D wing will be less than that of a 2D airfoil. 

The basic and additional spanloads, and their sum, are illustrated for a typical twisted, unswept wing in Fig- 
ure 8.2.1. For an untwisted wing, the basic spanload would be zero everywhere, and only the additional span- 
load would be nonzero. In this case, the wing was assumed to have a typically small amount of washout (1.e., 
it is twisted leading-edge down outboard) so that the basic spanload is negative outboard. Figure 8.2.2 shows 
the same spanload decomposition for a comparable swept wing, illustrating how aft sweep tends to shift the 
additional spanload outboard. 


Figure 8.2.1 Illustration of the spanload decomposition for a typical unswept wing, assumed to have a small 
amount of washout (twist leading-edge down outboard) 
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Figure 8.2.2 Illustration of the spanload decomposition for a typical aft-swept wing, assumed to have the sub- 
stantial washout (twist leading-edge down outboard) that is typically required to achieve a favorable total 
spanload on a swept wing 





This effect is usually explained in terms of vortex “induction,” as illustrated in Figure 8.2.3. At any station 
on the wing outboard of span station A, the wing “feels” more upwash from the trailing vorticity inboard and 
less downwash from the trailing vorticity outboard than it would if the wing were unswept As a result, the wing 
outboard of A feels less downwash than it would in the unswept case. This effect is just as easily (and better) ex- 
plained in terms of the pressure field. Consider the cross-stream plane P cutting the wing near where the leading 
edge crosses station A. The pressure field in this plane would look like a part-span version of that shown for the 
full 3D wing in Figure 8.1.11b. The flow approaching the wing outboard of station A thus experiences a vertical 


pressure gradient like that outboard of a wingtip, that is, low pressure above and high pressure below, and 1s ac- 
celerated upward (more so than it would be in the unswept case, because of the influence of the wing inboard). 
Again we conclude that the wing outboard of A feels less downwash than it would in the unswept case. This 3D 
effect of the planform can be quite strong. In fact, for sweep angles typical of swept-wing transport airplanes, 
the outboard half of the wing actually feels a 3D upwash instead of a downwash. 


Figure 8.2.3 Illustration of how a section of an aft-swept wing 1s “influenced” by more shed vorticity inboard 
and less shed vorticity outboard compared with an unswept wing 
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An untwisted aft-swept wing would have a spanload like the additional spanload illustrated in Figure 8.2.2, 
which is undesirable for several reasons: It produces unnecessarily high induced drag, it leads to excessive 
bending moments on the structure at high-load conditions, and it produces a tendency for the tips to stall first, 
which is bad for airplane handling characteristics. To avoid these effects, an aft-swept wing must generally be 
designed with considerable washout, and thus have a basic spanload with a strong download outboard, to have 
an advantageous total spanload like that shown in Figure 8.2.2. The favorable spanload that is achieved in this 
way persists only over a limited range of angle of attack. 


8.2.2 Linearized Lifting-Surface Theory 


A linearized version of the incompressible inviscid theory is sometimes useful for illustrating trends and provid- 
ing insight into the behavior of 3D wings, though it suffers a significant loss in physical fidelity. We assume 
the airfoil sections are thin, and the angle of attack is small, just as we did in 2D (see Section 7.4.1). The flow 
disturbance produced by the wing is represented by singularities distributed over the chord plane, and the no- 
through-flow boundary conditions on the wing's upper and lower surfaces are approximated by velocity-slope 
conditions applied at the chord plane, ignoring the perturbation u, as in 2D. A complication not present in 2D 
is that the vortex wake must also be modeled. This is done on the assumption that the wake is confined to a 
sheet that does not distort, and the vortex lines in the sheet stream straight back from where they are shed from 
the trailing edge, an assumption we'll see again and discuss further in connection with the Trefftz-plane theory 
of induced drag in Section 8.3.4. A derivation of the integral equations of the theory is given by Ashley and 
Landahl (1965). We'll not go into the details here; we'll limit our discussion to the general conclusions to be 
drawn from the theory. 


Of course, linearity allows solutions to be constructed by superposition, just as in 2D, and we can look at 
the effects of various geometry features separately. In 2D we identified separate effects of camber, thickness, 
and angle of attack. In addition to these three effects of airfoil section shape and orientation, in 3D we have the 
effects of the wing planform, that is, the distribution of chord along the span, and the sweep, if any. As was 
the case with regard to spanload decomposition in Section 8.2.1, it should be permissible for the planform to 
be nonplanar, that is, for the wing to have dihedral and for the dihedral angle to change along the span, which 
would require the wake sheet to be correspondingly “bent” in rear view. However, references on the theory usu- 
ally assume that the wing is confined to a single plane, as in Ashley and Landahl (1965). 

The three basic sectional effects have different relationships to the effects of planform. Sectional camber and 
angle of attack both affect lift, and therefore they affect the distribution of vorticity in the wake, which by “in- 
duction” affects the velocity perpendicular to the chord plane at other locations on the span. Because of this, 
sectional camber and angle of attack have effects that are not just local, but spread over the entire planform in a 
way that depends on the details of the planform. The effects of section thickness are less strongly coupled to the 
planform. If the wing has a high aspect ratio in addition to being thin, the effects of thickness become effect- 
ively local in the limit, depending only on the local streamwise distribution of thickness and the local sweep of 
the planform, in a manner consistent with the “simple sweep theory” that we'll discuss in Section 8.6.1. In 3D 
linearized solutions, just as in 2D, airfoil thickness does not affect the distribution of lift. 

So in the linear limit, the distribution of lift on a 3D wing depends only on the planform and the distributions 
of sectional camber and angle of attack. The total lift varies linearly with a, just as it does in 2D, but due to 3D 
effects the lift curves at different stations along the span can have different slopes and intercepts. The lifting- 
surface theory predicts both the spanwise and chordwise distributions of load. The downwash is not assumed to 
be constant in the chordwise direction, so that downwash can affect not just the local effective angle of attack, 
but also the local effective camber. Still, because of the general linearity that is assumed, the spanload can be 
decomposed into a basic part at zero lift and a part proportional to angle of attack, as in Section 8.2.1. There 
we assumed that the aspect ratio is high, and the local downwash affects only the local angle of attack. Here we 
assume that disturbances are small, and we needn't assume high aspect ratio. 


§.2.3 Lifting-Line Theory 


The simplest way to predict just the spanload of a 3D wing is the so-called Jifting-line theory, 1n which the 
chordwise distribution of the load is ignored. The lift is assumed to be concentrated in a single bound vortex, 
called the /ifting line, generally located along the quarter-chord line of the planform, and the vortex-wake sheet 
is assumed to stream straight back from that, as illustrated in Figure 8.2.4. The bound vortex strength is related 
to the local lift per unit span using the Kutta-Joukowski theorem, Equation 7.2.1, and as the local lift changes 
along the span, the change in bound vortex strength is shed into the wake, in keeping with Helmholtz's second 
theorem (Section 3.3.7). Thus the distribution of vortex strength in the wake sheet is equal to the spanwise rate 
of change of the bound vorticity. In the early theory developed by Prandtl and his colleagues, the lifting line is 
assumed to be straight, so that the bound vortex at one part of the span has no influence on the downwash on 
other parts. The 3D downwash is thus assumed to be only that which is “induced” by the trailing vortex wake, 
and it is evaluated at the upstream end of the vortex wake, which is by definition on the lifting line itself. Local 
sections of the wing are assumed to function as 2D airfoils with known sectional (2D) lift curves, with each 
section operating at an effective angle of attack modified by the local 3D downwash angle. 


Figure 8.2.4 Arrangement of the bound vortex at c/4 and the trailing vortex lines in the early development of 
lifting-line theory. Control points (x) are placed on the bound vortex 
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The original theory was justified by the informal physical arguments I just outlined. Later, Van Dyke (1964) 
used the method of matched asymptotic expansions to show that lifting-line theory represents a formally valid 
approximation in the limit of high aspect ratio and small loading. Early lifting-line theory was used not only to 
predict spanload, but also induced drag, which we'll consider in Section 8.3. 

In the more general case in which the lifting line 1s not straight, the “contribution” of the lifting line itself to 
the downwash must be taken into account. The original formulation of lifting-line theory, in which the down- 
wash is evaluated on the lifting line itself, then breaks down because a curved lifting line has infinite self-in- 
duced velocity. One way to get around this problem is to introduce a different kind of boundary condition, in 
which the downwash angle “induced” by the bound and trailing vorticity 1s evaluated at a downwash line loc- 
ated off of the lifting line and is made to account for both the 3D part of the downwash and the effective sec- 
tional angle of attack. This calls for setting the downwash angle equal to the angle of the sectional zero-lift line, 
and placing the downwash line at the 3/4-chord location, as illustrated in Figure 8.2.5. The 3/4-chord location is 
chosen because the downwash there, in the 2D case, is equal to the angle of attack of the zero-lift line, provided 
the 2D lift-curve slope has the linear-theory value of 27, a result known as Pistolesi's theorem. (The reader can 
easily verify this using the Kutta-Joukowski theorem, Equation 7.2.1 and the definition of circulation.) 


Figure 8.2.5 Arrangement of the bound vortex at c/4, the trailing vortex lines, and the downwash line at 3c/4 
in later developments of lifting-line theory. Control points (x) are placed on the downwash line 
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It is clear from Figure 8.2.5 that when the downwash line 1s located off of the lifting line, the calculation of 
the downwash “induced” by the trailing vortex lines requires accounting for the additional chordwise segment 
between the downwash line and the lifting line. In the Weissinger “L”’ method (Weissinger, 1947), a simplified 
approximate accounting for the additional segment is used (also see Ashley and Landahl, 1965). 


Of course when the lifting line is not straight, the bound vortex influences not just the local 3D downwash 
angles, but the effective local freestream velocity magnitudes as well. This leads to what is often called a non- 
linear lift effect because 3D “induction” now affects the local lift through both the local downwash (and thus the 
local effective angle of attack and the local I) and the local effective Us. The local-Uw effect can be thought of 
as either affecting the lift for a given TI or the I required to produce a given lift. This nonlinear effect is gener- 
ally ignored in lifting-line calculations for several reasons. First, 1t keeps the equation system linear. Second, it 
wouldn't be consistent to include this nonlinear effect while assuming the crude lifting-line model for the vortex 
wake. Finally, the effect has been found to be small for wings of reasonably high aspect ratio (see Eppler, 1997, 
for example). 

In numerical implementations of lifting-line theories, the vortex wake is usually discretized as an array of 
line vortices of finite strength, and the bound vortex is assumed to be straight and to have constant strength 
between the intersections with the trailing vortices. The boundary condition is enforced at discrete control points 
between the trailing vortices. In most methods intended for application to nonstraight lifting lines, the control 
points are placed at the 3/4-chord location as indicated in Figure 8.2.5. Discrete methods have been proposed, 
however, 1n which the control point is placed on the lifting line, as in Figure 8.2.4, even though the lifting line 
is not globally straight (Phillips and Snyder, 2000, for example). The problem that this incurs is hidden from 
view because the discrete straight lifting-line segments artificially mask the problem of infinite velocity that we 
discussed above. But the problem is still there in the limit as the segment length goes to zero. Thus locating 
the control points away from the bound vortex is still the only way to have a general formulation that doesn't 
behave badly as the discretization is refined. 

Even when a downwash line separate from the lifting line 1s used, lifting-line theory in effect assumes that the 
downwash due to finite span doesn't vary much in the chordwise direction, over the whole chord of the section 
at any given station along the span. This is not a bad assumption for high-aspect-ratio wings with reasonably 
straight quarter-chord lines, and in such cases, the theory can provide fairly accurate results. However, for swept 
wings, which generally have a pronounced kink in the quarter-chord line at the center station, the assumption 
is poor for the inboard part of the wing, and Thwaites (1958) goes so far as to state that lifting-line theory is 
“completely unjustified” for swept wings. Still, it is often used for swept wings anyway, and semi-empirical 
adjustments to improve its accuracy in the neighborhood of the kink in the lifting line have been proposed, as in 
Barnes (1997). 


§.2.4 3D Lift in Ground Effect 


In Section 7.4.9, we saw that as an airfoil in 2D flow gets closer to a ground plane, the lift is first reduced and 
then increased. For a 3D wing, a ground plane has a 3D effect on lift, which often overwhelms the 2D effects. 
A ground plane in 3D also affects the induced drag, as we'll see in Section 8.3.9. 

When a wing flies close to the ground, the no-through-flow condition at the ground forces the flowfield 
around the wing to change in a way that generally increases the lift at a given angle of attack or reduces the 
angle of attack required for a given lift. One way to look at this is that the ground has the effect of inhibiting 
vertical velocity throughout the field and therefore reduces the 3D downwash in which the wing is flying. 

A second way to look at it that also provides a basis for simplified quantitative calculations is to invoke the 
idea of images. A simple way to ensure that the no-through-flow condition at the ground is satisfied is to place 
an image of the vortex system below the ground, as shown in Figure 8.2.6. Then part of the downwash “in- 
duced” by the real vortex system can be seen to be canceled by the upwash “induced” by the image system. A 
single horseshoe vortex and its image could be used for this purpose, but the physical fidelity would be poor. 
A model with a more realistic distribution of the shed vorticity, but ignoring rollup, like that used in lifting-line 


theory (Section 8.2.3) or an inviscid panel method (Chapter 10), would provide somewhat better fidelity, and 
the effect on lift could still be readily calculated. The change (increase) in lift at a fixed angle of attack for a 
planar wing with a rectangular planform of aspect ratio 10 and no twist is shown in Figure 8.2.7, as calculated 
by a panel method with a nondistorting wake sheet. In this example, the 2D lift decrease is more than offset 
by the 3D lift increase due to the reduction in downwash. We can look at finite span as having two effects that 
work in the same direction. First, finite span introduces 3D downwash, which is reduced by the presence of the 
ground. Second, finite span also attenuates the 2D effects of the ground because an image bound vortex of finite 
span has less “influence” than one of infinite span. 


Figure 8.2.6 Bound and trailing vortex system of a wing flying in ground effect, and its image under the 
ground plane 





Figure 8.2.7 The increase in lift of a wing in ground effect. Results of a lifting-line calculation for a rectangu- 


lar planform AR = 10, no twist. C7 = 0.81 out of ground effect 
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Comparing Figure 8.2.7 for a 3D wing with Figure 7.4.35 for a 2D airfoil, it appears that for wings of ordin- 
ary aspect ratio the effect of the ground on lift is dominated by finite-span effects. Note that 2D lift in Figure 
7.4.35 is decreased until h/c goes below 0.3, which corresponds to h/b = 0.03 in Figure 8.2.7. Above this value, 
the 3D effect in Figure 8.2.7 clearly dominates. Presumably at some point below h/c = 0.3, the 2D effect would 
begin to contribute more to the lift increase than does the 3D effect, but this range is seldom of practical interest. 
For a wing with significant dihedral, or sweep combined with angle of attack, such low values of h/c would not 
be reachable over much of the span, even without a landing gear. 


Depending on the geometry of the wing, ground effect can change the shape of the spanload, and if the wing 
is swept, it can cause substantial changes in the pitching moment. A swept-wing airplane with an aft tail can 
experience complicated changes in its lift curve and pitching moments as functions of height when in close 
proximity to the ground plane. 


8.2.5 Maximum Lift, as Limited by 3D Effects 


In Sections 7.4.3 and 7.4.4, we looked at how the maximum lift of 2D airfoils, both single-element and 
multiple-element, is limited by boundary-layer separation. It turns out that sectional maximum lift, as limited by 
boundary-layer separation, 1s also generally the limiting factor for 3D wings. However, in the late 1950s, there 
was considerable interest in flap systems that used active jet blowing to control separation (“blowing BLC’’) 
and, when the blowing was very strong, to directly enhance the circulation around the airfoil (the “jet flap’). In 
such cases, the 3D downwash field can become the factor that limits the maximum lift of a 3D wing. 

As we saw in Section 8.1, the downwash associated with finite span has the effect of tilting the lift vector 
back. The horizontal component 1s felt as induced drag, and the vertical component is reduced to something less 
than the magnitude of the force. Furthermore, the magnitude of the force for a given circulation (bound vortex 
strength) is reduced because the vortex wake is generally tilted downward, so that the velocity “induced” by it 
has a forward component that subtracts from the effective freestream velocity. Thus when we try to increase the 
circulation on a 3D wing, by whatever means, the tilting back of the force vector and the reduction of the ef- 
fective freestream velocity both increase, and presumably at some point the vertical component (the lift) should 
stop increasing, thus defining a maximum achievable lift limited by 3D downwash. 

Davenport (1960) looked at three highly idealized models for this effect that had been proposed by others and 
found that their predictions varied widely depending on their assumptions about the wake. By their nature, such 
theories predict maximum lift proportional to span, independent of wing area. Thus when normalized by wing 
area, they all predicted Cymax proportional to aspect ratio. However, the constants of proportionality ranged 
from about 0.8 to 2.0. Davenport proposed a model of his own that gave a result at the high end of this range, 
but also concluded that the effect depends strongly on the details of the flow, especially as reflected in the tilt 
of the wake near the airfoil. In any case, the range of CLmax predicted by these models is so high as not to be 
achievable without some form of active flow control. 


8.3 Induced Drag 


In Section 8.1.1, we looked at the flowfield around a lifting wing of finite span, and we saw how the lift vector 
is tilted back, making a contribution to drag that we call induced drag. In this section, we delve into the related 
quantitative theory, which we should note at the outset requires some degree of idealization. Recall that in Sec- 
tion 6.1.3 we discussed how it is essentially impossible to decompose the total drag force on a body rigorously 
into separate contributions based on the different flow mechanisms responsible. In the theories of induced drag 
in this section, we'll sidestep that issue by assuming that the flow is inviscid and that there are no total-pressure 
losses through shocks, so that the induced drag is the only drag “component” present. So we must keep in mind 
that quantifying induced drag as a separate “component” of the drag force is an idealization. 

But assuming inviscid flow in the theory doesn't cost us as much in terms of accuracy as one might think in1- 
tially. We can use induced-drag theory without necessarily assuming that the entire flowfield must be consistent 
with inviscid flow. For example, in theories in which the lift distribution on the wing is an input, we can use a 


lift distribution consistent with the real flow, including viscous and transonic effects. Because we reintroduce 
realism in this way, the conclusions we draw from induced-drag theory can be reasonably accurate in most of 
the more general situations we'll encounter in practice. Just keep in mind that the theory of induced drag gener- 
ally ignores some physical complications and incurs at least some small error as a result. 


8.3.1 Basic Scaling of Induced Drag 


By appealing to the idealized lifting-line model for the flow around a simple wing illustrated in Figure 8.2.4, we 
can deduce how induced drag should scale with the lift, the flow conditions, and the dimensions of the wing. If 
I’ is the centerline circulation, the total lift will go as pUoIb. As we argued in Section 8.1.1, the wing is flying 
in a downwash field of its own making, as illustrated in Figure 8.1.2. We'll therefore assume that the induced 
drag is given by the lift tilted back through an average downwash angle s, or Dj ~ L € . For the simple straight 
lifting line in Figure 8.2.4, the lifting line “induces” no downwash on itself, and we need only consider that 
“induced” by the trailing vortex wake, for which the downwash velocity goes as y/b, and € goes as y/Uxb. Com- 
bining these so as to eliminate I, we get 
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which illustrates some important trends. Induced drag increases rapidly with increasing lift and decreases rap- 

idly with increasing span. Flying at high altitude (small p) or low speed increases the induced drag. Induced 

drag is the one major part of the drag of an airplane that decreases with increasing speed, in contrast with the 
et 


viscous drag that we considered in Chapter 6, which tends to increase roughly as Use . Given these two oppos- 
ing trends, the drag tends to be dominated by induced drag at low speeds and by viscous drag at high speeds, 
‘ o 


with a drag minimum in between, as illustrated in Figure 8.3.1. In this illustration, the ideal 1/U so and Usx 
dependences were assumed, so that the minimum total drag occurs where each component contributes half the 
total. For real wings, the nonideal behavior of the profile drag tends to drive the minimum drag to a higher speed 
(lower CL), as we discussed in Section 7.4.2. Nonideal behavior of the induced drag can shift the minimum in 
either direction. 


Figure 8.3.1 Schematic drag-versus-speed curve for an airplane, illustrating the induced and viscous contribu- 
tions 
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Note that as a result of our lifting-line assumptions the induced drag depends on the span of the wing and 
not the area. This also holds in the Trefftz-plane theory, which we'll discuss in Section 8.3.4 and which provides 
sufficient accuracy for nearly all practical predictions of induced drag. So for practical purposes, induced drag 
does not depend at all on wing area. This sets induced drag apart from the lift, viscous drag, and pitching mo- 
ment, which tend to be proportional to area, and introduces a practical problem that has been a source of some 
confusion. Most of our dealings with these forces are in terms of the dimensionless coefficients CL, Cp, and Cm. 
Because all of the raw dimensional forces other than the induced drag are roughly proportional to qS, nondi- 
mensionalizing by qS is the only choice that makes sense. But then to be consistent, we must nondimensionalize 
induced drag the same way, getting 
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So unfortunately, nondimensionalizing induced drag by wing area makes it look (misleadingly) as if the induced 


drag depends on wing area, or aspect ratio. The appearance of aspect ratio in such formulas is a red herring, an 
artifact of a nondimensionalization that is more appropriate for other quantities. 


8.3.2 Induced Drag from a Farfield Momentum Balance 


We can derive a very general formula for the induced drag based only on the farfield flow, with only minimal 
assumptions about how the flow behaves: 
1. The flow is steady and inviscid, and density variations in the farfield can be ignored, so that we can use 
the steady, incompressible form of Bernoulli's equation. 
2. Velocity disturbances in the farfield tend to zero except in the neighborhood of a vortex wake that is 
convected indefinitely downstream but does not spread out without bound in the other directions. This 1s 
consistent with the general character of the vortex wake that we saw in Section 8.1.2. 


3. The wake flowfield 1s established during the wake rollup process, which is effectively completed with- 
in the relative nearfield of the airplane, so that the wake becomes unchanging from there downstream, 
except for a downward drift. 


We orient the x-axis in the flight direction and tie our reference frame to the airplane in steady flight, so that 
the freestream velocity is U, and the velocity everywhere else is (U, V, W) = (U + u, v, w). We expect the wake 
far downstream to carry a significant u disturbance, including a nonzero integrated u, and thus a net flight-direc- 
tion mass flux (remember the axial jets in the rolled-up vortex wake in Figure 8.1.9). This flight-direction mass 
flux will enter into the momentum balance in any control volume with a far-downstream boundary, and it must 
come from somewhere. A wake with a jet, as in Figure 8.1.9, requires flow to converge toward the region where 
the wake 1s forming, while a wake with a velocity deficit (negative u) would require flow to diverge. From far 
enough away, this will look like a single sink or source located in the neighborhood of the airplane, because we 
have assumed that wake development is completed not far from the airplane. In our derivation of the simplified 
formula for viscous drag (Equation 6.1.3) we also had to account for a balancing mass flux, in that case a source. 

In the momentum balance for a general control volume surrounding the airplane, the pressure and 
momentum-flux disturbances in the neighborhood of where the wake leaves the downstream boundary are obvi- 
ously significant. The significance of disturbances elsewhere is not quite as obvious. The velocity and pressure 
disturbances associated with the source or sink are spread diffusely in all directions and die off with increasing 
distance, but it turns out that they don't die off fast enough that their integrated effects can be neglected, no 
matter how far away we put the boundaries. Having to deal with the source or sink terms complicates the ana- 
lysis a bit, and the source or sink ends up dividing up its contributions through the pressure and the momentum 
flux differently depending on the shape of the control volume. The total contribution of the source or sink to 
the inferred drag must of course be the same regardless of how the control volume is shaped. We can get the 
right answer for the total contribution of the source or sink and simplify the algebra considerably, if we assume 
a particular kind of shape for the control volume. 

We give the control volume a general cylindrical shape as in Figure 8.3.2, with an upstream boundary, a 
downstream boundary, and lateral/top/bottom boundaries that simply need to form a general cylinder (not ne- 
cessarily circular) parallel to the x-axis. 


Figure 8.3.2 Cylindrical control volume for deriving Equation 8.3.5 for the induced drag from a far-field mo- 
mentum balance 
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The cross-section shape in the y-z plane can be anything, as long as the cross section 1s large enough that 
the downstream boundary captures all of the significant pressure and velocity disturbance associated with the 
vortex wake. Now we take the upstream and downstream boundaries to large distances compared to the other 
dimensions of the control volume, but not so large that downward drift of the vortex wake carries it out through 
the cylindrical boundary instead of the downstream boundary. To proportion the control volume in this way re- 
quires a small inclination angle for the vortex wake. This requires that the spanloading on the wing producing 
the wake not be too large, which is more restrictive than our original assumptions (1—3). This does not mean we 
are assuming that u, v, and w are small in the vortex wake. The assumptions regarding the shape of the control 
volume lead to the following simplifications: 


1. The cylindrical part of the boundary makes no contribution to the momentum balance through the pres- 
sure, because its normal is everywhere perpendicular to the x-axis. 


2. At the upstream and downstream boundaries we can neglect the pressure and velocity disturbances due 
to the source or sink. 


3. The strength of the source or sink can be calculated from the integrated u deficit or excess at the down- 
stream boundary, designated T, due solely to the vortex wake: 


m= p /| (Cl — ttuake) ds. 
8.3.3 re i 


The total effect of the source or sink on the momentum balance is due to the momentum flux through the 
cylindrical boundary and is given by I! C. 
With these simplifications, the momentum balance becomes 


D;= /| (p — pa.) ds — p || (2U'u + u?) ds — mU. 
8.3.4 fa T JIT 


After we use the steady, incompressible Bernoulli equation to express the pressure in the wake terms of U, u, 
v, and w, and simplify, we obtain 


7 pf, 7. ,, , 
p= : /| (—u" +0° + w*)ds. 
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The negative sign of the u> term is a little disturbing at first glance, because it means that the integrand is 
not positive-definite and raises the concern that the drag might not be always be positive. A rigorous argument 
by Spalart (2008) indicates that this is not a problem. An informal argument that reaches essentially the same 
conclusion goes as follows: 


The components v and w are proportional to the vorticity in the wake, while u 1s proportional to both the 
vorticity and to the inclination of the helical vortex lines, which is proportional to v and w. So uw 1s higher 
order in the vortex strength than v and w. Because we haven't assumed small disturbances in the wake, this 
does not guarantee that u> is small compared with the other terms, but it does indicate that u> will never 
outweigh the other terms and that the drag will always be positive. 


In the next two theoretical models that we'll consider, we'll ignore the downward drift of the wake and the 
associated general downward tilt of the vortex lines. We'll also ignore any other deviation of the vortex lines 
from the freestream direction, as, for example, in the helical alignment of the vortex lines in the cores shown in 
Figure 8.1.5. The farfield wake then has no u disturbance associated with it, and the u- term in Equation 8.3.5 
is zero. We can then interpret the induced drag as being accounted for by the kinetic energy left behind in suc- 
cessive slices of the flow in the wake. 


8.3.3 Induced Drag in Terms of Kinetic Energy and an 
Idealized Rolled-Up Vortex Wake 


Here we make the same basic assumptions as we did leading to Equation 8.3.5, and we calculate the velocities 
in Equation 8.3.5 using the Biot-Savart law, based on simple assumptions regarding the distribution of vorticity 
in the wake. We ignore the tilt of the wake and the circumferential component of the vorticity in the vortex 
cores. The simplest model for a rolled-up vortex wake 1s a pair of line vortices, but the kinetic energy integral 
in the neighborhood of a line vortex is infinite, which rules this model out for evaluating induced drag in terms 
of kinetic energy. For the kinetic energy to be finite, the vorticity must have finite strength and must therefore 
be spread out over a finite area. The simplest vortex model that does this is the so-called Rankine vortex we 
described in Section 3.3.8 and Figure 3.3.8f, 1n which the vorticity is assumed constant inside of a circular core 
of radius rc, outside of which the flow is assumed irrotational. So a simple model for the rolled-up wake behind 
a lifting wing consists of two Rankine vortices of strength I> and radius re with their centers separated by a 
spanwise distance by, as shown in Figure 8.3.3. Now three parameters, Io, bo, and re, are sufficient to determine 
the drag. If [o, bo, and the drag are known, 7¢ can be adjusted to match the drag. This idea of adjusting the core 
radius to match the drag began with Prandtl (see Spalart, 2008). 


Figure 8.3.3 Idealized model for the rolled-up vortex wake used in the theories of Spreiter and Sacks (1951) 


and Milne-Thomson (1966) (drawn to scale for r¢/b = 0.0855, r¢/bo = 0.1089) 
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One problem with a wake model based on Rankine vortices is that the combined flowfield associated with 
two such vortices 1s not consistent with maintaining the circular boundaries of the cores over time. (The flow 
outside the Rankine cores should be the same as the flow around two point vortices, and this flow does not have 
circular streamlines of radius re centered on the vortices.) This has not discouraged use of the model, however, 
and the problem has been sidestepped in at least two different ways, neither of which is entirely satisfactory. 

Spreiter and Sacks (1951) sought to work around the problem by taking advantage of the fact that the stream- 
lines of the flow around two point vortices are actually circular, just not all centered on the vortices. They noted 
that only the streamline circles of zero radius are centered on the vortices and that as we look at larger radii, the 
streamline circles have centers shifted increasingly outboard. They therefore placed the Rankine cores so that 
their boundaries matched the streamline circles of radius re, so as to align the core boundaries with streamlines 
of the irrotational flow surrounding the cores. The trouble with this is that when the cores are positioned 1n this 
way, their centers don't coincide with the locations of the original point vortices, so that the flow they “induce” 
isn't consistent with the assumed flow outside the cores. 

Milne-Thomson (1966) also assumed the flow outside the cores 1s consistent with two point vortices but did 
not shift the centers of the Rankine cores outboard from the locations of the point vortices, arguing that this 
was valid as long as the core radius is small compared to the separation. However, the core radius he finally 


deduces turns out not to be that small. So this is also an inconsistency, just a different one from that of Spreiter 
and Sacks. 

In spite of the inconsistencies of both of these models, the calculations for the kinetic energy can be carried 
out. For the Milne-Thomson version, the result is 


D; = p= (; + log =) , 
8.3.6 ax \4 re, 


where Io is the circulation of the cores. To see what this implies about the size of the cores behind a typical 
wing, assume an elliptic load distribution on a wing of span 5, for which the classical theory of Section 8.3.4 
gives 


} 
mpl 
D; = 2 
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and 
b, = —b. 
8.3.8 
Then the core radius is given by 
r. | _ 
— = 0.0855, 
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The model is thus kinematically inconsistent. But the more serious problem with Rankine vortex cores is that 
they don't model the flow in the rolled-up wake behind a wing well at all, as we saw in Figure 8.1.8. 


§.3.4 Induced Drag from the Loading on the Wing Itself: 
Trefftz-Plane Theory 


Here we seek to infer induced drag from the loading on the wing itself, without laboriously computing the entire 
flowfield. This requires making simplifying assumptions not just about the farfield wake, but regarding the de- 
velopment of the wake all the way from where it leaves the wing trailing edge. Trefftz-plane theory does this by 
ignoring the rollup of the trailing vortex sheet and assuming instead that the vortex lines stream straight back in 
the freestream direction from where they are shed at the trailing edge, as illustrated in Figure 8.3.4. This theor- 
etical framework was established in the early 1900s (Prandtl and Tietjens, 1934) and is still in use today. The 
formal justification is that neglecting the deformation of the wake should be valid in the limit of small loading 
or high aspect ratio. The practical justification is that the resulting theory gives reasonably accurate results for 
practical loadings and aspect ratios. 


Figure 8.3.4 Assumed wake model in the Trefftz-plane theory of induced drag, with the vortex lines running 
straight back in the freestream direction 
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A real vortex wake, outside of the rolled-up portion at the outboard edge, is a finite-thickness shear layer 
that aligns itself with the flow on both sides, has no flow through it, and supports very little pressure difference 
across it. At the next level of idealization, this shear layer would be modeled as a thin vortex sheet that 1s also a 
stream surface of the flow, having, by definition, no flow passing through it and no force on it. No force means 
there can be no pressure jump across the sheet. In a shock-free inviscid flow, no pressure jump means there can 
be no jump in velocity magnitude across the sheet, only a jump in velocity direction. No jump in velocity mag- 
nitude in turn requires that the vortex lines in the sheet be aligned parallel to the average of the velocities on the 
two sides of the sheet (If the vortex lines were not aligned in this way, there would have to be a jump in velocity 
magnitude, as we noted in Section 3.3.8, and thus also a jump in the pressure, violating our no-force condition). 
So a vortex wake modeled as a force-free vortex sheet must satisfy requirements on both the shape of the sheet 
and on the alignment of the vortex lines in the sheet. 


Unlike this ideal force-free vortex sheet, the assumed wake sheet in Trefftz-plane theory, made up of vortex 
lines aligned with the free-stream direction, will generally have a nonzero component of velocity perpendicu- 
lar to the vortex lines, both from flow passing through the sheet and from vortex lines not being appropriately 
aligned within the sheet. Whenever there is a velocity component perpendicular to vortex lines, and the vortex 
lines are not being convected with the flow, the Kutta-Joukowski theorem requires that there be a “lift” force 
perpendicular to both the velocity and vorticity vectors. Thus the nonzero component of velocity perpendicular 
to the vortex lines results in local forces exerted on the wake that are small for a typical high-aspect-ratio wing, 
but nonzero. Because these forces must be perpendicular to the vortex lines, they make no contribution in the 
drag direction. So the wake in Trefftz-plane theory is not force free, but it is drag free. 

Kroo (2001) argues that because the wake assumed in Trefftz-plane theory is drag free, Trefftz-plane theory 
incurs no error in the drag calculation, and that it should therefore give the same result for the induced drag 
as a calculation that takes into account realistic distortions of the wake. I would argue that a drag-free wake 
does not guarantee this level of correctness, but only consistency in the sense that the total drag in the field 
does not contain a contribution from drag on the wake. Consistency in this sense then means that the farfield 
drag determination would agree with the drag determined by surface-pressure integration, if the flow is inviscid 
and shock-free, but it does not preclude an error in the drag, due to the incorrect positioning of the wake. The 
error due to the simplified wake model will generally be nonzero, though in most practical cases it is likely 
to be small. This is an issue we'll take up again in Section 10.4.3 when we discuss 3D CFD codes based on 
potential-flow theory, in which the wakes of lifting surfaces are often modeled in a way that is equivalent to the 
assumptions of Trefftz-plane theory. 


Trefftz-plane theory 1s applicable to a lifting system that can be made up of one or more lifting surfaces, each 
of which can be either planar or nonplanar. The theory takes the spanwise distribution of lift to be known and 
determines the induced drag either locally in terms of the backward tilt of the lift vector distributed along each 
lifting surface, or globally in terms of the total kinetic energy in a cross-flow plane (the Treffiz plane) far down- 
stream. The local determination defines both the spanwise distribution of induced drag along the surfaces and 
the total, while the global determination defines only the total. The total induced drag determined either way is 
the same. 

The flowfield velocities that are used in calculating the drag do not come from solving the equations of mo- 
tion in the flowfield, but are inferred from the idealized model of the vortex wake, through the Biot-Savart law. 
The theory thus depends on inferring velocity from vorticity, which 1s justified mathematically, but obscures the 
physical cause-and-effect relationships. As a result, the physical understanding provided by the theory 1s not all 
that we might hope for. While the theory makes very clear the relationship between the downwash distribution 
and the distribution of drag, it does not provide much intuitive physical understanding as to why a particular 
lift distribution produces a particular downwash distribution. Although the physical understanding it provides 
is minimal, Trefftz-plane theory is very valuable for its quantitative predictions, and we depend heavily on it, 
for nearly all predictions of induced drag. Historically, its predictions have been found to agree reasonably well 
with drag measurements both in the wind tunnel and in flight. 

Note that in our simplified model of the wake in Figure 8.3.4, the wake sheet forms a general cylinder (“gen- 
eral” in the sense of not necessarily circular) defined by the trailing edge and having generators in the direction 
of the freestream. The distributions of vortex strength in the wake sheet and the bound vorticity on the wing 
are defined consistently with the Kutta-Joukowski theorem (Equation 7.2.1), and Helmholtz's second theorem 
(Section 3.3.7), in the manner we described in connection with the lifting-line theory in Section 8.2.3. Given 
the spanwise distribution of lift and the geometry of the trailing edge, the distribution of vorticity in the wake is 
defined, and the downwash at each station along the span of the wing itself can be calculated using Biot-Savart. 
This determines the local backward tilt of the lift vector and the local induced drag. In the literature, this local 
determination of the drag seems to be discussed only in the context of straight, unswept wings, for which the 
contribution to the downwash from the bound vorticity on other parts of the span can be ignored. For example, 
Milne-Thomson (1966), in his Section 11.22, evaluates the downwash at the trailing edge, which 1s assumed to 
be straight, using only the contribution of the trailing vorticity. To extend this treatment to swept wings or wings 
that are otherwise not straight would entail the same inaccuracies we discussed in connection with lifting-line 
theory in Section 8.2.3. Because of these inaccuracies, local determinations of induced drag are not usually pur- 
sued for wings that are not straight, and for most purposes, it 1s only the total induced drag that matters anyway. 

The total induced drag can also be determined from the flowfield far downstream using Biot-Savart and 
Equation 8.3.5. One way of expressing the result is through the following integral expression: 


D; = 5 | v,T(£) dé. 
8.3.10 os 
where the integration is over the line or curve where the undeformed trailing vortex sheet intersects the Trefftz 
plane, and vy is the velocity “induced” by the wake perpendicular to the line or curve. The factor of 1/2 is re- 
quired when vy is evaluated far downstream, where the wake vortices appear infinite, instead of at the wing 
itself, where the wake vortices appear only semi-infinite. Only the trailing vortex wakes enter into this integra- 
tion for the total induced drag, and given our idealized model for the wakes, the distribution of vortex strength 
depends only on the spanwise distribution of lift and on the shapes of the lifting surfaces as seen in rear view, 
the so-called “Trefftz-plane view.” Thus for a given spanwise distribution of lift, the total induced drag is inde- 
pendent of the fore-and-aft arrangement, including the sweep, of the parts of the lifting system. This result was 
originally derived with reference to a biplane and 1s referred to as Munk's stagger theorem (see Kroo, 2001). 





Fore and aft disposition affects both the spanwise distribution of induced drag on individual lifting surfaces and 
the distribution of drag among multiple lifting surfaces, but not the total, provided the spanwise distribution of 
lift is held constant. Note that this doesn't generally apply when lifting surfaces of fixed shape are moved fore 
and aft relative to each other because 1n that case the lift distributions generally change. To keep the lift distribu- 
tions constant as surfaces are moved fore and aft, as required by the stagger theorem, generally requires changes 
in twist and/or camber. 

Numerical methods for calculating the rollup of the wake sheet in potential flow that in principle predict in- 
duced drag with higher fidelity than Trefftz-plane theory are available, but they are not that widely used when 
the objective is to study induced drag. CFD methods based on Euler or Navier-Stokes equations predict the 
entire flowfield in detail, including the rollup of the vortex wake. However, flow solutions provided by these 
methods present us with the same problem we encountered with the real flow; that is, how do we define what 
part of the total drag is induced drag? It 1s telling that when users of high-fidelity CFD codes want a separate 
number for the induced drag predicted by their solutions, they usually plug their calculated lift distributions into 
Trefftz-plane theory. 


8.3.5 Ideal (Minimum) Induced-Drag Theory 


What is the minimum induced drag that a wing can have? This question makes sense only if we constrain the 
total lift to a nonzero value, because if the load is zero everywhere, the induced drag is zero. So for a given 
configuration of lifting surfaces as viewed in the Trefftz plane, Equation 8.3.10 defines an optimization problem 
that can be solved for the minimum induced drag, and for the spanload that goes with it, provided we constrain 
the total lift. When total lift 1s the only constraint, and minimum induced drag is the only objective, the results 
of this optimization are called the ideal induced drag and the ideal spanload. There are various ways this op- 
timization problem can be solved, sometimes analytically, but usually numerically. We'll forego the details here 
and discuss only the general results and conclusions of the theory. 

One general conclusion has to do with the normalwash in the Trefftz plane. If the spanload 1s ideal, the com- 
ponent of the wake-induced velocity in the direction perpendicular to the wake cut in the Trefftz plane is related 
to the local dihedral angle @ of the cut: 


83.41 Vn Constant x cos(@). 


This relation was derived by Munk (1921) and is often referred to as Munk's minimum-induced-drag cri- 
terion. A normalwash distribution obeying this rule 1s illustrated in Figure 8.3.5. 


Figure 8.3.5 Illustration of the normalwash rule, Equation 8.3.11, associated with an ideal spanload. The vec- 
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For a planar wing, 8 is zero, the normalwash is constant and is the same as the downwash, which, according 
to the theory, requires an elliptic lift distribution, as illustrated in Figure 8.3.6. 


Figure 8.3.6 Elliptical spanload and constant downwash of an ideally loaded planar wing 
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In this case, we have the classic result for the induced drag: 





| L* 
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8.3.12 Tqh 
or in dimensionless terms: 
Cs 
Cpi = —. 
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This, of course, reflects the same scaling we deduced in Equations 8.3.1 and 8.3.2 and fills in the constant 
of proportionality for one particular case. The dimensionless form, Equation 8.3.13, is the better known of the 
two, which is unfortunate. It gives the misleading impression that aspect ratio plays an important role in induced 
drag. As we saw earlier, the aspect ratio in Equation 8.3.13 is really a red herring, an artifact of the nondimen- 
sionalization. It 1s clear from Equations 8.3.1 and 8.3.12 that the induced drag force depends on span, not on 
aspect ratio. 

So the well-known elliptic spanload is “ideal” for a planar (flat) wing. For nonplanar configurations, the ideal 
spanload is not generally elliptic, but it 1s easily calculated for a given geometry. With a vertical winglet added, 
for example, the ideal spanload shows less lift inboard and more lift outboard, relative to elliptic, with a certain 
optimum distribution on the winglet itself, as shown in Figure 8.3.7. Note that “lift” in this context refers to the 
aerodynamic force perpendicular to the wing locally, which in the case of the vertical winglet is a horizontal 
force inward. Also note that according to Equation 8.3.11 the normalwash produced in the Trefftz plane by a 
vertical winglet is zero. If the wing and winglet are unswept, the normalwash at the surfaces themselves is half 
what it 1s in the Trefftz plane, because the wing and winglet “see” only a semi-infinite wake sheet, whereas the 
Trefftz plane “sees” a wake sheet that is effectively infinite both fore and aft. This means that the winglet itself 
experiences no sidewash, or that the load on the winglet cancels the sidewash that would be there in the absence 
of the winglet. This is a point we'll consider further in Section 8.4.2 in connection with the common misunder- 
standing that wingtip devices like winglets generally produce induced thrust. 


Figure 8.3.7 Ideal spanload for a wing and vertical winglets, compared with the elliptic ideal loading of a 
planar wing 
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Relative to the “ideal” spanloads we've discussed here, the spanloads used on real wings are usually modified 
somewhat to reduce bending loads at highly loaded structural critical conditions and allow a lighter wing struc- 
ture, at the expense of a slight increase in drag in cruise. The presence of a fuselage and wing-mounted engines 
also tends to alter the spanloads on real wings, an effect for which we'll discuss one idealized model next. 

The fact that downwash is constant for an elliptically loaded planar wing, both at the wing itself if it is 
unswept, and in the Trefftz plane, might lead us to expect that the vortex wake would remain undistorted, as 
Trefftz-plane theory assumes. This expectation 1s unrealistic for more than one reason. Remember from our dis- 
cussion of the vortex wake in Section 8.1 that Spalart (1998) showed that it is not possible for the wake to re- 
main undistorted at the outer edge. And the “induction” by the trailing vortex sheet is not the only contribution 
to the downwash. The bound vorticity “induces” a nonuniform downwash that distorts the wake in the nearfield 
of the wing. So uniform downwash in the Trefftz plane doesn't mean that the wake remains undistorted. And 
there is another important way in which the undistorted wake assumed in the Trefftz-plane theory is unrealistic. 
The farfield descent rate of an undistorted wake sheet of an elliptically loaded wing, implied by its own self-in- 
duced velocity, is much higher (a factor of 1/2 ~ 4.9) than the descent rate of the real rolled-up wake. 


8.3.6 Span-Efficiency Factors 


Equations 8.3.12 and 8.3.13 apply to an ideal elliptically loaded planar wing. If the wing 1s nonplanar or the 
spanload is nonideal, the induced drag will differ from the planar ideal. It is often convenient to relate the actual 
drag to the planar ideal through the induced-drag span-efficiency factor, e, defined by 


a Ct 
( hi = a 
8.3.14 TARe 








For a planar wing, the actual induced drag is always greater than or equal to the ideal, so that e is always less 
than or equal to one. For a nonplanar wing, the induced drag can be less than the planar ideal, so that e can be 
greater than one. 

Another “efficiency factor,” called the Oswald efficiency factor, eo, takes into account the variation with Cr 
of the total drag, including the viscous profile drag. It is defined in practice by fitting the total drag polar with 
the following expression: 
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Recall from Section 7.4.2 that the sectional profile drag polar of a 2D airfoil typically has a roughly parabolic 
variation in the low-to-moderate range of Cy. The integrated profile drag of a 3D wing tends to behave simil- 


qi 
arly, so that the inclusion of the profile drag makes the coefficient of the C fs term in Equation 8.3.15 larger 
than it would be for induced drag alone. Thus for wings with ordinary airfoil sections, eo is usually smaller than 
e. 

An important point to note about the two efficiency factors is that eo is defined such that it can in general 
be determined by a curve-fit of a known total-drag polar, though the value generally depends on what part of 
the polar is fitted, while e cannot generally be determined just from the total drag. Sometimes this point is over- 
looked, and eo is confused with e. An example of this is seen in the claims of low induced drag that were made 
for the Winggrid wingtip device (La Roach and La Roach, 2004), which consists of a cascade of constant-chord 
airfoils appended to the tip of a conventional wing. The cascade almost certainly has high profile drag at low 
C7 s, and as a result, a range of C7 over which the profile drag decreases with increasing C;. This would make 
that portion of the total polar shallower than it would be with induced drag alone, rather than steeper, as would 
usually be the case with an ordinary single-element airfoil. The author apparently fitted this portion of the polar 
and obtained a high value of eo, from which he inferred a low value of induced drag that is probably not realist- 
Ic. 


8.3.7 The Induced-Drag Polar 


If we assume that e is constant, independent of CL, Equation 8.3.14 implies that the induced-drag polar is a 
simple parabola, with zero induced drag at zero lift. But for this to be true the wing would have to maintain the 
same spanload shape as Cz varies, which a real wing doesn't generally do. The spanload of a real wing can have 
a complicated variation with angle of attack, due to transonic and viscous effects, and the induced-drag polar 
will be correspondingly complicated. But even in the absence of nonlinear effects, the spanload of a twisted 
wing changes shape with angle of attack, and it is instructive to look at the trends predicted by simplified theor- 
ies. Combining Trefftz-plane theory for the total induced drag with the decomposition of the spanload into basic 
and additional parts (Section 8.2.1) leads to a prediction of the induced-drag polar (Rubbert, 1984). In terms of 
the bound circulation, the spanload decomposition can be expressed as 
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where Io and do are the circulation distribution and angle of attack of the wing when the total lift is zero. Sub- 


stituting this into the total-induced-drag integral, Equation 8.3.10, and doing some rearranging yields the result 
that the induced-drag polar is parabolic: 


“ a | : 2 
83.17 CDi = Cp, + AiCi + A2Cz, 


for a general twisted wing, where Cpjo and Aj are zero if Io 1s zero everywhere. Thus for an untwisted wing, 
we would have only 


(a — Ao), 


— j + 
g3.18 ©, = A2C7. 


Equation 8.3.17 can be rearranged in terms of Cz(CpDimin) and Cpimin, the lift and drag coefficients at the min- 
imum of the polar: 
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where ey7T 1s the span-efficiency factor of the corresponding untwisted wing. 
Figure 8.3.8a illustrates the parabolic induced-drag polar of a swept, twisted wing. Note that 1n this polar, the 
CDio term is significant, but the Aj term is relatively small, which seems to be typical of most real wings. The 


spanload-efficiency factor e in Figure 8.3.8b varies dramatically with Cz, showing what a serious mistake it is 
to assume e 1s constant, even though ey7 is constant. And this is another example in which confusing eg with e 
a 
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and determining e by a curve-fit to the Cj - term in the total drag would be a mistake in that it would miss the 
Cpio part of the induced drag. 


Figure 8.3.8 Induced-drag behavior of a swept, twisted wing according to Equation 8.3.17. (a) The induced- 
drag polar. (b) The corresponding span-efficiency factor e 
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8.3.8 The Sin-Series Spanloads 


There is a simple trigonometric series for representing the spanload of a planar wing that has some very useful 
properties (see Durand, 1967b). The lift per unit span is represented in terms of a series with coefficients Aj: 


: T= J 7 f f L = a 
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or in dimensionless form: 

3.21 Cl e/é = 4(b/e) Ai X(A;/ Aj) sin(iy), 
where y is the transformed spanwise coordinate: 

83.22 WV = acos(—2y/b). 
The integrated lift involves only the first term of the series: 

3.3.23 Lb = (m/2)pU,,b° Ag, 


or in dimensionless form: 


9324 CL =7 AR A. 


Only the odd-numbered terms are usually used, as they are the ones that are symmetrical about the center 
plane. The first three odd-numbered terms are plotted in Figure 8.3.9. The first term represents an elliptic load- 
ing, while all the higher terms carry no net lift, which is why they don't appear in Equation 8.3.24. 


Figure 8.3.9 The first three odd-numbered terms in the sin-series expansion for representing spanloads 








When the spanload expression 8.3.20 is introduced into Equation 8.3.10 for the induced drag, the result is 
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or in dimensionless form: 
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93.2 Coi = (CL/(a AR))=i(Aj/A1)*, 
so that the span efficiency factor is given by 
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The wing-root bending moment Mgr due to the lift distribution also has a simple expression: 


93.0 2Mpr/Lb = (4/(37)) [1 + (3/5)(A,/A,) +...] 


The first two odd-numbered terms of the series provide a convenient way to generate spanload shapes that are 
compromised from elliptical to reduce bending moments and thus reduce wing structural weight. Using just the 
first two terms gives the least increase in induced drag for a given reduction in root moment. A family of such 
spanloads is illustrated in Figure 8.3.10, and the e-versus-bending moment curve is plotted in Figure 8.3.11. 
Note that induced drag increases (e decreases) whether root bending moment decreases or increases, consistent 
with fact that the baseline spanload is the elliptic optimum. Note also that e decreases only quadratically with 
the deviation from the optimum, something we'll see again in connection with the effect of a tail or canard on 
induced drag in Section 8.3.11. The first two odd-numbered terms also provide a convenient way of sketching 
spanloads of different shapes for illustration purposes, which was used in generating the sketches in Figures 
8.2.1 and 8.2.2. 


Figure 8.3.10 A family of spanloads compromised from elliptic to reduce wing-root bending moment using 
the first two odd-numbered terms in the sin-series expansion 
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Figure 8.3.11 The variation of e with Mgp for spanloads using the first two odd-numbered terms in the sin- 
series expansion 
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8.3.9 The Reduction of Induced Drag in Ground Effect 


When a wing flies close to the ground the no-through-flow condition at the ground forces the flow field around 
the wing to change in a way that reduces the downwash in which the wing is flying. This has the general effect 
of increasing the lift at a given angle of attack, as we saw in Section 8.2.4, and reducing the induced drag for a 
given lift. 

For calculating this reduction in induced drag, it is helpful to invoke the idea of images, just as it was for cal- 


culating the increase in lift in 3D ground effect in Section 8.2.4. A typical result of a panel-method calculation 
md 

is Shown in Figure 8.3.12. Note that both Cp; and the change ACp; due to ground effect are of order Cj “, SO 

that the drag reduction expressed as a ratio Cpj/CDio doesn't depend on C7. 


Figure 8.3.12 The reduction in induced drag in ground effect at fixed lift. Results of a panel-method calcula- 
tion for a planar wing with elliptic planform, AR = 10, no twist 
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It's clear from Figure 8.3.12 that the ground produces substantial percentage reductions in induced drag when 
h/b < 1. This can be important in many types of flight on minimal power, as, for example, in human-powered 
flight and the long-distance flight of some birds over water. The idea of specifically designing a transport air- 
plane to take advantage of ground effect has been explored, especially for flight over water. The Ekranoplans 
(Russianized French for “screen planes’’) developed in the former Soviet Union were the largest and most highly 
developed of such craft (see Scott, 2003). The operational disadvantages of flying very close to the surface 
(relatively low cruise speed, the possibility of conflicts with surface vessels, and sensitivity to weather) have 
discouraged widespread applications. 

In Section 8.2.4, we saw that the change in lift in ground effect on a 3D wing tends to be dominated by the 
3D change in downwash, and of course the reduction in induced drag we've discussed in this section is also a 
3D effect. But there is occasionally confusion in this regard. One example is the explanation of the induced- 
drag reduction proposed by Anderson and Eberhardt (2001). This explanation never refers to the finite span of 
the wing, but relies instead on a 2D reduction in a purported “loading due to upwash,” which we showed in 
Section 7.3.4 doesn't exist. And Anderson and Eberhardt compounded the error by trying to use a 2D argument 
to explain a 3D effect. 


§.3.10 The Effect of a Fuselage on Induced Drag 


So far, we've considered 3D lifting flow and induced drag only for lifting surfaces by themselves. The presence 
of a fuselage that typically encloses or replaces part of the wing complicates the situation in several ways. The 
pressure field produced by the wing is imposed on the fuselage, so that the fuselage also produces lift, but gen- 
erally significantly less than a continuation of the wing would produce in the absence of the fuselage. Even if 
the fuselage sheds no vorticity, so that the bound vorticity and circulation of the wing carry across the fuselage 
undiminished, the fuselage needn't produce the corresponding lift loading because the Kutta-Joukowski theor- 
em doesn't apply to such a low-aspect-ratio body. The presence of a fuselage also affects the downwash field 
and the kinetic energy left behind in the farfield, and thus affects the induced drag. 

In early simplified theories, the fuselage was modeled as an infinite cylinder, extending forever fore and aft 
(Lennertz, 1927, and Pepper, 1941). Such models by definition fail to deal with one of the most important issues 


from a practical standpoint, which is the effect of the aft closure of the body and the distortion of the vortex 
wake that it causes. 

A highly simplified model for the fuselage-closure effect was developed by Nikolski (1959). In this model, 
the trailing-vortex lines shed from the trailing edge of the wing are assumed to follow streamlines of the body- 
alone flowfield, an assumption that should be valid in the limit of small lift loading, just as in conventional 
Trefftz-plane theory. For the most common implementation of the theory, we further assume: 

1. The body is axisymmetric and at zero angle of attack, 
2. The wing 1s “planar” with its trailing edge in the same horizontal plane as the body axis, and 
3. The velocity disturbance due to the body at the location of the wing is negligible. 


Given assumptions | and 2, the trailing-vortex sheet is in the horizontal plane of symmetry of the body flow- 
field and therefore remains planar. The vortex lines simply “neck in” with the flow closing 1n around the body, 
as shown in Figure 8.3.13. A vortex line that leaves the trailing edge at a distance y (z in Nikolski's drawing, 
Figure 8.3.13) from the axis ends up far downstream at a distance y’ such that the area of the circular streamtube 
of radius y’ is the same as that of the original annular streamtube between the body radius rp and y. This leads 

r2 2 
to» —¥ nh hs as the rule that defines the distribution of shed vorticity in the Trefftz plane as a function 
of the distribution shed from the trailing edge. Thus the wake in the Trefftz plane looks as if 1t were shed by a 


b’ = ,/b* —r? 

hypothetical wing alone with a reduced span V » To complete the model, we assume that the 
lift and induced drag of the wing-body combination are the same as for this hypothetical wing alone, because 
both configurations produce the same wake in the Trefftz plane. 


Figure 8.3.13 Trailing-vortex lines in Nikolski's model for the effect of fuselage closure on induced drag. 
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Because we've assumed that both configurations produce the same total lift, the difference between the lift 
on the hypothetical wing alone and the lift on the exposed wing (the wing outside the body) can be interpreted 
as an estimate of the “carry-through” lift induced on the body by the wing. This relationship between the load- 
ings 1s illustrated in Figure 8.3.14, where the carry-through lift on the body is shown as a constant, because the 
theory defines only the total body lift, not its “spanwise” distribution on the body. 


Figure 8.3.14 Illustration of the spanload on a wing-body combination and the hypothetical equivalent wing 
alone, according to Nikolski's (1959) model 
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Implementing the model for an arbitrary spanload on the wing of a wing-body combination, especially if 
the model is extended to a wing that is nonplanar, generally requires numerical integration to obtain the carry- 
through lift and the induced drag, as is done in the WINGOP code (Craig and McLean, 1988). 

For the ideal planar case, analytic expressions have been derived. The ideal spanload for the planar case is 
elliptic in the Trefftz-plane (the hypothetical wing alone), and when this is mapped back to the physical wing, 
the loading also elliptic, centered on the airplane axis, with only a truncated part of the elliptic load showing up 
on the exposed wing. The spanloads plotted in Figure 8.3.14 show this ideal case in a quantitatively correct way 
for 2rp/b = 0.2. The body carry-through lift predicted by the ideal relationship turns out not to differ much from 
the result of Lennertz for the infinite-cylinder body, as shown in Figure 8.3.15. 


Figure 8.3.15 Comparison of fuselage carry-through lift predicted by Nikolski's (1959) theory for the effect of 


fuselage closure and Lennertz's (1927) theory that represents the fuselage as an infinite cylinder 
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For the ideal case, the induced drag is given by 


nh. — L* 
9329 mq(b? — df)’ 


where dp is the body diameter. Comparing this with Equations 8.3.13 and 8.3.14, we see that it corresponds to 
an induced-drag efficiency factor given by 
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6.3.11 Effects of a Canard or Aft Tail on Induced Drag 


True flying-wing airplanes are relatively rare. At least two lifting surfaces, with some longitudinal distance 
between them, make it much easier to satisfy requirements for /ongitudinal trim (zero pitching moment) and 
longitudinal static stability (the tendency to return to the trimmed condition after a disturbance). Thus most air- 
planes are configured with a main wing and a smaller auxiliary lifting surface, either a forward canard, or, more 
commonly, an aft horizontal tail (stabilizer). And thus the question arises how the lift loads on these additional 
surfaces affect the induced drag. 

First, let's look at how the trim and stability requirements affect the lift loads the auxiliary surfaces must 
carry. Longitudinal static stability depends on how the aerodynamic pitching-moment of a configuration 
changes with angle of attack, which is determined primarily by the planforms and positions of the lifting sur- 
faces and to a lesser degree by the fuselage and engine nacelles. Of course, it is the moment about the center 
of mass, or center of gravity (CG), that matters, so the other important factor in longitudinal stability is the loc- 
ation of the CG relative to the aerodynamic configuration. The requirement for positive longitudinal stability 
generally places an aft limit on the CG location, while the requirement to be able to trim the airplane over a 
range of angles of attack without exceeding the maximum-lift capability of either surface generally defines a 
forward limit. So stability and trim together define a usable range of CG locations. Within the usable CG range, 
trimming the airplane for a particular total lift coefficient will require a particular division of the lift between 
the two lifting surfaces. 

For a canard configuration to be stable, the CG must generally be so far forward that the canard must lift 
upward to trim the airplane at a positive total lift coefficient, and thus trimmer is an alternative name for a ca- 
nard surface. The lift on an aft tail can be in either direction, depending on how far aft the CG is located. The 
relatively large horizontal tails of some free-flight model airplanes provide ample stability with the CG far aft, 
so that an upward load on the tail is required for trim. Most full-sized airplanes have relatively small tails and 
farther forward CG locations, so that the tail must carry a downward load. Of course, the down load must be 
offset by increased lift on the main wing. 

Proponents often argue that the canard configuration is superior because it is better to have both surfaces lift- 
ing up than to have one lifting up and one lifting down, but if we look just at induced drag, we find no support 
for this argument. For example, consider the simplest idealized comparison we can make: 

1. The total lift on the airplane 1s fixed, 


2. The lifting surfaces are coplanar, so that the total induced drag of each configuration is the same as if 
the total spanload acted on a single lifting surface, and 

3. The canard and aft tail to be compared have the same span and carry the same spanload, just of opposite 
signs, and the loadings on the main wings have the same offset from the elliptic ideal, also of opposite 
signs, as illustrated in Figure 8.3.16. 


Figure 8.3.16 [llustration of the simple argument for the induced-drag equivalence of a canard and an aft tail. 
(a) Uploaded tail or canard. (b) Downloaded tail 
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The ideal wing alone with an elliptic loading has the minimum induced drag. The total spanloads for the 
aft-tail and canard configurations represent the same perturbation from the ideal, just in opposite directions. If 
we represent the total spanloads by the sin series of Section 8.3.8, Equation 8.3.20, both must have the same 
coefficient A; because the total lift is the same. For lateral symmetry, only the odd-numbered coefficients are 
nonzero, and the deviations of the spanloads from elliptic are thus represented by odd A;'s for 1 > 3. Because the 
deviations are of opposite sign, corresponding Aj,'s are of opposite sign, and according to Equation 8.3.25 the 
total induced drag is the same. Here I've glossed over the difficulty the sin series would have in representing the 
kink in the total spanload at the tail or canard tip, but the conclusion is probably still valid. 


The above argument indicates no fundamental advantage of a canard over an aft tail in terms of induced drag. 
But it overlooks some significant practical issues. First, the uploads on canards are typically much larger than 
the downloads on aft tails. And an elliptic-looking loading on the main wing would be difficult to achieve in 
the presence of the downwash from a coplanar canard. Besides, it would be better to allow the downwash from 
the canard to depress the loading on the inboard part of the main wing, and thus to produce a total load closer 


to elliptic. Another issue is that many real-world configurations are not coplanar, that is there is often a vertical 
gap between the lifting surfaces. 

A nonzero vertical gap between the lifting surfaces has a significant effect on their combined induced drag. 
How the total induced drag varies as a function of gap depends on how the spanloads on the two surfaces vary. 
For a look at the gap effect, there are several options regarding the spanloads: 

1. Assume some arbitrary, fixed spanloads, say elliptic on both surfaces, 

2. Assume fixed planforms and twist and camber distributions for the surfaces and solve a lifting-surface 
analysis problem for the spanloads that go with them, or 

3. Optimize the loadings on both surfaces for minimum drag, with a constraint on total lift and on either 
the percentages of the lift carried by the two surfaces or on the total pitching moment, for whatever gap 
is chosen. 

Fixed geometry (option 2) is no longer very relevant to modern design practice. Optimized loadings (option 
3) are the most interesting, and we'll compare them with imposed elliptic loadings (option 1) for theoretical in- 
terest. 

Figure 8.3.17 shows Trefftz-plane induced drag as a function of vertical gap for cases in which the second 
lifting surface has 40% of the span of the wing. In one, the small surface carries a down load equal to 10% of 
the total lift, as might be the case with an aft tail. The other two cases are up loads of 10% and 30% of the total, 
spanning a range that might apply to a canard. The basis of comparison Dj elliptic 1s the ideal induced drag of a 
planar wing carrying the same total lift, with no load on another surface. The calculations are based on Trefftz- 
plane theory and were carried out in the Boeing WINGOP code (Craig and McLean, 1988). We can make the 
following observations: 


Figure 8.3.17 Variation of induced drag with vertical gap for aft tails or canards with optimized spanloads with 
a constraint on the percentages of the total lift carried by the two surfaces. Results for elliptic loadings 1m- 


posed on both surfaces are shown for comparison. Tail or canard span is 0.40b 1n all cases 
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1. For gap ratios greater than 0.2—0.3, there is practically no difference in drag between elliptic loadings 
and optimized loadings. 


2. At the large-gap end, the drag approaches the limiting the case of infinite gap, or two surfaces flying 
in isolation, for which the optimum is to have each surface carry an elliptic loading; and the total induced 
drag is just the sum of the two ideal induced drags. These limits are indicated by horizontal lines to the 


right in Figure 8.3.17. 

3. At zero gap, the optimized drag goes to an idealized limiting case in which the total loading 1s elliptic, 
and the total induced drag is just the ideal induced drag of a planar wing. The spanload on the canard or 
tail is not uniquely defined in this case; it has only to satisfy the assumed total canard or tail load required 
for trim. Then whatever the spanload on the canard or tail is, the wing must carry a load distribution that 
compensates for it, so that the total load is elliptic. This is an ideal that is not practically realizable, es- 
pecially if the canard or tail has a spanload with infinite slope at its tip, as 1s usually true for all practical 
purposes. Imagine an elliptically loaded canard that sheds the usual vortex wake with concentrated vorti- 
city outboard that rolls up. The wing would somehow have to capture that vorticity and cancel it in order 
to leave behind a wake equivalent to that of an elliptically loaded wing alone. So the limiting case of zero 
gap gives us a well-defined theoretical value for the drag, just not a practically realizable one. 


4. The 0.1 and — 0.1 loading cases with elliptic spanloads go to the same drag level at zero gap, somewhat 
above the ideal, consistent with our earlier argument that they should differ from ideal by the same 
amount. 


5. For gap ratios less than about 0.2, there is an advantage to optimizing the spanloads, relative to elliptic, 
but it is small for the smaller tail or canard loadings. Shortly we'll compare the optimum and elliptic span- 
loads for the small-download case indicated by the diamond symbol, typical of many aft-tail configura- 
tions, where the predicted advantage of optimization is only a couple of percent. For large canard load- 
ings, a very small gap is predicted to be good, if the loadings are optimized, and the predicted advantage 
for optimization is large. But remember that the zero-gap optimum requires the aft surface to “capture” 
and cancel the vorticity from the forward surface. In this case, only a fraction of the advantage of optim- 
ization 1s probably realizable. 


6. Having a second surface of small span carry part of the load can reduce the induced drag only if the 
load on the second surface is small and upward, and the vertical gap is larger than about 0.2. Most prac- 
tical, stable canards are probably closer to the 30% load case shown, for which the second surface exacts 
a sizeable penalty in induced drag unless the vertical gap 1s very large. 


Note that according to Munk's stagger theorem these results are independent of the longitudinal positions of 
the lifting surfaces. Also note that they should be taken only as indicating qualitative trends, given that they 
ignore the effects of rollup of the forward surface's vortex wake. 

It is interesting to note that for an aft-tail configuration with a vertical gap typical of a large low-wing air- 
plane, the spanload on the wing that gives minimum total induced drag is significantly altered from elliptic. This 
effect is illustrated in Figure 8.3.18 for a vertical gap of 8% of wingspan (the case indicated by the diamond 
symbol in Figure 8.3.17). For comparison with the optimum spanload, an elliptic spanload and the optimum 
spanload for the idealized case of zero gap, assuming elliptic spanload on the tail, are shown. The calculated 
Trefftz-plane optimum for the spanload on the wing is a kind of washed-out compromise between these limit- 
ing cases. Note that on the tail there is very little difference between the optimum spanload and elliptical. Also 
recall from Figure 8.3.17 that the difference in drag between the optimum and elliptic spanloads is small. 


Figure 8.3.18 Optimum spanloads for a wing and a tail or canard with 40% span ratio and 8% vertical gap and 
downward load equal to 10% of the total lift. An elliptic loading on the wing and an optimum loading for zero 
gap (an elliptic loading with a “lump” in the middle, so as to have an elliptic total loading) are shown for com- 
parison. The wing spanloads all include an excess over the total lift, to make up for the download on the tail 
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In some treatments of this topic, the induced drag of a two-surface configuration is decomposed into four 
pieces. Because the downwash experienced by either surface is a first-order quantity, 1t can be decomposed into 
a “self-induced” contribution from the same surface and a contribution “induced” by the other surface. The total 
induced drag can therefore be expressed as the sum of two “self-induced” parts and two parts “induced” on one 
surface by the other. This approach was convenient in the old days when simplified assumptions were often used 
in estimating the “shared” parts. Now that numerical Trefftz-plane calculations are so easy to do on a computer, 
it is just as easy to skip the decomposition and calculate the total drag using Equation 8.3.10. However, there is 
some intuitive appeal to thinking of the drag in terms of its separate parts. For example, for a downloaded aft 
tail flying in the downwash from the wing, the contribution “induced” by the wing on the tail is a small thrust 
that offsets some of the “self-induced” parts of the drag. 


8.3.12 Biplane Drag 


For a given span and total lift, a biplane with some vertical gap between the wings has lower ideal induced drag 
than a monoplane, according to Trefftz-plane calculations by Munk and Prandtl (see Thwaites, 1958). Their cal- 
culations covered a range of span ratios between the two elements of the biplane, but here we'll look just at the 
case where the two elements are of equal span. In that case, the ideal spanload is the same for both elements, 
with each carrying half the total lift, and the ideal induced drag depends on the vertical gap as shown in Figure 
8.3.19. Note the substantial drag reductions for relatively small gaps, 17% drag reduction for a 10% gap, for 
example. Also note that the curve is headed for an asymptote at 50% drag reduction as the gap goes to infinity. 
In this large-gap limit, we have two elliptically loaded wings that don't significantly interfere with each other, 
each carrying half the lift and a quarter of the induced drag of the monoplane, for a total of half the induced drag 
of the monoplane. 


Figure 8.3.19 Ideal induced drag of a biplane of equal spans as a function of vertical gap, relative to the in- 
duced drag of an elliptically loaded monoplane of the same span. (Calculated by Munk and Prandtl, from Th- 


waites, 1958) 
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Given this induced-drag advantage of a biplane over a monoplane of the same span, why aren't more air- 
planes configured as biplanes? The reasons are several. The biplane arrangement generally incurs higher profile 
drag and manufacturing cost. Structural weight can either favor a biplane or penalize it, depending on the struc- 
tural arrangement. If a biplane is configured with the same span and wing area as a monoplane, the biplane 
wings would have only half the chord of the monoplane, and for the same airfoil thickness ratio would have only 
half the physical thickness. The smaller thickness would entail a large structural weight penalty unless external 
bracing (struts and wires) were used. Of course most actual biplanes use external bracing, saving considerable 
structural weight but incurring a substantial viscous-drag penalty. 

In the early days of aviation, the preference for biplanes was largely driven by structural weight, as influen- 
ced by the airfoil technology of the time. Most of the airfoil data that were available early on were taken in wind 
tunnels like those built by the Wright brothers, in which models were very small and chord Reynolds numbers 
were very low. Data at very low Reynolds numbers generally indicate a heavy drag penalty for airfoil thickness, 
and early designers therefore assumed that airfoils should be quite thin, which favored the externally braced 
biplane arrangement. 

Apart from the drag of struts and wires, the airfoil profile drag of a biplane is an interesting issue and is the 
subject of some popular misconceptions. When two airfoils are placed in vertical proximity, there is a 2D invis- 
cid interference effect that reduces the lift at a given angle of attack, compared with that of the isolated airfoils. 
When there is no longitudinal stagger between the wings, this is partly a local-q effect, similar to the one we as- 
sumed in our simple model for 2D ground effect in Equation 7.4.2, and partly an induced-camber effect. Some 
commentators (Garrison, 2008, for example) infer from this lift loss a serious loss in sectional “efficiency” for 
biplanes. However, inferring an “efficiency” loss from the inviscid lift loss seriously overstates the case. It's true 
that the lift loss implies a substantial reduction in sectional L/D at fixed angle of attack, but the full reduction 
needn't be accepted because the lift loss is easily compensated by an increase in angle of attack. And presum- 
ably we could even compensate for the details of the induced-camber effect by redesigning the airfoils to pro- 
duce something close to the same pressure distribution as an isolated airfoil, at least at one operating condition. 


What cannot be compensated without thinning the airfoils is the induced thickness effect, that is, the effective 
thickness of the airfoils is increased by their proximity. Indeed, when the viscous drag polar of a biplane pair 
of airfoils is calculated, the drag is shifted upward by a modest amount relative to the isolated case at the same 
chord Reynolds number, consistent with an increased effective thickness. The actual loss in maximum sectional 
L/D is not nearly as gross an effect as that implied by the lift loss at constant angle of attack. In practical design 
applications, chord Reynolds number is also an issue. A biplane will usually have smaller chords than a com- 
parable monoplane, and the lower Reynolds number will incur some increase in profile drag. 


8.4 Wingtip Devices 


The idea of a beneficial wingtip appendage or “device” has been around since the early twentieth century, when 
theoretical calculations first indicated that a vertical endplate added to a wingtip would reduce the induced drag. 
Early on, however, reality did not live up to the theoretical promise. The simple flat endplate turned out to be a 
disappointment in practice because the added viscous profile drag more than offsets the saving in induced drag, 
and the device fails to produce a net benefit. Whitcomb (1976) seems to have been the first to recognize that it is 
possible to reap the induced-drag benefit of an endplate, and at the same time to realize a net benefit, by keeping 
the additional profile drag to a minimum through good aerodynamic design practice. The direct result of Whit- 
comb's work is the classic near-vertical winglet. Less directly, Whitcomb's paradigm of applying good design 
practice has also contributed to the development of concepts other than the winglet. Both winglets and tapered 
horizontal span extensions (raked tips) have been put into commercial service, and several other device concepts 
have also been proposed and brought to varying levels of development (see Figure 8.4.1). 


Figure 8.4.1 An assortment of wingtip-device concepts. (a) Raked tip. (b) Blended winglet. (c) 747-400 canted 
winglet. (d) C-17 canted winglet. (e) MD-11 style up/down winglet. (f) MD-12 style up/down winglet. (g) API 
spiroid. Used with permission of Aviation Partners, Inc. (h) Tip feathers. (1) Tip fence 
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From an aerodynamicist's point of view, the motivation behind all wingtip devices is to reduce induced drag. 
Beyond that, as Whitcomb showed, the designer's job is to configure the device so as to minimize the offset- 
ting penalties, so that a net performance improvement is realized. For any particular airplane and tip device, the 
performance-improvement can be measured relative to the same airplane with no tip device. 


In Section 6.1.3, we noted that it is not possible to decompose the drag exactly into component parts, but that 
with the help of idealized theoretical models, it is possible to estimate an induced-drag component. In Sections 
8.1 and 8.3, we discussed the physics and the theory of induced drag, including a correct understanding of the 
role of the vortex wake. Because the vortex wake has been the source of so much confusion regarding how 
wingtip devices work, we'll take the time to set the record straight on that score before proceeding. 


8.4.1 Myths Regarding the Vortex Wake, and Some 
Questionable Ideas for Wingtip Devices 


With our background so far, we are ready to discuss two common misunderstandings regarding the nature and 
role of the vortex wake. I'll refer to these as the “compactness myth” and the “induction myth,” and after defin- 
ing them and explaining where they go wrong, I'll discuss some of the erroneous tip-device ideas that arise from 
them. 

The compactness myth is simply the idea that the vortex wake consists of vortex cores that spring from the 
wingtips and are quite compact from the start. Illustrations that present misleading views of the vortex wake, 
such as the one in Figure 8.4.2, are common, and they have helped to perpetuate the myth. The water-vapor 
condensation trails that can sometimes be seen streaming from flap edges or wingtips under humid conditions 
can also be misleading. These trails tend to mark only an inner portion of the core and give the impression that 
the core is more compact than it really is. The correct view, as we saw in Figure 8.1.4, is that the vortex wake 
starts as a sheet and that the wake rollup process generally produces cores that are relatively diffuse. A very 
compact vortex would require shedding all of the vorticity from the tip itself, which would in turn require a 
uniform spanload. A uniform spanload simply can't be generated by a wing of any reasonable shape, given the 
strong downwash that such a loading would entail near the tips. The compactness myth is a simple misunder- 
standing that by itself wouldn't cause much harm, but when it is combined with the induction myth, the potential 
for serious mischief arises. 


Figure 8.4.2 A misleading view of a compact “wingtip vortex.” 
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The induction myth is more complicated and involves a serious misunderstanding of cause and effect. The 
trailing vortex sheet and the rolled-up vortex cores are often seen as the direct cause of the velocities everywhere 
else in the flowfield and thus also the cause of induced drag, but this view is mistaken. It is true that when a 3D 
wing produces its characteristic large-scale flow pattern, as illustrated in Figure 8.1.2, there must be a vortex 
sheet shed from the trailing edge, but the vortex sheet is not a direct physical cause of the large-scale flow; it 1s 
more of a manifestation. The induction myth reflects a common misunderstanding of what the Biot-Savart law 
represents, a misunderstanding we discussed in Section 3.3.9. 

So what kind of mischief can result from the combined compactness and induction myths? The induction 
myth leads us to think of induced drag as being “caused” by the vortex wake, and thus to think that by doing 
something very local to change the flow in the core of the “tip vortex” we can have a large effect on the in- 


duced drag. To compound the error, the compactness myth leads us to think we can influence the induced drag 
by acting just on a very small part of the flow. This kind of thinking has spawned many questionable ideas for 
novel wingtip devices. A common theme is to provide an inlet that swallows the tip vortex itself, or some of 
the flow that would otherwise become part of the vortex, and to exhaust it straight back, presumably with its 
swirl and thus its vorticity removed. A schematic illustration of a device of this kind is shown in Figure 8.4.3. 
Specific proposed devices based on similar ideas are subjects of U.S. patents by Loerke (1937), Frakes (1984), 
and Hugues (2005). A naive view might lead one to expect such a device to produce dramatic reductions in the 
strength of the tip vortex and in the induced drag. However, there is no reason to expect that any such device 
can provide a reduction in induced drag beyond what can be explained as the result of an increase in physical 
span when the device 1s added. 


Figure 8.4.3 Schematic plan view illustrating a class of tip device that cannot work as intended 
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The device concept of Figure 8.4.3 has a fatal flaw. It is based on thinking that we can alter the global flow 
pattern of Figure 8.1.2 by tinkering with the tip vortex, without having to change the overall distribution of lift 
on the wing. This is wrong, of course. First, we shouldn't expect to be able to alter the general velocity field 
remotely through the vorticity in a limited region, because as I argued in Section 3.3.9, the vorticity at one 
point does not directly “induce” velocity elsewhere. Second, unless the overall distribution of lift on the wing 1s 
changed, the global flow pattern cannot be significantly altered. 

But how can this be? If Biot-Savart is correct, and a device like that shown in Figure 8.4.3 succeeds in swal- 
lowing part of the vortex and “straightening” it out, why isn't the global flow pattern changed? The answer is 
that a relatively small device can only rearrange the vorticity locally; it cannot significantly change the total 
vorticity flux, as measured, say, by the circulation around each half of the vortex wake. Generally speaking, in 
this kind of device it is kinematically impossible to eliminate the vorticity flux from a streamtube without pro- 
ducing a compensating vorticity flux adjacent to it. 

Let's look at this in a little more detail. Consider a device that swallows an entire vortex into an inlet. Ahead 
of the inlet, the vortex retains its full vorticity flux and circulation. Inside the duct however, the circulation of the 


entire captured streamtube must be zero, even before the flow has gone through any straightener. This follows 
from the no-slip condition on the duct wall. The vortex's original vorticity can still be there in the middle of the 
duct, but its net vorticity flux is exactly canceled by the net vorticity flux that has been automatically generated 
in the duct-wall boundary layer. This process involves dynamics, of course, but the net vorticity flux cancela- 
tion is a kinematic necessity arising from the no-slip condition. So the internal flow ahead of any straightener 
contains a vortex in the middle and an equal net vorticity flux of opposite sign in the wall boundary layer. An 
internal flow straightener could now, in principle, zero out all of this internal vorticity locally, but the net in- 
ternal vorticity flux was already zero ahead of the straightener. Now consider the flow outside the device. On 
any closed contour outside the device's external boundary layer, we should see a circulation approximately the 
same as that of the original vortex. The net vorticity flux associated with this circulation has been automatically 
generated in the external boundary layer, also by kinematic necessity. Finally, when all of this rearranged vorti- 
city, both internal and external, is shed into the wake, the circulation of the wake and the large-scale flow field 
outside it will not have been greatly changed, and there will have been no dramatic reduction in induced drag. 
This is similar to what we noted in Section 6.1.10, that it is impossible for a propeller to produce a net axial 
vorticity in its slipstream. 


8.4.2 The Facts of Life Regarding Induced Drag and 
Induced-Drag Reduction 


We have seen that induced drag is a result of large-scale air motion produced by the lifting system. This motion 
is not physically “induced” by the vortex wake, but is a response to the lift force and depends on the overall lift 
distribution. When we try to go beyond this on an intuitive level, we are limited to very general observations, 
such as that increasing the span of a wing generally reduces induced drag. For anything more specific, espe- 
cially regarding any device other than a simple span extension, we must rely on quantitative predictions, usually 
from Trefftz-plane theory. Unfortunately, as we noted in Section 8.3.4, the theory does not generally provide for 
a simple intuitive understanding of how the details of a particular configuration or lift distribution will affect the 
drag. 

Based on our general appreciation of the physics, we can anticipate that drag-reduction devices need to be 
fairly large as viewed 1n the Trefftz plane, because any significant reduction in induced drag requires changing 
the global flowfield associated with the lift, so as to reduce its total kinetic energy. We know that we can't do 
this just by tinkering with the “tip vortex” and thus that having a significant effect on the drag requires a signi- 
ficant change in the way the lift is distributed spatially. If our starting point is a wing on which the lift is already 
advantageously distributed, the only way to improve will be to provide a significant increase in the horizontal 
span or to introduce a nonplanar element that has a similar effect. The quantitative theory tells us that the effect 
on drag will be roughly proportional to the horizontal and/or vertical span of the device and that a small device 
can therefore produce at most a small drag reduction. 

There is a common misunderstanding that a wingtip device reduces drag by producing thrust on the surfaces 
of the device itself. For example, there is the popular explanation that likens a winglet to a sailboat beating into 
the wind, usually accompanied by a diagram showing the lift vector on the winglet tilted forward by the strong 
sidewash directed inboard above the wingtip, as in Figure 8.4.4. A corollary is that drag-reduction effectiveness 
is enhanced if the winglet is mounted well aft on the wingtip, where the sidewash 1s stronger. This general pic- 
ture of winglet effectiveness has been put forward by Hackett (1980) and McCormick (1995), among others. It 
is intuitively appealing, but it is flawed. 





Figure 8.4.4 Misleading view indicating that a winglet produces thrust in the presence of an inboard-directed 
sidewash “induced” by the wing. Actually, for an unswept wing with a vertical winglet, the optimum loading 
produces zero sidewash at the winglet 
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First, the idea that there is a favorable sidewash at the winglet location is based on the flowfield (sidewash) 
that would be there in the absence of the winglet. This view is valid only in the limit of small loading on the 
winglet and is not even close to being correct for practical levels of winglet loading. When a winglet is ad- 
vantageously loaded, the flowfield is altered considerably, and the resulting force on the winglet can be very 
different from what the undisturbed flowfield would lead you to expect. For example, when a wing and vertical 
winglet are unswept and are carrying their ideal spanload, the sidewash all along the span of the winglet is can- 
celed, and the winglet itself feels no induced thrust or drag. This 1s required by Munk's minimum-induced-drag 
criterion from Section 8.3.5, which tells us that any vertical portion of an optimally loaded system must see zero 
sidewash in the Trefftz plane. With no sweep, the sidewash at the lifting line itself is half what it is in the Trefftz 
plane, and is therefore still zero on a vertical winglet. So in this case, all of the drag reduction due to the winglet 
is felt on the horizontal wing, and the sailboat analogy misses the mark badly. 

So it is a mistake to try to understand a tip device in terms of the flowfield that would be there in the absence 
of the device, and it is also a mistake to expect the benefit of a tip device to come just from thrust on the device 
itself. As we saw above, in an optimally loaded unswept wing/winglet combination, the winglet produces no 
thrust. And the idea of a winglet as a thrust producer fares even worse in the case of a forward-swept wing: 
An optimally loaded winglet on a forward-swept wing can produce a large drag on its own surfaces and still 
produce a net drag reduction. In general, the sweep of the wing has a strong effect on the thrust or drag felt by a 
tip device. Let's look at this sweep effect in more detail. 

According to Trefftz-plane theory, the total induced drag of a wing/winglet combination depends only on the 
arrangement of the trailing edges as viewed in the Trefftz plane and on the spanload (Section 8.3.4). Trefftz- 
plane theory is not exact, but for high-aspect-ratio surfaces it is a reasonable approximation. Total induced drag 
should thus be largely independent of sweep, provided that the spanload 1s fixed. On the other hand, the induced 
drag or thrust felt by the winglet itself depends strongly on the general sweep of the whole lifting system. In 
fact, the distribution of induced drag on the whole system, whether it has a tip device or not, is strongly affec- 
ted by sweep. Induced-drag distributions that illustrate this are shown in Figure 8.4.5. Ideal spanloadings were 
assumed for a planar wing and for a wing with 20%-semispan vertical winglets, as shown in Figure 8.3.7. The 
Boeing WINGOP code (Craig and McLean, 1988) was used to carry out numerical lifting-line calculations of 
the induced-drag distributions for cases in which the wing and winglet are both unswept, and for cases in which 


both are swept back by 30°, and swept forward by 30°. Small spikes in the distributions at tips are due to the 
lifting-line numerics, and larger spikes at the winglet junction reflect shortcomings of lifting-line theory (Get- 
ting rid of the spikes at the winglet junction would require spreading the bound vorticity out chordwise, as in a 
lifting-surface theory). In spite of these anomalies, the general trends can be discerned. 


Figure 8.4.5 Induced-drag distributions calculated by numerical lifting-line theory, assuming the ideal span- 
loadings for a planar wing and a wing with 20%-semispan winglets, as shown in Figure 8.3.7. Lift equivalent 
to AR = 10.0, C; = 1.0. (a) Wing and winglet both unswept. (b) Wing and winglet both swept 30° aft. (c) Wing 
and winglet both swept 30° forward 
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In the unswept baseline case (Figure 8.4.5a), the induced-drag distribution is elliptic, as expected. In the un- 
swept winglet case, the drag distribution on the wing is as we might expect based on the spanload in Figure 
8.3.7, and the induced drag on the winglet is effectively zero, as we expect based on the theoretical discussion 


of Section 8.3.5. The strongest reduction in induced drag comes from the inboard part of the wing, and there is 
a small portion of the wing near the junction where the induced drag is increased. 

In the swept cases (Figure 8.4.5b,c), the distributions are dramatically different. For aft sweep without a 
winglet, there is a large thrust on the outboard half of the wing, more than offset by the large drag on the inboard 
half. The total induced drag, which is the same as in the baseline unswept case, is a small difference between the 
two. In the aft-swept winglet case, there is a large thrust on the winglet, which is mostly offset by a reduction 
in thrust on the outboard wing. The drag reduction on the inboard wing isn't as large as in the unswept case. So 
in the aft-swept case, there is significant thrust on the winglet, but looking just at that thrust would lead you to 
grossly overestimate the benefit. 

With forward sweep, we see generally the opposite of what we saw with aft sweep, except that the crossover 
from thrust to drag is farther inboard. Without a winglet, there is a very large thrust inboard of 10% semispan, 
more than offset by high drag on the rest of the wing. When a winglet is added, it carries a high drag on its own 
surfaces, which is more than offset by a reduction in drag over most of the span of the wing. In this case, the 
sailboat analogy, or any idea of the winglet as a thrust producer, is highly inappropriate. 

Trefftz-plane theory tells us that we can reduce the ideal induced drag by increasing the vertical height of the 
lifting system, as well as by increasing the horizontal span. A vertical fin or winglet that adds vertical height 
to the system will reduce the ideal induced drag if it is placed anywhere along the span of the wing off of the 
airplane center plane, but it is most effective by far when it is placed at the station of maximum span; that is, 
at the tip. This is one example of the more general problem of minimizing the ideal induced drag of a lifting 
system with given maximum horizontal span and vertical height; that is, a system that must fit within a given 
rectangular box in the Trefftz plane. Figure 8.4.6 illustrates a series of such configurations, in order from lowest 
ideal induced drag to highest. The configuration with the lowest drag is the box wing, which has lifting surfaces 
along all four edges of the box. Note that any retreat from either the corners or the edges of the box (e.g., a 
“blending” region in the junction between a winglet and the wing in the third example, or the retreat from the 
outer edge of the box by the “feathers” in the fourth example) increases ideal induced drag, but that there may 
be compensating advantages such as avoiding the viscous-drag penalty associated with sharp-cornered intersec- 
tions or reducing the wetted area of the surfaces. 


Figure 8.4.6 Lifting-surface configurations constrained to fit within a rectangular box in the Trefftz plane 
(height equal to 10% of full span), shown in order from the lowest ideal induced drag to the highest 
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Ideal-induced-drag theory is useful for guidance as to how to achieve a large reduction in induced drag, 
but the benefits that it implies are not generally achievable in practice. First, the induced-drag reduction that 
can actually be achieved in most applications typically falls significantly short of ideal. In addition, the actual 
induced-drag reduction is always offset by other factors that detract from the net benefit to the airplane. 


Several factors can contribute to the shortfall in induced-drag reduction relative to ideal: 


¢ Spanloads compromised to save weight As we discussed in Sections 8.3.5 and 8.3.8, spanloads of real 
wings are usually compromised to reduce bending loads and save structural weight, by carrying more 
load inboard and less load outboard than the ideal spanload. (This is in addition to the fact that the span- 
load with minimum induced drag for a wing in the presence of a down-loaded horizontal tail typically 
already has less-than-elliptic outboard loading, as we saw in Figure 8.3.18.) If the baseline wing without 
a tip device and the wing with a device added are both optimized to the same weight/drag objective, they 
will both carry reduced loads outboard compared with their respective ideal loadings, and a tip device 
of a given size will have less leverage in reducing induced drag than it would if both configurations had 
ideal loadings. 


¢ Twist distribution of the existing wing in retrofit applications If the tip device is to be retrofitted to an 
existing wing, the jig-twist distribution of the existing wing is nearly always preserved for cost reasons, 
and as a result the best spanload that can be achieved with the device installed will be farther from ideal 
than the load on the original wing was. This penalty for keeping the existing twist distribution 1s larger 
when the baseline wing is more lightly loaded outboard than ideal, which 1s usually the case. 

¢ Aeroelastic effects in retrofit applications In the case of an aft-swept wing, aeroelastic effects may add 
to the shortfall in drag reduction. Increasing the outboard loading (using a tip device) tends to increase 
wing bending at cruise conditions, which on an aft-swept wing washes the wing out, reducing the drag 
benefit of the tip device compared with what it would have been on a rigid wing. The amount of addi- 
tional wing washout and the resultant reduction of the induced drag benefit will depend on the flexibility 
of the wing, on the weight distribution of the airplane (payload and fuel), and on whether the structure is 
beefed up for the addition of the tip device. 

¢ Trim effects In the case of an aft-swept wing, adding a tip device that increases the loading outboard 
will increase the download on the horizontal tail that 1s required for trim, which in most cases will offset 
some of the induced-drag reduction and add to the offsetting profile-drag increase. 


The addition of a wingtip device generally adds wetted surface area and thereby increases viscous drag, and 
there may also be junction flows or areas with unfavorable pressure distributions that further increase the vis- 
cous drag. The redistribution of the spanload that the device produces can change the shock drag on the rest of 
the wing, but this effect can go in either direction and is usually not large. In any case, the induced-drag reduc- 
tion is nearly always partly offset by a net increase in the other drag components. 

Any practical device that reduces induced drag generally increases bending moments on the entire wing at 
the cruise condition and at the critical flight conditions that determine the design of the wing structure. The ad- 
dition of a wingtip device therefore often requires beefing up the wing structure, which adds weight, over and 
above the weight of the device itself, and subtracts from the net benefit of the device. This trade between drag 
reduction and weight increase is discussed further in Section 8.4.5. When a tip device is included in the design 
of an all-new wing, this structural-weight penalty must generally be paid in full. On an existing airplane, flight 
testing will sometimes have established that the wing has excess structural margin that can be “used up” by the 
addition of a tip device. The presence of an existing excess structural margin can thus reduce or even eliminate 
the required beefing up of the existing structure. 


8.4.3 Milestones in the Development of Theory and Practice 


Lanchester, the British aeronautical pioneer, had developed a qualitative understanding of the 3D flow around 
a lifting wing, including the vortex wake, by 1895 (Lanchester, 1907). A quantitative understanding of induced 
drag was first provided by the Trefftz-plane/lifting-line theory, developed by Prandtl in 1910 (Prandtl and Ti- 
etjens, 1934) and elaborated by several others in the following years. Even now, well into the era of CFD, our 
conceptual understanding of induced drag depends almost entirely on this early theoretical work. The concep- 
tual touchstones include 


¢ Ideal-induced-drag theory, including the elliptic ideal spanload (for minimum induced drag) and the 
simple formula for predicting the ideal induced drag of a planar wing. 

¢ The prediction that multiple and/or nonplanar lifting surfaces, including endplates, could have lower in- 
duced drag than a simple planar wing of the same maximum horizontal span. 

¢ Munk's stagger theorem (see Kroo, 2001), and the general prediction that, for a given Trefftz-plane geo- 
metry and spanload, the induced drag 1s independent of the fore-and-aft disposition of the lifting surfaces. 


We've already discussed ideal-induced drag theory and pointed out that it provides a method by which the po- 
tential induced-drag-reduction effectiveness of various lifting-surface geometries can be compared fairly. One 
of the best-known examples of such studies is by Cone (1962), in which he used a physical analog apparat- 
us based on a rheoelectric analogy to “solve” the ideal-induced-drag optimization problem and compare many 
different Trefftz-plane shapes, a sampling of which is shown in Figure 8.4.7. He found that practically any non- 
planar shape that adds vertical height near the tip would reduce ideal induced drag. Results like these provided 
the basis for the development of many of the tip-device concepts discussed in the next section. 


Figure 8.4.7 Some lifting-surface configurations (Trefftz-plane shapes) investigated by Cone (1962). (Pub- 
lished by NASA) 
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The realization that non-planar lifting systems could generally reduce drag did not immediately lead to suc- 
cessful applications, however. For example, simple flat endplates were tried numerous times in the years after 
it was first predicted that endplates would reduce induced drag, but in practice they never produced a net drag 
benefit. Their induced-drag reduction tended to fall short of ideal, and it was always more than offset by the 
increase in viscous drag due to added wetted area and corner flows. I think it is likely that the failure for so 
many years to find a better configuration than the flat endplate can be blamed on an “endplate paradigm” based 
on a particular way of looking at the Trefftz-plane theory. The reasoning leading to the endplate paradigm is as 
follows. In the limit as the vertical span of an endplate becomes large, the ideal spanload on the horizontal wing 
becomes uniform. One way to achieve this situation is to have 2D flow over the wing, enforced by endplates 
that are flat and large in chord as well as span. The resulting paradigm is that an endplate should always be flat 
and have a large chord. 

What the endplate paradigm fails to recognize 1s that to realize the ideal induced drag of an endplated con- 
figuration, only the spanwise distribution of load on the endplate needs to be ideal, and it doesn't matter how 
the load is distributed longitudinally. A vertical tip device of small chord can achieve the same induced-drag 
reduction as a large endplate of the same span just by carrying the right spanload. Whitcomb (1976) seems to 
have been the first to recognize this and to realize that an effective endplate is just another part of the lifting sys- 


tem; that is, a lifting surface that should be carrying a spanload close to ideal, just like the rest of the wing. Of 
course, to keep the viscous drag of a lifting surface low, the surface should have an efficient aerodynamic cross 
section; that is, an airfoil, and the chord of the surface should be sized consistent with the efficient load carrying 
capacity of the section. This is just good aerodynamic design practice of the kind that has always been applied 
to wings, and Whitcomb's contribution was to apply it to what had formerly been seen as just an endplate. While 
the direct result of Whitcomb's work was the classic near-vertical winglet, his general idea of applying good 
design practice to keep the profile drag low has also contributed to the development of concepts other than the 
wineglet. 

The trade between drag reduction and structural weight was not addressed explicitly in most early work, but 
it began to attract attention with the development of the winglet. Whitcomb's work on winglets suggested that 
for a given increase in bending moment on the inboard wing, a near-vertical winglet offers nearly twice as much 
drag reduction as a horizontal span extension. This suggestion was not based on theory, but on the results of 
wind-tunnel tests in which the winglet and horizontal span extension configurations were supposed to be sized 
so as to be equivalent in terms of root bending moment. However, judging by the results of many later studies, 
the horizontal span extension Whitcomb used for comparison was not as large as it should have been. Further- 
more, the horizontal span extension was not as well optimized aerodynamically as the winglet. Both of these 
factors contributed to an overly optimistic assessment of the winglet. 

Later, a systematic theoretical investigation of the question was published by Jones and Lasinski (1980). 
They used Trefftz-plane theory to calculate induced drag, and bending moment integrated over the span as an 
indicator of likely structural weight. (This is only a rough indicator because for real wings at the high-g con- 
ditions that are a factor in determining the required structure, the spanload shape is usually different from that 
at cruise, and the effective structural depth of real wings is not usually constant along the span.) Starting with 
a baseline elliptically loaded wing of a given span, they added horizontal and vertical tip extensions of varying 
length. For each device length, they optimized the spanload to minimize induced drag, subject to the constraint 
that the integrated bending moment, or “structural weight,” was the same as that of the baseline. As the size of 
the extensions increased, the spanloads had to become increasingly non-ideal (farther from the pure induced- 
drag optimum) to meet the constraint, and yet drag was still reduced. Repeating the calculations with root bend- 
ing moment instead of integrated bending moment as the constraint produced essentially the same results. The 
calculations indicate that horizontal span extensions and vertical winglets offer essentially the same maximum 
induced-drag reduction when the spanloads are constrained so that there is no increase in “structural weight.” 
They also indicate that to achieve a given level of drag reduction, a vertical winglet must be nearly twice as 
large as a horizontal span extension. In Section 8.4.5, we'll see similar results for some variations on the basic 
vertical winglet. 


8.4.4 Wingtip Device Concepts 


Whitcomb's breaking of the “endplate paradigm” has led to the development of a variety of wingtip devices that 
can be effective in reducing total drag, some of which were shown in Figure 8.4.1. It is assumed from here on 
that Whitcomb's basic idea of applying good aerodynamic design practice will be adhered to 1n executing any of 
the concepts. We'll start our discussion of this assortment of competing concepts by listing the basic strategies 
(Table 8.4.1) that are used in various combinations by the different devices. 


Table 8.4.1 Basic strategies for practical tip devices 


Increasing horizontal span Straight tip extensions 


Tapered tip extensions 
Going non-planar Bending (winglets) 





[Bending with blending (blended winglets: reduced wetted area and junction drag) 
Splitting (split winglets and feathers: vertical height with less wetted area penalty) 
Splitting and rejoining (spiroids) 


Part-chord devices (less chord than the |Trapezoidal baseline tip often has more chord than needed to carry spanload 
baseline wingtip) 


Pronounced tapering Reduces wetted area and critical structural loads 
Applicable to all tip-device concepts 
Additional sweep (e.g., raked tips) Increases aeroelastic washout and reduces wing weight 


Aeroelastic relief not very effective for a vertical winglet, but sweeping a winglet 
aft can make transonic tailoring easier 





8.4.5 Effectiveness of Various Device Configurations 


For comparisons of the potential performance advantages of various tip-device configurations to be fair, the 
devices in question must be comparably optimized. We've seen that much of the early theoretical work was 
highly idealized and concentrated on induced drag only. For that case, the appropriate optimization is the minim- 
ization of induced drag alone, which defines the “ideal” induced drag and the “ideal” spanload. Ideal-induced- 
drag theory is useful for initial screening of concepts and for understanding basic trends, and we'll look at such 
comparisons below, but it is not a realistic optimization target for real-world tip devices. The spanload of a real 
transport-airplane wing, with or without a tip device, is not generally optimized for minimum induced drag, but 
instead is optimized for a favorable trade between total drag and structural weight. This is still spanload op- 
timization, just to a bottom-line performance objective such as fuel-burn or maximum range, rather than to an 
esoteric aerodynamic target. 

The usual procedure in design studies is to define the general configuration of a candidate tip device in terms 
of its planform and dihedral angle(s) and then to estimate its performance through analysis. A step that should 
always be included in this process is the optimization of the spanload. When the planforms and airfoil cross 
sections of the wing and the tip device are given, the spanload is controlled by the twist distribution. (Recall 
that in Section 8.2.1 we defined “twist distribution” to refer to the spanwise distribution of the orientation of 
the zero-lift lines of the sections. Thus what we refer to here as the “twist distribution” would encompass the 
overall incidence setting of a tip device, or, for example, the “toed-in” or “toed-out” setting of a winglet, as well 
as the variation in incidence along the span of the device.) If both the wing and the tip device are all new, the 
twist distribution of the entire system is open to optimization. In derivative or retrofit applications, the jig twist 
distribution of the existing wing is generally fixed, and the twist distribution of only the tip device itself can be 
optimized. If the twist distribution of the existing wing was optimized for operation without a tip device, the 
benefit available from the addition of the tip device will usually be substantially less than it would have been 
if the wing could have been reoptimized. In the discussion that follows, it is assumed that the twist distribution 
of each candidate tip device has been optimized to an appropriate bottom-line performance objective, taking 
into account structural weight and any other factor that affects the objective, subject to whatever constraints are 
applicable. For real-world design studies, computational tools are available that can carry out this optimization 
to different levels of physical fidelity, depending on the purpose of the study. 

Device size is always an important design decision, and it affects all devices in essentially the same way. 
Figure 8.4.8 shows figurative trends in drag, structural weight, and fuel burn as functions of device size for a 
generic tip device, assuming that the baseline airplane with no device has no excess structural margin, and that 
for each size the device has been optimized for some bottom-line measure of airplane performance, such as fuel 
burn. For simplicity, the drag and weight effects are assessed separately first (the drag reduction is plotted for 


constant weight) and then combined in their overall effect on fuel burn. Note that induced drag cannot be re- 
duced below zero, no matter how large the device is made and that wetted area and profile drag will continue 
to increase with increasing size. Thus the drag reduction at constant weight must eventually show a diminish- 
ing rate of return with increasing size, as indicated by the decreasing slope of the drag reduction curve in the 
first frame. Because of the ever-increasing profile drag, eventually the drag reduction would reach a maximum 
and then decline. But for this pure aerodynamic trade at constant weight, the maximum drag reduction would 
typically occur well outside the practical range of device size. In the real world, the effects of the increasing 
structural weight would intervene long before that. 





Figure 8.4.8 Effects of device size on performance for a generic wingtip device, assuming no excess structural 
margin in baseline wing 


Drag reduction % 
Weight increase % 





Device size Device size 


Fuel-bum reduction % 


Device size 


For reasonable device sizes, the weight increase tends to be roughly linear with size. When the weight in- 
crease is taken into account, the fuel-burn reduction generally reaches a maximum well before the maximum in 
pure aerodynamic drag reduction at constant weight. The optimum device size defined by the maximum fuel- 
burn reduction depends on the relative importance of drag and weight, which depends on the length of the mis- 
sion. If maximum range is the objective, the situation is even more complicated because the critical condition 
that limits maximum range can be different depending on the details of the baseline airplane design. Compared 
to a device optimized for fuel-burn, a device optimized for range can be anywhere from considerably smaller to 
considerably larger, depending on the details. Thus the optimum device size depends on many factors, including 
what performance objective is sought. 

Now let's look at the comparative advantages of horizontal span and vertical height. Earlier we mentioned 
that Whitcomb's work on winglets (Whitcomb, 1976) suggested a rule of thumb to the effect that for a given 
increase in bending moment on the inboard wing, a near-vertical winglet offers nearly twice as much drag re- 
duction as a horizontal span extension. This rule of thumb has not been borne out by studies since then. Jones 


and Lasinski (1980) indicated that horizontal span extensions and vertical winglets offer essentially the same 
maximum induced-drag reduction when the impact on “structural weight” due to bending loads is constrained 
to be the same. 


In Figure 8.4.9, we look at the problem a different way but reach a similar conclusion. While Jones and Las- 
inski constrained the root bending moment and looked at non-ideal induced drag, here we look at how ideal 
induced drag is reduced as a function of the increase in root bending moment. In these ideal-induced-drag cal- 
culations carried out by the author, induced drag was minimized for a fixed total lift, and root bending moment 
was free to increase by different amounts for the different devices. The three winglets indicated by the diamond 
symbols have the same 20% semispan height and thus the same ideal-induced-drag reductions as the corres- 
ponding ones shown in Figure 8.4.6. The difference here is that the increase in root bending moment 1s also 
shown, on the horizontal axis. Two main conclusions are: 

1. For a horizontal span extension to produce the same ideal-induced-drag reduction as a winglet, it needs 
to be only about half as large (a little more than half compared with a sharp-cornered vertical winglet, and 
a little less than half compared with a blended or split winglet). A horizontal span extension thus needs 
less additional wetted area than a winglet for the same induced-drag reduction, and taking profile drag 
into account would move the total-drag curves for horizontal and vertical devices closer together than the 
induced-drag curves. 


2. For a given increase in root bending moment, the three types of winglets produce roughly the same 
ideal-induced-drag reduction, about 17% more reduction than a horizontal extension. This is a much 
smaller difference than the factor of nearly two claimed by Whitcomb (1976). And again, taking profile 
drag into account makes the total-drag difference even smaller. 


Figure 8.4.9 Reduction in ideal induced drag versus increase in root bending moment for the three types of 
winglets shown in Figure 8.4.6, compared with horizontal span extensions. The lines indicate the trends as the 
sizes of the devices are increased from zero, and the solid symbols are for two particular sizes that yield 
roughly comparable drag reductions: 10% semispan horizontal span extensions, and 20% semispan winglets 
(height measurement) 
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This comparison is highly idealized and should be taken only as a rough indication of the trends. As we noted 
above, real tip devices are not generally optimized for ideal induced drag. Also, wing root bending moment 


associated with the cruise spanload is only a very crude indicator of structural-weight impact. Still, a general 
conclusion is that horizontal span extensions and vertical winglets have very similar performance potential in 
terms of the trade between drag reduction and weight increase. In general, in the choice between winglets, hori- 
zontal span extensions, and other tip-device configurations, there is no clear-cut favorite for all applications. In 
terms of the basic physics, the benefits they offer tend to be comparable. Which choice is favored for a partic- 
ular application depends on the details of the baseline airplane design, both aerodynamic and structural, and on 
the mission objective. And the differences between the choices are usually not large. 

Theoretically, there is a small general advantage for devices in which the lifting surface splits into two 
branches. In general, splitting allows a given level of induced-drag reduction to be achieved with less additional 
wetted area than a non-split configuration would require. For a feather configuration with a small included angle 
(see Figure 8.4.6), the wetted-area increase is only a little more than that of a horizontal span extension of the 
same projected span, but the induced-drag reduction is considerably greater. Of course, the weight increase 1s 
also considerably greater, so that the advantage in terms of drag versus weight is not large. If we increase the in- 
cluded angle between the feathers to 180°, we have split vertical winglets of equal span. Comparing this with a 
simple vertical winglet of the same total height, we find that the optimum spanloads for both are about the same 
just inboard of the junction. In the case of the split winglets, this load splits evenly between the two branches, 
so that if we size the chord of the device according to the load carried, the split winglets need only half as much 
chord as the single winglet. The split winglets produce about 90% of the induced-drag reduction of the single 
winglet (compare the “split winglets” with the “vertical winglets” in Figure 8.4.6) with only about half as much 
additional wetted area. A drawback to split winglets in practice is that the span of the lower winglet is often 
limited by ground clearance. 


8.5 Manifestations of Lift in the 
Atmosphere at Large 


In previous sections, we looked at the flow around a 3D wing, the lift distribution on a 3D wing, and the theory 
of induced drag. In this section, we look at the manifestations that the lift on a wing has in the atmosphere on a 
large scale and at some of the theoretical issues that arise in representing those manifestations. 

We start with a look at how the flowfield associated with the lift satisfies Newton's laws in an integrated 
sense. Highly simplified models suffice for this purpose, but the mathematics involved 1s tricky, and some erro- 
neous results have been widely propagated. A secure understanding of these issues thus requires addressing how 
the erroneous analyses went wrong. When the smoke has cleared, we'll find that there is no net downward mo- 
mentum imparted to the atmosphere as a whole and that the lift is reacted by pressure differences on horizontal 
planes above and below the wing, or on the ground plane, if there is one. We'll also consider how conservation 
of momentum applies to control volumes that don't encompass the entire atmosphere. To keep things simple, 
we'll look only at rectangular volumes with horizontal and vertical surfaces. We'll find that the lift can show up 
at the boundaries either as pressure differences on the horizontal surfaces or as fluxes of vertical momentum 
mainly through the vertical surfaces, or as combinations of the two, depending on the proportions of the control 
volume. 

Finally, the actual process by which the vortex wake disappears far downstream has received little coverage 
in the usual aerodynamics sources, and we'll attempt to at least fill in a rough picture of what happens there. 


8.5.1 The Net Vertical Momentum Imparted to the 
Atmosphere 


There is a widespread notion that an airplane in steady level flight continuously imparts net downward mo- 
mentum to the atmosphere. Contributing to this notion is the fact that the airplane is continuously adding to the 
impulse of its vortex-wake system and the fact that there is net flux of downward momentum across vertical 
planes behind the airplane, issues we'll deal with in the following sections. We also saw that a 2D airfoil imparts 
a downward momentum change to the air, but confusing the issue in that case 1s the fact that the downward mo- 
mentum behind the airfoil accounts for only half the lift. However, appearances are deceiving, and the notion 
that the airplane leaves behind a continuously increasing net downward momentum is mistaken. 

For our purpose of assessing the net vertical momentum, a very simple model of the flowfield produced by 
an airplane in steady level flight suffices. That is the classical, highly idealized model in which the vortex wake 
is represented by a pair of parallel line vortices, and the sinking and any other deformation of the wake are ig- 
nored. Assuming the airplane has been flying for a finite time since an impulsive start, we then have a vortex 
system consisting of a planar, rectangular loop, as shown in Figure 8.5.1. This model is unrealistic in several 
ways, especially in the vicinity of the starting vortex. Spalart (2008) has pointed out that a simple flow structure 
for this part of the vortex loop is not sustainable over time, where in the real world the vortex system would 
quickly distort, become unsteady, and disperse. However, we'll find that the starting vortex isn't a player in the 
overall vertical-momentum balance, which indicates that this oversimplification does no harm for the level of 
analysis we're attempting here. And in Section 8.5.5, we'll discuss how the vortex wake really terminates down- 
stream. 





Figure 8.5.1 Idealized rectangular-loop model for the vortex system of a 3D wing flying for a finite time 


Given our representation of the vortex-wake system as a closed vortex loop, a relevant concept is that of the 
impulse of a vortex loop. It is known that any incompressible potential flow can be established by the applic- 
ation of an impulsive pressure field at an initial instant, starting from an initial condition in which the fluid is 
at rest (Milne-Thomson, 1966, Section 3.31). In the case of a closed vortex loop like the one in Figure 8.5.1, 
in a field that would otherwise be at rest, an impulsive pressure field that could initiate the flow would have to 
impose a net impulse on the fluid, proportional to the circulation and the area of the loop, and in the direction 
perpendicular to the plane of the loop (Milne-Thomson, 1966, Section 10.21). This is what is called the impulse 
of the vortex loop. In our idealized model, the rectangular loop of Figure 8.5.1 is attributed to an airplane fly- 
ing with lift L for time t. The impulse of this vortex loop, and the actual mechanical impulse imposed by the 
airplane on the atmosphere (lift times time), are both equal to Lt. They both increase with time as the airplane 
continues to fly and the area of the vortex loop grows. So we have a one-to-one correspondence between the 
mechanical impulse imposed by the airplane and the impulse of the vortex loop. 

Thinking in intuitive physical terms, we might also expect the impulse imposed by the airplane on the air 
(the product Lt) to produce a net vertical momentum in the atmosphere that grows with time. This expectation 
is not satisfied by the mathematics, however. Wigton (1987, private communication) showed that the volume 


integral of the vertical velocity inferred from the vortex system, over the whole infinite atmosphere, is noncon- 
vergent. The integrand (the vertical velocity) doesn't decrease fast enough with increasing distance (only as rh ), 
and the flowfield associated with the vortex loop thus contains infinite amounts of both upward and downward 
momentum. An attempt to define a value for the integral can lead to any answer between minus infinity and 
plus infinity, depending on how the integration 1s sequenced. Thus the vertical-momentum content of the infin- 
ite atmosphere at any given time is at best undetermined, even though the mechanical impulse imposed by the 
airplane and the impulse of the vortex loop are well defined. 

This unsatisfying result is an artifact of our idealized assumptions: incompressible flow and an infinite at- 
mosphere. In the real world, no disturbance generated by the airplane would propagate faster than the speed of 
sound, and for finite flight time all flowfield integrals would be well defined, so presumably we could resolve 
this difficulty by doing an unsteady compressible analysis. But there is an easier way. The problem is resolved 
if we consider flight in a semi-infinite atmosphere bounded by a ground plane. The simplest way to impose 
the no-through-flow condition at the ground plane is to place an image of the “real” vortex system below the 
ground, as shown in Figure 8.5.2. This is the same thing we did in Figure 8.2.6 to model flight close to the 
ground, but this time our purpose is to model flight many wingspans from the ground. At large distances, the 
velocity disturbances associated with the vortex loop and its image die off as r which is fast enough for the 
vertical-momentum integral to converge. If we expected to see a net downward momentum equal to Lt, the res- 
ult comes as a surprise: The value of the integral over the semi-infinite space above the ground 1s zero, which 
means that the airplane imparts no net downward momentum to the atmosphere in steady level flight over a 
ground plane, regardless of height above the ground. If momentum conservation is to be satisfied, there is only 
one other way the lift can be balanced, and that 1s by the pressure, which is our next topic. 


Figure 8.5.2 The rectangular-loop vortex system of Figure 8.5.1 reflected in a ground plane 





8.5.2 The Pressure Far above and below the Airplane 


Our finding of zero net vertical momentum shouldn't have been a surprise. Prandtl and Tietjens (1934) showed 
how in steady level flight the lift is balanced by an overpressure on the ground under the airplane, so that of 
course there is no need for net momentum transfer. They assumed a rectangular vortex system and its image, 
like that shown in Figure 8.5.2, and showed that when the height is large compared with the wingspan, there 1s 
no significant pressure disturbance at the ground associated with the trailing vortices, but that there 1s a disturb- 
ance at the ground associated with the bound vortex and its image, given by 


Ap= cu =, 
8.5.1 27 Re 

where h is the airplane's height above the ground and R is the radial distance from the airplane to the point in 
question on the ground. The disturbance is thus distributed in a pattern of circular isobars centered directly un- 
der the airplane. The overpressure dies off radially in a bell-shaped distribution, as shown in Figure 8.5.3, and 
Prandtl and Tietjens showed that the integrated pressure disturbance accounts for all of the lift. The gross fea- 
tures of the pressure footprint predicted by the simple incompressible model are not altered by compressibility 
at subsonic speeds. For steady flight, the footprint remains centered under the airplane regardless of the Mach 
number, as long as it is subsonic. This steady-flow pressure distribution is not like an acoustic radiation pattern, 
which would lag behind the airplane at high subsonic Mach numbers. 


Figure 8.5.3 The airplane's pressure footprint on the ground in steady, level flight. From Prandtl and Tietjens, 
(1934). Used with permission of Dover Publications, Inc. 
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The presence of an integrated pressure disturbance imposed by the airplane on the ground plane has an easy 
intuitive interpretation: The ground plane is supporting the weight of the airplane in addition to the weight of 
the atmosphere. The typical local magnitude of the pressure disturbance at the ground is extremely small. For 
example, for a 747 weighing 800 0On pounds flying at 35 000 ft AGL, ae maximum pressure disturbance dir- 
ectly under the airplane is 1.0 x 10° + pounds per square foot or 5 x 10— 8 atmospheres. And, of course, the area 
covered by the disturbance is typically quite large. The disturbance is greater than half its maximum value in- 
side a circle on the ground of radius 0.766 h, which for our 747 at 35 000 ft covers an area of 81 square miles. 

Above the airplane, the integrated pressure disturbances “due to” the bound vortex and its image cancel each 
other, which is consistent with having all of the lift accounted for by the pressure disturbance on the ground 
below. The other place we should look for manifestations of lift 1s under the starting vortex, which wasn't men- 
tioned by Prandtl and Tietjens and which in the real world would be out of the picture in any reasonable length 
of time. We'll look at it anyway, just to understand the full implications of our vortex-loop model. One might 
expect that there would be a “suction” footprint under the starting vortex, a mirror image of the pressure foot- 
print under the airplane, because the starting vortex has a rotation opposite to that of the bound vortex. But it 
turns out that the integrated pressure disturbance under the starting vortex 1s zero. This is an instance in which 
being in the right reference frame is crucial. 

The Prandtl and Tietjens analysis of the pressure footprint is carried out in the reference frame moving with 
the airplane, in which the flow under airplane is steady, and the steady form of Bernoulli's equation holds. In this 
reference frame, there is a substantial freestream velocity, and far below the airplane, where the disturbances 
are small, the pressure disturbance goes as puou, that is, the pressure disturbance is first order in the velocity 


disturbance uw. To use the steady Bernoulli equation in the analysis of the flow under the starting vortex, we must 
move to the reference frame of the air mass, in which the starting vortex is not moving. In this frame, there is no 
freestream velocity, and the pressure disturbance goes as pu’, second order in the disturbance velocity, so that 
the integrated pressure disturbance vanishes in the limit of large height. 

So we see that for steady level flight in a semi-infinite atmosphere the lift is transmitted to the ground by the 
spread-out pressure disturbance of Figure 8.5.3. And none of the conclusions we've drawn here would change 
much if we introduced a more realistic model for the flow and the development of the wake near the airplane. 

Now let's return briefly to the case of the infinite atmosphere. The Prandtl and Tietjens analysis of the pres- 
sure on the ground plane has interesting consequences when it is applied to the situation without a ground plane. 
If we remove the ground plane by removing the image vortex system, the pressure disturbance at the original 
location of the ground plane is reduced to half what it was when the ground plane was there. And in the absence 
of an image vortex system, the integrated pressure disturbance above the airplane is not canceled. So looking at 
horizontal planes above and below, we now have a positive integrated pressure disturbance below the airplane 
accounting for half the lift, and a negative disturbance above the airplane accounting for the other half, as il- 
lustrated in Figure 8.5.4a. This holds regardless of how far above and below the airplane we place the planes. 
The transport of vertical momentum through these horizontal planes above and below the airplane is second 
order in the velocity disturbances, so that its integrated effect vanishes as we move the planes far away. All of 
this indicates that in an infinite atmosphere, pressure differences on horizontal planes far above and below the 
airplane account for the lift, and that there is no net transfer of momentum to the atmosphere. This of course 
implies that the “physically correct” value for the volume integral of the vertical momentum in the atmosphere 
should be zero, though we found it before to be mathematically indeterminate. 


Figure 8.5.4 Balancing momentum, pressure, and lift in rectangular control volumes of different proportions. 
(a) Ax => Az with no ground plane, or with a ground plane far away relative to any dimension of the control 
volume. (b) Ax = Az with a ground plane as the bottom boundary of the control volume. (c) Az = Ax with or 
without a ground plane 
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8.5.3 Downwash in the Trefftz Plane and Other 
Momentum-Conservation Issues 


We've seen that no net vertical momentum is imparted to the atmosphere as a whole by an airplane in steady 
level flight. However, there are regions of both upward and downward momentum in the field, and it 1s in- 
structive to look at the momentum balance at the boundaries of control volumes that don't encompass the entire 
atmosphere. We'll limit our attention to rectangular volumes with vertical and horizontal faces and start by con- 
sidering fluxes of vertical momentum across vertical planes at various stations in the flow. One example is the 
Trefftz plane downstream of the airplane, which we used as the downstream boundary of a control volume in 
our derivation of the general farfield integral for the induced drag, Equation 8.3.5. Then, of course, it was the 
flux of flight-direction momentum we were interested in, while now it is the flux of vertical momentum. 

There is a classical argument for the downward momentum at the Trefftz plane in an infinite atmosphere that 
goes as follows. A rectangular vortex loop is assumed, as in Figure 8.5.1, and the Trefftz plane is placed between 
the bound and starting vortices, far from both, and is pierced by the trailing vortices. At sufficiently large dis- 
tances, the bound and starting vortices contribute nothing, and only the trailing vortices have to be considered. 
The expression for the w disturbance “induced” by a trailing-vortex pair of infinite length is then integrated over 
the Trefftz plane, and the result is found to be (Thwaites, 1958, p. 303, for example) 


fo —L 
/ | wdzdy = | 
g.5.2 J /1 Pte. 


A widely held interpretation of this result is that there is a flux of downward momentum through the Trefftz 
plane, consistent with the lift on the wing, and that it is “induced” by the trailing vortices. 





However, it turns out that this classical result 1s incorrect. Wigton (1987, private communication) found that 
the w integral 8.5.2 is nonconvergent (w dies off only as r°), and he observed the classical symptom of non- 
convergence, that 1s, that the value obtained depends on the order of integration. Reversing the order from that 
shown in Equation 8.5.2 yields a value of zero. Wigton also found that if he used the w expression for trailing 
vortices of finite length, consistent with a finite vortex loop as in Figure 8.5.1, the integral converges, and the 
correct answer is zero, regardless of the length of the vortices or the order of integration: 


l| wdydz = /| wdedy = 0. 
8.5.3 2 #1 fu T 


So Wigton concluded that the classical argument is wrong and that the part of the velocity field “induced”’ 
by the trailing vortices results in no net flux of vertical momentum through the Trefftz plane. Lissaman (1996) 
independently came to the same conclusion. 

The story of this error represents an interesting cautionary tale. Clearly, incorrectly evaluating an improper 
integral is an easy trap to fall into. You can apply the standard procedures for evaluating integrals and, without 
making any procedural error, obtain a wrong answer. Equation 8.5.2 fortuitously gave the “expected” result, 
which apparently fooled more than one prominent aerodynamicist. And the incorrect result has been used in 
other analyses, rendering them incorrect as well. Sears's analysis of the pressure in the Trefftz plane is an ex- 
ample we'll discuss in Section 8.5.4. 

The trailing-vortex result of Equation 8.5.2 1s not the only error Wigton found. He also found that the classic- 
al argument that the bound and starting vortices contribute nothing is wrong. For any Trefftz plane between the 
bound and starting vortices, each “induces” an integrated downward momentum corresponding to half the lift, 
regardless of distance. At large distances, the w disturbances “induced” by the bound and starting vortices are 


very small, but are spread over a very large area. Thus the Trefftz plane in an infinite atmosphere does see a flux 
of downward momentum corresponding to the lift after all, but it 1s distributed very differently from the way it 
was in the classical picture, in which it was assumed to be associated with the trailing vortices. Lissaman (1996) 
also independently found this result. It applies only to a vertical plane between the bound and starting vortices. 
For a vertical plane either ahead of the airplane or behind the starting vortex, the flux of vertical momentum is 
zero, because the integrated contributions from the two vortices cancel. 


This finding of an integrated downward momentum in some vertical planes and none in others appears to 
contradict our finding of zero for the volume integral of the vertical momentum, even for an infinite atmosphere 
(in Sections 8.5.1 and 8.5.2). If we view the infinite-atmosphere case as a series of “Trefftz-plane” slices stacked 
in the flight direction, we see no net downward momentum in the slices ahead of the airplane or behind the start- 
ing vortex, and nonzero net downward momentum in the slices between the airplane and the starting vortex. If 
we were to naively sum the stack, we would conclude that the atmosphere contains a net downward momentum 
equal to Zt. But this naive summing is just another example of an incorrect evaluation of an improper integral, 
in this case the 3D volume integral of w in an infinite domain. Remember that we found that this integral 1s non- 
convergent and therefore indeterminate mathematically. Physically speaking, however, our argument based on 
the pressure indicated that the airplane imparts no net vertical momentum to the atmosphere. So the physically 
“correct” value for the volume integral of the vertical momentum in an infinite atmosphere is still zero and is 
not contradicted by the Trefftz-plane findings. 

So how do we reconcile a net flux of downward momentum in the Trefftz plane with the findings that there 
is no net vertical momentum imparted to the atmosphere by the lift and that the lift 1s reacted by pressure differ- 
ences on horizontal planes? If we look at the momentum balance in control volumes of different proportions, as 
illustrated in Figure 8.5.4, it appears that all of this can be reconciled, though we'll find that the lift 1s accounted 
for either by pressure or by momentum flux, depending on the proportions of the control volume. 

Consider rectangular control volumes with dimensions Ax, Ay, Az. Assume that the spanwise dimension Ay 
is very large compared with the other two, so that fluxes of vertical momentum through the side boundaries 
effectively vanish. Now, keeping Ax and Az much smaller than Ay, we can define two limiting shapes for the 
control volume in side view, tall and slender, or wide and flattened. Looking at these limiting cases with and 
without a ground plane gives us the three combinations shown in Figure 8.5.4. Unlike Wigton's (1987, private 
communication) analysis above, we put the starting vortex out of the picture by assuming that it is far down- 
stream relative to any dimension of our control volume, so that its “contribution” to any integrated momentum 
fluxes effectively vanishes. There are only three distinct combinations in Figure 8.5.4 because, with the starting 
vortex out of the picture, it doesn't matter whether there is a ground plane in the case of the tall, slender control 
volume in Figure 8.5.4c. The momentum balances then work out as follows: 

1. Ax — Az with no ground plane, or with a ground plane far away relative to any dimension of our con- 
trol volume, so that the image vortex system has negligible “effect”: As Ax grows large relative to Az, 
the integrated fluxes of vertical momentum through the upstream and downstream boundaries effectively 
vanish, and the lift is accounted for by the pressure disturbances on the top and bottom boundaries, as in 
Figure 8.5.4a. The pressures on the top and bottom boundaries account for half the lift each, regardless of 
the relative distances above and below the airplane. If one of the boundaries is moved farther away, the 
pressure disturbances become weaker but are more spread out, so that the integral is unchanged. 


2. Ax -*= Az with a ground plane as the bottom boundary of the control volume: As in (1) above, the integ- 
rated fluxes through the upstream and downstream boundaries effectively vanish, and the lift is accounted 
for by the pressure disturbance on the ground, as in Figure 8.5.4b. This is similar to what we found in 
Figure 8.5.4a, except that the ground plane has the effect of doubling the pressure disturbance there, so 
that it accounts for all of the lift, and the pressure footprint is the same one illustrated in Figure 8.5.3. On 
the upper boundary, the disturbances “due to” the bound vortex and its image are of opposite signs and 


cancel in an integrated sense. But they do not cancel locally. There is a central region directly above the 
airplane where the “contribution” of the bound vortex dominates, and the pressure disturbances are neg- 
ative. At larger horizontal distances, the “contribution” of the image vortex dominates, and the pressure 
disturbances are positive. The positive disturbances are much weaker than the negative disturbances, but 
they can cancel the positive disturbances in the integral sense because they are spread over a much larger 
area (note that the distribution is axisymmetric, and the integration is therefore radius-weighted). 


3. Az —* Ax with or without a ground plane: As Az grows large relative to Ax, the integrated pressure 
forces on the top and bottom boundaries effectively vanish, and the lift 1s accounted for by the fluxes 
of vertical momentum through the upstream and downstream boundaries, as in Figure 8.5.4c. Because 
we have put the starting vortex out of the picture, the total flux of vertical momentum is split equally 
between upward momentum upstream and downward momentum downstream instead of showing up 
only as downward momentum downstream, as it did in Wigton's (1987, private communication) analysis. 
This is the same distribution of integrated momentum flux that we found in 2D in Section 7.3.4. 


I've presented these arguments in an arm-waving way, but I'm confident they represent correct limiting cases. 
They demonstrate that there is no inherent contradiction in seeing the lift manifested both as pressure disturb- 
ances on horizontal planes and as fluxes of vertical momentum through vertical planes. And there is no incon- 
sistency between these manifestations and our earlier finding of no net vertical momentum in the atmosphere. 


8.5.4 Sears's Incorrect Analysis of the Integrated Pressure 
Far Downstream 


Sears (1974) used conservation of momentum in control-volume form, combined with the classical Trefftz- 
plane integral for the induced drag, Equation 8.3.10, to deduce that the integrated pressure in the Trefftz plane 
behind a lifting wing in inviscid flow 1s higher than ambient. 

On an intuitive level, this result is surprising, because we tend to think of induced drag as being associated 
with the kinetic energy left behind in the downstream flowfield, and we tend to associate excess kinetic energy 
with a velocity magnitude higher than freestream, and pressure lower than ambient. We also tend to associate 
the swirling motion about the vortex cores with lower-than-ambient pressure. But our naive intuition 1n this case 
is mistaken. In inviscid incompressible flow, the pressure perturbation goes as — 2Uu — voy w’, so that 
even though v and w tend to be larger in magnitude than u near the vortex wake, the — 2Uu term 1s of higher 
order and can dominate. When downwash occurs near a tilted wake, u can be negative, and the perturbation 
pressure can be positive, contrary to our naive intuition. So there can be regions of positive perturbation pres- 
sure behind a lifting wing, and we can't rule out Sears's result just on the basis of arm waving and intuition. 

But it turns out that Sears's conclusion is actually wrong. Given his flowfield model, there is a region of 
higher-than-ambient pressure in the Trefftz plane, but the integrated pressure is lower than ambient, not higher. 
So where did the analysis go wrong? 

Sears assumed that the wing is elliptically loaded and that the vortex wake remains planar but is tilted down- 
ward as shown in Figure 8.5.5, at an angle consistent with what this wake model gives for the downwash far 
downstream of the wing. He assumed that far downstream the velocity disturbances are those “induced” by the 
vortex wake perpendicular to its own plane, and he substituted these velocity disturbances into the integral ex- 
pressions governing the integrated momentum balance. The tilt angle of the wake thus enters directly into his 
analysis, which is unfortunate, as the tilt angle of an undistorted tilted wake 1s much too large, n’/2 ~ 4.9 times 
that of a realistic rolled-up wake, as we saw in our discussion of ideal induced drag in Section 8.3.5. 


Figure 8.5.5 Sears's (1974) model using a planar, tilted vortex wake, and Trefftz plane perpendicular to flight 
direction 





But even being this far off in the tilt angle of the wake isn't enough to account for the wrong sign in the final 
result. That, it turns out, is the result of a mathematical error. One of the integrals appearing in Sears's analysis is 
the integral of w, that is, the integrated downwash in the Trefftz plane, associated with the trailing vortex wake, 
for which he assumed the incorrect value from Equation 8.5.2, — L/(puo). We saw earlier that this integral is 
nonconvergent for a wake of infinite length, but that 1ts value is zero for a wake of finite length, regardless of 
the length. Had Sears used the correct value of zero for the w integral, he would have obtained the negative 
value — Dj; for the integrated pressure in the Trefftz plane, instead of the erroneous +Dj;, where D; is the induced 
drag. 


8.5.5 The Real Flowfield Far Downstream of the Airplane 


Most of our discussion so far has been based on a vortex-wake system consisting of a rectangular loop termin- 
ated at the downstream end by a starting vortex, though we noted that the assumption of a persistent starting 
vortex 1s unrealistic. Of course, in the real world, the starting vortex would be left on the runway where the air- 
plane took off, and it would dissipate rapidly through viscous interaction with the ground. And there would be 
diffuse spanwise vorticity left behind as the bound vorticity changed during climb and the transition to cruise. 
But even if the flight started impulsively at cruise altitude, the starting vortex could not persist in the simple 
form we assumed. So what really happens far downstream, as the vortex-wake system collapses and the flow 
disturbances associated with the lift die out? 

For purposes of this discussion, we'll skip the complexities of takeoff and climb. We'll assume the airplane 
has been in steady level flight at high altitude for a long time, and we'll seek a “steady state” understanding 
of the flowfield far downstream, in which a starting vortex is no longer in the picture. In Section 8.1, we de- 
scribed some features of the flow, up to the point where the wake “collapsed” into an unsteady pattern of loops 
and eddies. Here, we'll attempt to deduce what the time-averaged structure of the flow must be throughout this 
process of collapse and beyond, to where the velocity disturbances die out. The final stages of this progression 


are not of any practical importance, because the mean velocities become very small. It's just an interesting and 
informative exercise to figure out what the laws of motion require. 

For a long distance downstream of the airplane, the rolled-up vortex wake is relatively compact and retains 
the descending-oval form shown in Figure 8.1.7, and the velocity and vorticity distributions shown in Figure 
8.1.8 change very little. The collapse of the wake starts with the slow growth of instabilities such as the Crow 
instability, which leads eventually to breakup into loops and eddies, as shown in Figure 8.1.10. This process 1s 
highly variable in response to atmospheric conditions, and it changes over short and long time scales. Down- 
stream of breakup, we have a turbulent flow with motions over a wide range of length scales, spread over a 
much wider area than the original descending oval occupied. 

Of course, this breakup into a diffuse turbulent flow doesn't mean the vorticity in the upstream wake has 
disappeared. Remember that Helmholtz's second theorem tells us that vortex lines can't end in the interior of 
the field. The vortex lines from the wake upstream haven't ended in the field, but in the instantaneous turbu- 
lent flow downstream of breakup they have become an unsteady, chaotic tangle. Does this mean the flow is 
just an unorganized turbulent jumble that is otherwise featureless? No. Averaged over time, the flow must have 
an overall structure consistent with its origins upstream. The time-averaged flow must obey all the same kin- 
ematic rules as any other flow, which means that the abundant time-averaged vorticity that was present in the 
wake upstream cannot end in the interior of the field any more than the instantaneous vorticity could. And the 
time-averaged flow must obey the RANS equations, with the instability motions and turbulence supplying the 
Reynolds stresses (see Section 3.7) that transport time-averaged momentum. 

So to see the underlying structure of what happens during and after breakup, imagine time-averaging the 
flow in the reference frame moving with the airplane, so as to take out the unsteady motions on the time scales 
of the instability and the eventual turbulence. This is formally the same process we used in Reynolds averaging 
for turbulence modeling (Section 3.7), but in this case the flow we're applying it to is different from the more 
usual applications of Reynolds averaging. For one thing, we'll be averaging out part of the motion (the highly 
coherent motion associated with instability growth) that doesn't look like ordinary turbulence. And over much 
of the flowfield far downstream, the mean-velocity disturbances are smaller than the turbulent fluctuations and 
are effectively invisible before we do the averaging. Furthermore, we'll do this strictly as a thought experiment, 
since no such calculation has actually been done, to my knowledge. We'll do the averaging 1n our imaginations 
and use physical reasoning to try to deduce what the qualitative structure of the time-averaged flow must be. 

First, we'll assume that the airplane and the mean flow are symmetrical about the y = 0 plane. We'll also 
assume that at the start of the instability-growth and breakup process the wake is in the form of a descending 
oval as in Figure 8.1.7 and that the vorticity is still concentrated in two distinct cores, as in Figure 8.1.8b. The 
net vorticity and circulation of the core on either side of the symmetry plane must then be the same as what 
was shed into the nearfield from the corresponding half of the wing. One of the main things we'll be looking to 
explain is where all this mean-flow vorticity “goes” in the farfield, consistent with the requirement that vortex 
lines can't end in the interior of the field. 

Something we know from observations is that things change slowly in the flight direction. Even though the 
breakup process produces much faster change than what ordinary turbulent diffusion could produce, change in 
the mean flow is still slow in the sense that the flight-direction distance over which it takes place is long com- 
pared with any other dimension of the wake. Thus we have an effectively “slender” flow in which mean-flow 
gradients in the direction of the wake axis play a much smaller role in the development of the flow than do 
gradients in other directions. In this sense, the flow in cross-flow planes is similar to the limiting case of 2D 
planar unsteady flow, and as an aid to understanding, it is useful to look at what can happen to the mean vorti- 
city in simple 2D cases. 

In 2D planar flow, the mean vorticity vector can have only one nonzero component, perpendicular to the 
plane of the flow, so we can think of the vorticity as a scalar. We'll start with the example of a single axisym- 


metric vortex core in an infinite 2D domain. At an initial instant, we have some axisymmetric distribution of 
vorticity, as shown in Figure 8.5.6a, and as time goes on it remains axisymmetric but becomes more diffuse, as 
in Figure 8.5.6b. The integrated vorticity and circulation remain constant, even as the vorticity spreads over a 
very large area and becomes locally very weak. 


Figure 8.5.6 Diffusion of vorticity in a single axisymmetric vortex core in 2D unsteady flow. (a) Concentrated 
at an initial instant. (b) More diffuse at a subsequent time 
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Our second example is a pair of counter-rotating vortices of equal strength separated by a plane of symmetry, 
like our vortex wake. In this case, each core has its own total vorticity and circulation in its own half of the do- 
main, but because the two cores have vorticity of opposite signs, the total vorticity and circulation of the entire 
flow are zero. Starting with two cores that are initially distinct, as in Figure 8.5.7a, viscous or turbulent diffusion 
eventually brings the cores into “contact” in the sense that the vorticity gradient becomes significant across the 
plane of symmetry, as in Figure 8.5.7b. Because there isn't any no-slip surface anywhere in the field to interact 
with, vorticity can be destroyed only by interaction with vorticity of opposite sign, and that is what happens 
here. Vorticity is diffused toward the symmetry plane and annihilated there by interaction with the opposite-sign 
vorticity on the other side, and the circulation on each side of the symmetry plane is reduced. In this example, 
not only does the vorticity become more diffuse and locally weaker, but the total vorticity on each side of the 
symmetry plane tends to zero with time. In this 2D domain, vorticity disappears in time, without having vortex 
lines end in space, so Helmholtz's second theorem 1s not violated. 


Figure 8.5.7 Diffusion of vorticity in a pair of counter-rotating vortices in 2D unsteady flow. (a) Vorticity in 
distinct cores at an initial instant. (b) Vorticity diffused into contact with the symmetry plane at a subsequent 
time 








What does this imply for our real 3D flow? As we've already observed, the 3D mean flow throughout the 
region of breakup and decay is “slender” and therefore behaves in a nearly 2D fashion, but it is steady by defin- 
ition in the reference frame of the airplane. Imagine cutting this flow with a constant-x plane fixed to the atmo- 
sphere, so that in the airplane reference frame, the cutting plane moves away from the airplane at the speed of 
the freestream. For the flow that we observe in the cutting plane, the movement in x is analogous to the advance 
of time in the 2D flow we just discussed, and much of what happens will be similar in the two cases. In the 3D 
case, the vigorous “turbulent” motions of the instability growth will diffuse the mean vorticity in all directions, 
including toward the plane of symmetry, just like what we saw in 2D in Figure 8.5.7b, only faster. As our cutting 
plane continues to move downstream, vorticity diffusing toward the symmetry plane will effectively disappear 
there, and the integrated vorticity passing through the cutting the plane on each side of the symmetry plane will 
decrease, as it did in the 2D case. 

The analogy between our 3D steady mean flow and the 2D unsteady case isn't perfect, however, and there 
must be some differences in the details. In the 2D unsteady case, the vorticity disappears in time, while in the 3D 
steady case, it can't do that. In the steady case, for the vorticity piercing a cutting plane to decrease as the plane 
moves downstream, Stokes's theorem (see Section 3.3.5) requires that some of the vortex lines must cross the 
symmetry plane and connect with the vortex lines on the other side. Because only the perpendicular component 
of the vorticity can be nonzero at the symmetry plane, the vortex lines must turn perpendicular to the plane as 
they approach and cross it. Also, since we've assumed that the mean flow is symmetrical, having vortex lines 


cross the symmetry plane constitutes a reconnection of vortex lines that are already connected upstream through 
the bound vorticity on the wing, and such reconnections thus form closed vortex loops. 


The overall structure this leads to is illustrated schematically in Figure 8.5.8, which shows a plan view of the 
wake with the streamwise scale greatly compressed so as to fit the entire development on the page. To illustrate 
the general topology of the vortex lines clearly, we've removed the circumferential component that made the 
vortex lines appear helical in Figure 8.1.5. In a process that starts with the growth of instability motions and 
continues through the breakup and postbreakup turbulence, the vortex core on each side of the symmetry plane 
spreads both inboard and outboard. The vorticity that diffuses toward the symmetry plane turns and reconnects 
with vorticity on the other side. The wake spreads and becomes more diffuse, and at the same time, the net vor- 
ticity (circulation) on each side decreases through the process of reconnection. 


Figure 8.5.8 Plan view of the farfield development of the vortex wake of a lifting wing, showing spreading out 
of vortex cores, vortex-line reconnection, and the formation of closed loops. The streamwise scale is highly 
compressed to allow showing the entire development in one view 
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So far our argument for reconnection has been indirect: We expect vorticity from both sides to disappear at 
the symmetry plane, and the only way it can do that is through reconnection. Now let's try to identify a more dir- 
ect physical mechanism. Reconnection requires the production of a nonzero spanwise component wy = — OW/Ox 
+ Ou/Oz at the symmetry plane. The rotation associated with w) must be opposite to that of the bound vorticity 
on the wing, as illustrated in Figure 8.5.9, and from the sense of the rotation arrows, we can see that this dictates 
negative wy, and thus either positive Ow/Ox or negative Ou/0z or both. A positive Ow/Ox is fairly easy to explain. 
Looking at the vertical-velocity distribution in the wake oval in Figure 8.1.8a, we can see that when vorticity 
diffuses out from the center of a vortex core the associated shear-stress gradient has the effect of reducing the 
circumferential velocities around the core. Thus when vorticity reaches the symmetry plane, it should have the 
effect of reducing the downwash there. A gradual reduction in the magnitude of the downwash (the negative 
Ww) aS we move downstream constitutes a positive Ow/Ox, which is consistent with reconnection. We know that 
things change slowly in the x direction, so that dw/dx is small, and the corresponding wy, is weak, which is con- 
sistent with a reconnection process that is spread out over a long distance. 


Figure 8.5.9 Illustrating the rotation of the bound vorticity and the reconnecting vorticity at the plane of sym- 
metry far downstream in the vortex wake 
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As all of this happens, the cross-sectional shape of the wake probably changes in other ways. In addition 
to the mean vorticity diffusing toward the symmetry plane, some vorticity likely diffuses outward beyond the 
boundary of the oval and is convected upward relative to the oval, forming a vertical “curtain,” as illustrated in 


Figure 8.5.10. 


Figure 8.5.10 Change in cross-sectional shape of the vortex wake as vorticity diffuses beyond the boundary of 
the descending oval. In the reference frame of the descending oval, the distribution of vertical velocity in the 
“curtain” above the oval is similar to that in a viscous wake 
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This distortion of the vorticity distribution relative to the oval has been observed in 2D unsteady calculations 
for laminar vortices in a stratified atmosphere (Spalart, 1996), and Figure 8.5.10 is based on the assumption that 
the same qualitative pattern will occur in the “turbulent” mean flow during instability growth and breakup. The 
mean velocity distribution in the “curtain” is similar to that in the viscous wake of a 2D body, as in Figure 5.3.1. 
In the reference frame of the atmosphere, the fluid in a viscous wake flows toward the body that generated the 
wake, just not as fast as the body is moving away. Similarly, the fluid in the “curtain” is descending relative to 
the atmosphere, just not as fast as the wake oval from which it came. The velocities in the oval are easier to 
visualize in the reference frame descending with the oval, and in this frame the vertical velocity in the “curtain” 
has a conventional-looking wake velocity profile, as illustrated in Figure 8.5.10. 

As we move downstream, and the wake becomes more diffuse, the net vorticity on each side decreases, and 
the descent rate of the oval decreases as well. Thus in side view, the wake should take the general form shown 
in Figure 8.5.11, in which the oval spreads out, and the “tail” grows taller. 


Figure 8.5.11 Side view of the farfield development of the vortex wake of a lifting wing, including the “cur- 


tain.” The streamwise scale is highly compressed to allow showing the entire development in one view 
Oval “Curtain” 





ae 
aoaremeae 






+ Rollup Distinct Instability 
|“ rolled-up growth, breakup, 7 
cores reconnection, decay 


To summarize what we have deduced so far: a time-averaged vortex-wake system as sketched in Figures 
8.5.8 and 8.5.11, consisting of closed loops that include the bound vorticity and the trailing vortices, and are 
closed downstream in a long and diffuse region of time-averaged vortex reconnection. Presumably all of the 
mean vorticity from the wake upstream eventually disappears through reconnection, though how much of this 
is complete before the wake encounters the ground is unknown. The ground, being a surface with a no-slip 
condition and thus capable of generating surface shear stresses, could serve as a sink for some of the vorticity. 
Of course, much of this is impractical to verify experimentally because the mean-velocity disturbances become 
much too small to be detected against the background turbulence in the atmosphere. The downstream reconnec- 
tion of the vorticity is topologically similar to a starting vortex, but it is spread over a very large area, especially 
in the flight direction. The mean vorticity in the reconnection region 1s convected freely with the flow, so it is 
moving downstream at roughly freestream velocity. But at the same time, diffusion of the vorticity by the tur- 
bulent motions is such that the entire time-averaged structure moves with the airplane and is steady in the frame 
of the airplane, unlike the field around an idealized starting vortex. 


The fact that the mean vorticity in the wake disappears in a process that is steady in the reference frame of 
the airplane raises an interesting question: Does this process produce an integrated pressure disturbance at the 
ground? In Section 8.5.2, we argued that a starting vortex at high altitude would not do so because the ideal- 
ized starting vortex was assumed to be steady in the reference frame of the atmosphere, not that of the airplane. 
Now, in our “real” situation, we have reconnecting mean vorticity (nonzero wy) imbedded in a mean flow that 
is steady relative to the airplane, and we might therefore expect it to produce an integrated pressure disturbance. 
But this expectation would be incorrect. The spanwise vorticity wy in the region of vorticity reconnection is 
not bound vorticity like that on the wing; it is free vorticity that is convected with the flow. Free vorticity is 
by definition force-free. Only bound vorticity that experiences a lift force could result in an integrated pressure 
disturbance. The flow in this region appears steady in the frame of the airplane only because of convection and 
diffusion, not because the vorticity is bound to a surface carrying a force. As each filament of wy is convected 
downstream, a new one replaces it by convection, and turbulent diffusion maintains the wy gradients that allow 
the whole field to appear steady even though ay varies with x. 

So much for what happens to the vortex wake. After this process is effectively complete, and the vortex 
wake has decayed into insignificance, is anything left other than undifferentiated turbulence? Well, if the flow- 
field hasn't yet interacted significantly with the ground so as to exchange a significant integrated shear force 
with it, the remaining viscous wake must have a momentum flux deficit corresponding to the total flight-direc- 
tion aerodynamic force on the airplane, including the induced drag. After all, the assumptions behind Equations 
6.1.2—6.1.6 are still satisfied. For an airplane in steady, level, powered flight, the total flight-direction force, in- 
cluding thrust, is zero, and the wake would have no momentum-flux deficit. But for a glider, there would be a 
deficit (in the flight direction, not the horizontal direction), and after the disappearance of the vortex wake, the 
wake would have become a viscous wake with a deficit that represents both induced drag and viscous drag. 


8.6 Effects of Wing Sweep 


Wing sweep can be used to provide better longitudinal stability and trim for a tailless airplane (flying wing), 
but 1t is most commonly used to increase Mach-number capability, whether the airplane has a tail or not. Sweep 
is so effective at increasing critical Mach number that it has become ubiquitous on wings designed to operate 
at high subsonic speeds. It works because compressibility effects in the wing flowfield are determined more by 
the component of Mach number perpendicular to the isobars of the flow than by the magnitude of the Mach 
number. On a swept wing, the isobars tend to sweep more or less with the general sweep of the wing, which can 
make the perpendicular component of Mach number significantly smaller than the magnitude. This mechanism 
was first explained by Busemann in 1935 and was classified as a German military secret shortly thereafter. In 
1945, it was independently discovered by R.T. Jones at the NACA. See Anderson (1997) for an account of the 
history of the idea and its incorporation into airplane design. We'll discuss the idealized theoretical argument in 
Section 8.6.1. 

Sweep can also benefit an airplane by providing static aeroelastic relief. Under the high g-loadings that often 
determine the required strength of the wing structure, a wing generally tends to bend upward. Even if there is 
no torsional deflection of the structure, a simple bending deflection, when combined with aft sweep, produces 
an effective geometric twist, leading-edge-down on the outboard wing. The result is reduced loading outboard 
and reduced bending moments compared with what would happen without the bending. An aft-swept wing can 
therefore usually have a lighter structure than a comparable unswept wing. 

In Section 8.2.1, we discussed the effects of planform on the spanload of a 3D wing, and in Figure 8.2.2, we 
saw how aft sweep moves the additional load due to angle of attack outboard. Because of this effect, aft-swept 
wings must generally be twisted leading-edge down, or washed out, to have an advantageous total spanload like 
that shown in Figure 8.2.2. A favorable spanload that 1s achieved in this way persists only over a limited range 
of angle of attack. 


§.6.1 Simple Sweep Theory 


The original idealized argument for the effectiveness of sweep is amazingly simple, thus the name. It can also 
be called infinite-span yawed-wing theory. The theory is exact for inviscid flows that satisfy the steady Euler 
equations and the geometric assumptions described below, and for real flows over real wing geometries, it can 
still be close enough to the truth to provide useful insight. The theory not only shows why sweep is effective 
in increasing critical Mach number; it provides a simple way of understanding the kinematics of swept-wing 
flows, and it predicts the “2D” effects of sweep on sectional lift and pitching moment. It does not predict the 
effects of sweep on profile drag or maximum lift, because these are viscous effects. 

We assume the wing is a cylinder of airfoil cross-section and infinite span, with its generators parallel to the 
y’-axis, which is yawed at an angle A relative to the y-axis as shown in Figure 8.6.1. Thus in the x’,z’ system, 
the wing is a 2D airfoil, and in the x,y,z system, it 1s just a yawed version of the airfoil. The freestream velocity 
Us is taken to be in the x direction and is assumed uniform far from the airfoil. Note that the sweep angle A 
is thus measured in the x-y plane, which contains Uo. Tying our definition of A to Us in this way makes the 
formulas we'll arrive at below simpler than they would be if we used a sweep angle measured in the more con- 
ventional way, in the chord plane of the wing, which moves with angle of attack (a). Thus if we want to keep the 
conventional sweep angle constant, we must allow A to vary with a. But this doesn't matter for small a. Note 
that the physical principle behind simple sweep theory doesn't require a small-a assumption. In what follows 
we will assume small o only for relationships that involve the chord. Otherwise, the only limiting assumption is 
our unconventional definition of A. 


Figure 8.6.1 Assumed wing geometry and flow situation in simple-sweep theory 





Because the span is infinite, and the airfoil contours and freestream conditions are unchanging in the y’ dir- 
ection, we can assume that all flowfield quantities must be unchanging in the y’ direction. Thus there can be 
no pressure gradient in the y’ direction, and the steady inviscid momentum equation dictates that the velocity 
component in the y’ direction must be constant throughout the field, with the freestream value Uj. 

Now comes the insight that makes this theory so beautifully simple: The solution for the yawed-wing flow 
in the x, y, z coordinate system 1s just the solution for the 2D flow in the x’, z’ coordinate system, with the con- 
stant velocity component Uj added to it. The swept flow and the unswept flow are the same flow, just viewed 
in different reference frames, related by a Galilean transformation in which the constant velocity is added. This, 
of course, means that the pressures throughout the field and the velocity components in the x’, z’ plane are the 
same, and this is reflected in the conversion formulas we'll discuss next. 

The dimensionality of the infinite-span yawed-wing flow is a mixture of 3D and 2D. The flow is 3D 1n the 
sense that it has three non-zero velocity components, but it is quasi-2D in the sense that all flow quantities are 
independent of y’ and can therefore be completely defined as functions of just two independent spatial coordin- 
ates, x’, z’. Though we are now dealing with the inviscid case, this dimensionality situation is the same as what 
we discussed in connection with infinite-span viscous (boundary-layer) flow in Section 4.3.5. There we also 
had three non-zero velocity components, and everything could also be described in terms of only two spatial 
coordinates. But the inviscid case is simpler in one respect: The inviscid velocity in the direction parallel to 
the yawed wing is constant everywhere, defined by the freestream velocity and the sweep angle, while in the 
boundary-layer case the parallel velocity component has a viscous velocity profile that goes to zero at the wall. 


Note that the chord lengths c2p and c3p in Figure 8.6.1 are depicted in the same plane as the freestream ve- 
locity vector, as they would appear at zero a. Thus based on Figure 8.6.1, we have formulas relating geometry 
parameters in the 2D and 3D systems, applicable to the view at zero alpha: 
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And because the thickness of the airfoil is fixed, we have 


Sectional thickness ratio : = : cos A. 

8.6.2 C3) cop 

Formulas for converting dimensionless force and moment coefficients between the 2D unswept flow and the 
yawed-wing flow are simply derived, given that the pressures are the same, and only the reference chord and 
reference free-stream velocity change. The theory guarantees that the pressures will be the same only if the flow 
is inviscid, so that the formulas for forces and moments are valid, strictly speaking, only for inviscid flow. This 
is especially limiting in the case of drag. The theory says nothing about how sectional viscous drag should vary 
with sweep, so the drag formula is limited to the shock drag, the only drag component that can be present in 
inviscid infinite-span flow. The shock drag differs from the lift, following a cos> rule rather than cos , because 
the shock can produce a pressure force only in the x’ direction, and resolving it in the drag (x) direction in the 
swept case introduces an additional factor of cosine. These formulas can be used for rough estimates of the ef- 
fects of sweep in viscous flows, but accuracy is not guaranteed because differences in viscous effects between 
the unswept and swept cases will usually change the pressure distribution. Here we have not assumed small a, 
only the definition of A that we discussed previously. Denoting the yawed-wing case by the subscript 3D, we 
obtain: 
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Dealing with the angle of attack in this framework requires some care. There are two ways of handling angle 
of attack, and if executed correctly, they both yield the same result. Defining angle of attack as a rotation about 
the y’-axis 1s the simpler choice because it preserves the y’ axis as the direction in which all flow quantities are 
constant, and there are thus no additional issues to account for. For small angles of attack, simple geometry then 
gives: 


g¢6.9 Sectional angle of attack: a3p = a2gpcos /A 


This result is correct, but perhaps not entirely satisfying intuitively because a rotation about the y’-axis isn't 
the usual way we change angle of attack. For an airplane, a change in angle of attack is a rotation about a pitch 
axis parallel to the y axis. We can impose an airplane-type angle-of-attack change on the yawed wing, but then 
there is an additional complication we have to account for: the spanwise direction of the wing has been rotated 
relative to the original direction of the spanwise component of freestream velocity, U|. Referring to Figure 8.6.1, 
if we rotate the wing through an angle Aa3p about an axis through point | and parallel to the y axis, the 2D 
alpha change is, for small angles: 
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where Azz? is the vertical movement of the leading edge at point 2, and Aw) is the change in the velocity perpen- 
dicular to the wing chord plane due to the rotation of the wing's spanwise direction relative to Uj). Given that 
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we then have 
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which is the same result as Equation 8.6.9. So we have the same relationship between the effective 2D and 3D 
alpha changes regardless of whether the rotation 1s about the yawed-wing axis or an airplane pitch axis. 


Given Equations 8.6.5 and 8.6.9, we obtain our final conversion formula, valid for small a: 
8.6.14 Sectional lift — curve slope - Cra Cra? pO COS A 


8.6.1.1 Some General Sweep Effects Deduced from Simple Sweep Theory 


In a yawed-wing flow, the critical Mach number is determined by the critical Mach number of the 2D airfoil 
in the perpendicular flow, M2Dcrit. The 3D critical Mach number is thus increased by the sweep, going as the 
inverse of the cosine of the sweep angle according to Equation 8.6.3. While sweep increases critical Mach num- 
ber, it decreases the lift-curve slope and the lift coefficient at a given angle of attack according to Equations 
8.6.14 and 8.6.5. Again, simple sweep theory says nothing about profile drag or maximum lift, because these 
are determined by viscous effects. 

Although the yawed-wing flow is the same as a 2D airfoil flow viewed in a different reference frame, it 
looks more complicated, belying its simple underlying flow structure. First, the stagnation point of attachment 
near the leading edge of the 2D airfoil 1s not a stagnation point in the yawed flow because of the nonzero span- 
wise velocity Uj). Thus instead of a leading-edge stagnation point, we have a leading-edge attachment line, with 
flow along it and diverging away from it, as shown in Figure 8.6.2. We considered the general properties of 
attachment-line flows in some detail in Section 5.2.2. On a yawed wing, the maximum surface pressure is gen- 
erally reached on the attachment line, but because of the spanwise velocity, it is lower than freestream stagna- 
tion pressure. The boundary layer that develops along the attachment line is different from a stagnation-point 
boundary layer in ways we'll discuss in Section 8.6.2. 


Figure 8.6.2 Flow pattern around the leading-edge attachment line of a swept wing 





Another major facet of flowfield behavior related to sweep is the variation in flow direction that takes place 
in response to pressure changes. Figure 8.6.3 illustrates how this works ideally, according to simple sweep the- 
ory. In this diagram, y’ is the direction of the generators of the yawed wing, x’ is the perpendicular direction on 


the wing surface, and Uj and U’ |. are the corresponding velocity components at some point along the wing 


surface. In sweep theory, the spanwise velocity Ujj is constant, while the perpendicular velocity U J changes 
in response to the pressure. Figure 8.6.3 shows several representative velocity vectors consistent with different 


values of ©. and with the requirement that Uj is constant. It is clear that there is a one-to-one corresponden- 


ce between (Land the velocity magnitude and a one-to-one correspondence between the velocity magnitude 
and the flow direction. And, of course, for the steady inviscid flow assumed in simple sweep theory, there is 
then a one-to-one correspondence between the flow direction and the pressure. When the pressure is high and 
the velocity magnitude is low, the flow is in the outboard direction, with the attachment-line flow in the span- 
wise direction as the limiting case (1.e., the minimum velocity magnitude and the maximum pressure). When the 
pressure 1s at the freestream value, the flow is in the freestream direction (assuming that we can ignore the slope 
of the airfoil surface). When the pressure is low and the velocity magnitude is high, the flow is in the inboard 
direction. In the limit, flow in the x’ direction perpendicular to the generators would require infinite perpendic- 
ular velocity. 


Figure 8.6.3 Illustration of the relationship between the velocity magnitude and the flow direction at some 
point along the surface of a swept wing, according to simple sweep theory. Several possible velocity vectors 


are shown, along with their parallel and perpendicular components 
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When such changes in direction in the inviscid flow outside the boundary layer take place, they constitute 
flow curvature. In local physical terms, the curvature is a result of the fact that the sweep skews the pressure 
gradient so that it is not lined up with the local flow direction, and an acceleration perpendicular to the stream- 
lines is the result. This of course has consequences for the flow in the boundary layer, as we saw in Section 
4.1.2 and will discuss further in Section 8.6.2. 


§.6.2 Boundary Layers on Swept Wings 


The development of the boundary layer on a swept wing can be thought of as beginning at the leading edge, but 
the flow situation is more complicated than that at the leading edge of a 2D airfoil. The inviscid attachment-line 
flow near the leading edge of a swept wing is essentially a 2D stagnation-point potential flow with a non-zero 
spanwise velocity added to it, as we saw in Figure 8.6.2. So unlike the stagnation-point boundary layer on a 2D 
airfoil, the attachment-line boundary layer on a swept wing has a non-zero edge velocity. And because of this, 
the attachment-line boundary layer is similar to boundary layers in many other situations, in that it can be either 
laminar or turbulent. However, a notable difference is that, unlike most other boundary layers, the attachment- 
line boundary layer tends not to grow thicker in the flow direction. On a wing where the leading-edge sweep is 
constant along the span, the spanwise velocity and the pressure along the attachment line are nearly constant. 
(According to simple sweep theory, they would be exactly constant.) A 2D boundary layer developing under 
these conditions would be a flat-plate boundary layer, the first case of “simplified” boundary-layer development 
we discussed in Section 4.3, and the boundary layer would grow thicker in the flow direction. In the attachment- 
line boundary layer on a swept wing, this tendency of the boundary layer to grow is offset by the strong diver- 
gence of the flow, and the boundary layer remains thin. 


To understand how flow divergence has this effect, consider first the flat-plate boundary layer. The shear 
force exerted on the flow by the surface (the skin friction) is a retarding force that produces an increasing 
momentum-flux deficit in the boundary layer, which shows up as a positive rate of growth, in the flow direction, 
of the momentum thickness. The skin friction in the attachment-line case is of similar magnitude, but the res- 
ulting momentum-deficient air is carried away by the flow divergence instead of building up as increasing mo- 
mentum thickness. 

The divergence effect can be understood mathematically through plane-of-symmetry boundary-layer theory, 
which is illustrated in Figure 4.3.10. The assumptions of the theory are satisfied exactly in the infinite-span 
yawed-wing case, because the attachment line is straight in that case. In the more general case of a typical 3D 
swept wing, the attachment line generally has some spanwise curvature due to the detailed shaping and aer- 
oelastic deflection of the wing. However, the radius of curvature is usually so large compared with the thickness 
of the attachment-line boundary layer that the effects of curvature are negligible, and the plane-of-symmetry 
theory incurs very little error in practical leading-edge attachment-line flows. In Section 4.3.4, we discussed the 
applicable simplifications to the equations of motion. In the momentum-integral equation for such flows, Equa- 
tion 4.3.13, the divergence effect shows up as the integral of the divergence derivative Ow/0z weighted by the 
velocity defect ue — u, which is consistent with our intuitive idea of momentum-deficient air being carried away 
by the flow divergence. 

On the attachment line in the infinite-span case, the skin friction and the divergence integral term are 1n equi- 
librium, which is consistent with zero d0/dx, and with spanwise invariance of everything in the flow. On the at- 
tachment line of a typical 3D swept wing, conditions vary along the span, but because the distances over which 
changes take place are typically so much greater than the boundary-layer thickness, the boundary layer locally 
achieves very nearly the same equilibrium as it would in the infinite-span case. An example of this 1s illustrated 
in Figure 8.6.4, which shows two sets of numerical solutions for the turbulent attachment-line boundary lay- 
er on the swept wing of a 727-200 (McLean, 1977). One set is based on the spanwise-varying attachment-line 
boundary-layer equations, marched from root to tip, and the other is based on infinite-span equations and local 
conditions at selected stations on the span. The calculated momentum thickness and skin friction vary consider- 
ably along the span, mainly as a result of the varying leading-edge radius. In spite of this, there is practically no 
difference between the two sets of calculations, showing that this turbulent boundary layer is very nearly in loc- 
al infinite-span equilibrium, or in other words, that the quantitative state of the boundary layer is dominated by 
the local flow divergence and skin friction, and carries practically no memory of its history upstream (inboard). 
The same holds for a laminar boundary layer. We'll discuss the factors determining whether the flow is laminar 
or turbulent next. 


Figure 8.6.4 Comparison of spanwise-varying and infinite-span attachment-line solutions for turbulent flow 
on 727—200 leading edge. Outer-flow boundary conditions are from a panel-method solution for Mao = 0.70, 
Cr = 0.342; the boundary-layer calculations are for a unit Reynolds number of 278 000 in, (From McLean, 
1977) 
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So the attachment-line boundary layer at any spanwise location along a typical swept wing is practically 
identical to what the boundary layer would be on an infinite-span attachment line under the same local con- 
ditions. And the infinite-span attachment-line boundary layer is especially simple to characterize. In either the 
laminar or turbulent case, the state of the boundary layer depends on a single parameter, a Reynolds number 
based on a combination of the velocity along the attachment line and the divergence derivative (Cumpsty and 
Head, 1967): 
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where the nomenclature corresponds to what we used for the plane-of-symmetry boundary layer in Figure 


4.3.10. More-recent references on the subject (Poll, 1989) use K. instead, which is equivalent: 
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The momentum thickness depends on F. as shown in Figure 8.6.5, where the laminar case is defined by a 
similarity solution, and the turbulent case by numerical solutions using the Mellor-Herring algebraic eddy vis- 
cosity (McLean, 1977). Figure 8.6.6 shows a comparison between these calculations and the experimental data 
of Cumpsty and Head (1969) for the spanwise velocity profile in the turbulent case. Note in Figure 8.6.5 that 


the boundary layer is always laminar for R. <260 and always turbulent for K. > 600, and that in between, it can 
be either laminar or turbulent. It is sometimes useful to look at these thresholds in terms of Ro of the laminar 
attachment-line boundary layer: The boundary layer is always laminar for Re < 100, and it cannot be laminar 
for Re > 240. We'll discuss laminar-to-turbulent transition issues further in Section 8.6.4. 


Figure 8.6.5 Dependence of boundary-layer state and momentum-thickness on R. for the infinite-span 
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Figure 8.6.6 Spanwise velocity profiles in a turbulent attachment-line boundary layer. Calculations by the 
method of McLean (1977) and experimental measurements of Cumpsty and Head (1969) 
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Away from the attachment line, the boundary layer on the rest of the wing's upper and lower surfaces 1s more 
complicated and 3D. In the outer inviscid flow, the streamline curvature we mentioned earlier is consistent with 
a pressure gradient that is skewed relative to the local flow direction and therefore has a component perpendicu- 
lar to the local flow. Moving closer to the wall and into the boundary layer, we find essentially the same pressure 
gradient, since the boundary layer is thin and the pressure doesn't change much across its thickness. But the 
velocity magnitude is smaller than it is outside the boundary layer, so that the same pressure gradient produces 
greater streamline curvature. The result 1s pressure-driven crossflow of the type we discussed in Section 4.1.2, 


with cross-flow velocity directions as illustrated in Figure 8.6.7. 


Figure 8.6.7 Typical cross-flow directions in the 3D boundary layer on the upper surface of a swept wing 








Accelerating flow: 
crossflow inboard 










Outer-flow 
streamline 






Decelerating flow: 
crossflow outboard 





Typical outer-flow streamlines and surface streamlines (skin-friction lines) for a wing upper surface are 
shown in Figure 8.6.8. Note that the behavior of the outer-flow streamlines is generally what we would expect 
based on simple sweep theory. Over the forward part, where the pressure coefficient is generally negative 
and the velocity magnitude is higher than freestream, the streamlines are oriented generally inboard. In the 
aft pressure-recovery region, they turn toward the outboard direction, and the skin-friction lines follow suit 
by turning outboard even more strongly. On this particular wing, significant outboard turning of the skin-fric- 
tion lines begins at a shock near midchord. We'll consider this flow further below in connection with Figures 
8.6.10—-8.6.11. 


Figure 8.6.8 Typical outer-flow streamlines and surface streamlines (shear-stress lines) on the upper surface of 
a swept wing. From 3D boundary-layer calculations by M.D. Murray using the method of McLean (1977), for 
wing shape derived from the RAE 2822 airfoil 
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Because of the kind of general outboard turning of the flow seen in Figure 8.6.8, swept wings have a repu- 
tation for having problems with “spanwise flow” in the boundary layer. A common impression seems to be that 
boundary-layer fluid typically migrates from inboard to outboard on the aft part of the surface and “piles up” 
outboard, causing earlier separation than would otherwise occur. This reputation overrates the problem. Skin- 
friction lines like those in Figure 8.6.8 don't typically indicate much long-distance migration of fluid prior to 
separation. It is only after separation has developed that fluid can migrate long distances spanwise, as in Figure 
5.2.13b. “Boundary-layer fences,” which are intended to inhibit spanwise flow, can be beneficial, but usually 
only after separation has begun to move forward. 

As in any general 3D boundary-layer flow, separation 1s indicated by the surface-streamline pattern accord- 
ing to the region-of-origin concept we discussed in Section 4.1.4. The usual kind of separation line on a swept 
wing is a surface streamline dividing a region in which the surface streamlines come from the general upstream 
direction from one in which they come from the general downstream direction. Surface streamlines coming 
from upstream often originate from the leading-edge attachment line, but they can also effectively originate 
from a reattachment following a separation. Figure 8.6.9 figuratively illustrates two common varieties of sep- 
aration on a swept wing: separation with reattachment (either a laminar separation bubble with turbulent reat- 
tachment, or a turbulent separation bubble under a shock), and separation without reattachment, in which the 
surface streamlines downstream come from the trailing edge. Remember that separation in 3D 1s not generally 
accompanied by zero magnitude of the skin friction and that at a separation line on a swept wing, the skin fric- 
tion usually has a significant spanwise component. 


Figure 8.6.9 Two types of separation on a swept wing. (a) Separation with reattachment (either laminar separ- 
ation with turbulent reattachment, or turbulent separation and reattachment under a shock). (b) Separation 
without reattachment 
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Figure 8.6.10 Calculated pressure distribution for a swept-wing shape designed by M.D. Murray specially for 
comparing different levels of fidelity in predicting the development of a swept-wing boundary layer 





Figure 8.6.11 Comparison of boundary-layer predictions for the swept wing of Figure 8.6.10, using the equa- 
tion sets from Table 8.6.1. Calculations by M.D. Murray using the method of McLean (1977). (1) General 3D 
boundary layer (basic solution). (2) Infinite span with sweep and taper. (3) Infinite span with sweep, using 
sweep angle at midchord. (4) 2D boundary-layer equations applied to the chordwise (perpendicular) compon- 
ent only. (5) 2D boundary-layer equations applied to the local streamwise component 
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We can use zero Cras a criterion for separation on swept-wings, but we must use the component of Cf per- 
pendicular to the trailing edge, and we must keep in mind that what it can tell us is limited. In infinite-span- 
swept-wing flow, where any separation line must lie straight along a line of constant x/c, the component of Cy 
perpendicular to the constant-x/c lines must be zero at separation, regardless of where on the chord the separa- 


tion occurs. In 3D swept-wing flow, strictly speaking, the zero-perpendicular-Cy criterion works only for incipi- 
ent separation at the trailing edge (1.e., separation that is on the verge of moving forward from the trailing edge). 
In this situation, perpendicular Cris zero at one isolated point and positive everywhere else on the trailing edge, 
as we noted in Section 5.2.3.2, in connection with Figure 5.2.13b. 


A zero-perpendicular-Cy criterion (relative to constant-x/c lines) is sometimes used to identify separation that 
has moved forward from the trailing edge, but it 1s error-prone. In Figure 5.2.13b we saw that the region of neg- 
ative perpendicular Cf at the trailing edge fails to identify the entire separation bubble (there is a portion of the 
separation bubble where perpendicular Cy is positive). If we apply the criterion ahead of the trailing edge, the 
opposite can also happen, and we can be fooled by regions of negative perpendicular Cy that are not part of the 
bubble, as you can imagine if you look just upstream of the separation line on the inboard side of the bubble in 
Figure 5.2.13b. If a bubble is sufficiently long and narrow near the trailing edge so that the flow is locally like 
infinite-span-wing flow, these discrepancies can be minor, and zero perpendicular Crcan be a reasonable separ- 
ation criterion. But to be rigorous in general, locating a separation line on the surface requires mapping out the 
skin-friction direction field and choosing a line that satisfies our region-of-origin definition of 3D separation. 


In some general ways, the response of a 3D swept-wing boundary layer to pressure gradients is similar to 
that of a 2D boundary layer on an unswept airfoil. For example, a favorable pressure gradient increases the mag- 
nitude of the skin friction and slows or even reverses the growth of boundary-layer thickness, and an adverse 
pressure gradient does the opposite. But quantitatively, the boundary layer on a 3D swept wing has a response 
to pressure gradients that is very different from that of a 2D boundary layer. To understand the sources of these 
differences and to get a feel for their magnitude, we'll compare the results of boundary-layer calculations that 
were carried out for nominally the same flow situation but used equation sets that included or excluded various 
effects. These equation sets are listed in the Table 8.6.1 in order of decreasing applicability to a general 3D flow 
and therefore decreasing fidelity. 


Table 8.6.1 Equation sets used in calculations comparing different ways of calculating the boundary layer on a 
swept wing (calculated results shown in Figures 8.6.10— 8.6.12) 


ee 3D boundary-layer equations 
set | 


Equation/Boundary-layer equations for infinite-span with sweep and taper, as described in Section 4.3.5 (uses the actual 
surface pressures from a single longitudinal cut across the wing, but assumes the isobar pattern is “conical” even 
if the real pattern is not, and uses the conical coordinate system of Figure 4.3.12a) 


al surface pressures from a single cut as in 2 above, but assumes the isobars are all swept at the midchord sweep 
angle, and uses the perpendicular coordinate system of Figure 4.3.11) 


face pressures as in 2 and 3 above, converted to 2D by Equation 8.6.4, and the chord length converted to 2D by 
Equation 8.6.1, all done with the midchord sweep angle) 


2D boundary-layer equations applied to the “streamwise” velocity component only (uses the surface pressures 
from a single cut as in 2, 3, and 4 above with no sweep adjustment to the pressures or the chord length) 





A wing shape was specially designed by M. D. Murray for these comparisons. The objective was to produce 
a pressure distribution similar to what might be seen on the upper surface of the outboard wing of a swept-wing 
transport airplane at a transonic cruise condition, with a nearly conical isobar pattern for which equation set 2 
should provide a good approximation for the boundary-layer development. The planform is trapezoidal, with a 
quarter-chord sweep of 35°, taper ratio 0.20, and aspect ratio 9. The initial wing shape was lofted with the RAE 
2822 airfoil, for which we saw calculated 2D pressure distributions in Figure 7.4.27b. The airfoil was used as 
the streamwise cross section, thinned according to simple sweep theory (Equation 8.6.2) for the sweep of the 


midchord line. The camber and twist were then inverse-designed to produce the desired nearly conical isobar 
pattern on the upper surface over the mid-to-outboard portion of the span, with a weak shock just aft of mid- 
chord, as shown in the isobar plot in Figure 8.6.10. Calculated outer streamlines and surface streamlines were 
shown in Figure 8.6.8 and display a pattern typical of this kind of swept-wing flow. Freestream Mach number 
was 0.85, the Reynolds number was typical of full-scale flight, and the boundary layer was assumed to be tur- 
bulent, starting on the attachment line. 

The design and the basic 3D flow solution were calculated by an automated viscous-inviscid interaction 
procedure that couples the transonic full-potential method of Jameson and Caughey (1977) with the finite-dif- 
ference boundary-layer method of McLean (1977), solving the general 3D boundary-layer equations with an 
algebraic eddy-viscosity turbulence model. The basic solution thus used boundary-layer equation set | above, 
with the potential-flow velocity vectors at the displacement surface serving as the outer-flow boundary condi- 
tion. The solution procedure cycled between the potential flow and the boundary layer, updating the wing shape 
by the displacement thickness calculated by the boundary-layer calculations, and stopping when a converged 
shape and flow solution were reached. Then the final inviscid-flow pressures at the displacement surface from 
the basic solution at the span station n = 0.622 were used to define boundary conditions for the comparison 
boundary-layer calculations using equation sets 2, 3, 4, and 5. Chord Reynolds number at this span station was 
31.6 million and the sectional lift coefficient was 0.53. 

Figure 8.6.11 shows the pressure coefficient Cp, plus the perpendicular skin friction coefficient Cfperp, the 
perpendicular momentum thickness Qperp, and the surface-cross-flow angle Bsw (angle between the surface- 
shear-stress direction and the direction of the local outer flow, positive when the surface-shear-stress direction 1s 
outboard relative to the local outer flow direction). The latter three are sensitive indicators of pressure-gradient 
effects. 

Through the shock and as the trailing edge 1s approached, the combined effects of adverse pressure gradients 
and sweep are clearly seen: Cfperp decreases, and there is strong outboard turning, reflected in positive (out- 
board) and rapidly increasing Bsw. 

Because the isobars in this case line up well with the constant-percent-chord lines, the assumptions of the 
infinite-span-with-taper theory (equation set 2) should be nearly satisfied, and, indeed, we see in Figure 8.6.11 
that the results from equation set 2 agree closely with those from the full 3D equations. If taper is neglected and 
sweep is matched only at midchord (equation set 3), the predictions don't match the 3D results nearly as well. In 
particular, Bsw at the trailing edge is about 15° larger for equation set 3, and Cfperp is much closer to zero. Both 
of these are indications that equation set 3 is predicting flow closer to separation at the trailing edge. (Note that 
Cfperp would be zero at the trailing edge at the onset of separation of the type illustrated in Figure 8.6.9b.) 

Thus neglecting taper in this case leads us to overpredict the effects of the adverse pressure gradient aft of the 
shock. We can interpret this result physically in terms of the perpendicular component of the local outer-flow 
velocity. When taper is neglected, the sweep at the trailing edge is higher, and the perpendicular outer velo- 
city is lower. This greater reduction of the perpendicular outer velocity corresponds to a larger adverse pressure 
gradient in the perpendicular direction, which puts more “stress” on the boundary layer (1.e., brings it closer to 
separation). 

Now consider equation sets 4 and 5, in which we use 2D planar boundary-layer equations with different 
boundary conditions. First, note that neither of these 2D calculations accounts for any outer-flow curvature, so 
that the surface-cross-flow angle is zero. Next, note that 2D equations applied to the local streamwise compon- 
ent (equation set 5) underestimate the combined effects of the adverse pressure gradient and sweep on Cfperp 
downstream of the shock, compared with equation set 3. Note also that equation set 5 underpredicts the growth 
of Onerp through the shock and downstream, though this is offset by the fact that Operp is too high just ahead of 
the shock. 


2D equations applied to the perpendicular component (equation set 4) predict the effects of pressure gradient 
on Cfperp fairly well, but overpredict the effects on Qperp. Recall the laminar independence principle from Sec- 
tion 4.3.5, according to which the chordwise (perpendicular) component of a laminar, incompressible infinite- 
span boundary-layer flow can be solved for independently of the spanwise component. According to this prin- 
ciple, solving the 2D equations in the perpendicular direction properly accounts for the combined effects of 
sweep and pressure gradient, in the laminar, incompressible case. In the present turbulent, compressible case, 
solving 2D equations in the perpendicular direction compensates for the effects of sweep on Cfperp, but over- 
compensates for the effects on Operp. 

Nash and Tseng (1971) used boundary-layer calculations to make comparisons similar to our comparisons 
of equation sets 3, 4, and 5, thus investigating the effects of sweep on wing boundary layers, but not the effects 
of taper. They used a typical subsonic airfoil pressure distribution with the suction peak near the leading edge 
and no shock. In their comparisons, equation set 4 came closer to equation set 3 than in our comparisons, and 
equation set 5 did worse than in our comparisons. These differences are probably due to the difference in the 
character of the airfoil pressure distributions. Nash and Tseng also proposed an adjustment to their turbulence 
model that made the predictions using equation set 4 match those using equation set 3 more closely. 

So calculations with different boundary-layer equation sets have established that both sweep and taper, when 
combined with pressure gradients, have substantial effects on the development of a swept-wing boundary layer. 
Thus the temptation to use a simple 2D streamwise analysis to estimate boundary-layer quantities on a swept 
wing should always be resisted. For a given streamwise pressure distribution, a 2D streamwise analysis that ig- 
nores sweep can seriously underestimate the combined effects of sweep and pressure gradients on the integral 
thicknesses and the perpendicular skin friction. 

And, of course, if perpendicular skin friction isn't predicted correctly, separation won't be predicted correctly. 
Recall that Cfperp = 0 is the threshold for infinite-span swept-wing separation, and note in Figure 8.6.11 that 
Cfperp at the trailing edge is farther from zero for the 2D streamwise analysis (equation set 5) than for the 
infinite-span-with-sweep analysis (equation set 3). This trend is typical, and it becomes more pronounced in 
flows that actually separate: If a swept wing and an unswept wing have the same streamwise pressure distri- 
bution, and the flow on the unswept wing separates somewhere on the chord, the flow on the swept wing will 
generally separate significantly earlier. Thus if a 2D streamwise boundary-layer analysis is applied to a swept 
wing, it will generally fail to predict separation soon enough and will overestimate the maximum-lift capability. 

To illustrate this, M.D. Murray made comparison calculations (method of McLean, 1977) for one of the 
idealized pressure distributions that Smith (1975) used to study separation in 2D in Figure 7.4.13. Smith's ca- 
nonical pressure distribution for m = 1/2, Xo = 0.25 was converted to a conventional Cp distribution, assuming 
CpTE = 9.20 as was done for the families of idealized conventional pressure distributions in Figure 7.4.14. The 
result is shown as the dashed line in Figure 8.6.12. Calculating the boundary layer for this case as a 2D flow 
(corresponding to the “2D perpendicular” option, equation set 4) yields a predicted separation point in reason- 
able agreement with Smith's result, as should be expected, given the similarity of the eddy-viscosity models 
used in the two codes. Using simple sweep theory to convert this pressure distribution to 35° sweep yields the 
solid curve. Using this swept pressure distribution as input to an infinite-span-swept-wing analysis (equation set 
3) leads to a predicted separation point very close to the predicted 2D separation point. Using the swept pressure 
distribution as input to a 2D streamwise analysis (equation set 5) leads to a predicted separation point much 
farther downstream. This underscores how unsuitable a 2D streamwise analysis will generally be for predicting 
separation on a swept wing. 


Figure 8.6.12 Assumed pressure distributions and calculated separation points for boundary-layer calculations 
illustrating the effect of sweep on separation. The baseline 2D case is Smith's (1975) canonical pressure distri- 
bution from Figure 7.4.13 for m = 1/2, x9 = 0.25 ft. The other calculations were made by M. D. Murray using 

the method of McLean (1977), and used the same equation sets 3, 4, and 5 that were used for the comparisons 
in Figure 8.6.11. The 2D streamwise analysis (equation set 5) predicts separation far downstream of the separ- 
ation predicted by the swept-wing analysis 
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These are the kinds of conclusions we can draw regarding the generic effects of sweep and taper, by compar- 
ing calculations with other calculations, but how do the best of such calculations compare with the real world? 
First, consider what we ought to expect, based on limitations of turbulence modeling. Turbulence models that 
are currently used in routine calculations of swept-wing flows generally assume that the eddy viscosity (see 
Section 3.7) is isotropic, that is that it acts the same in all directions. However, experimental measurements 
have indicated that the isotropic assumption 1s seriously violated in swept flows with adverse pressure gradi- 
ents. For example, in the measurements of van den Berg and Elsenaar (1972) in a boundary-layer test rig (not 
a real “wing” geometry) that approximated infinite-span-swept-wing conditions, the cross-flow eddy viscosity 
was typically less than half the streamwise eddy viscosity 1n a strong adverse pressure gradient typical of the aft 
upper surface of a swept wing. Here, “cross-flow” and “streamwise” were defined relative to the flow direction 
at the boundary-layer edge, the same convention we used in discussing 3D velocity profiles in Section 4.1.2. 

How much difference should this make? We can get some insight from boundary-layer calculations by the 
author (McLean, 1977) for the van den Berg and Elsenaar experiment. One set of calculations used an isotropic 
eddy viscosity model, and another used a crude model in which the cross-flow eddy viscosity was set to 0.4 
times the streamwise eddy viscosity, which is nonisotropic to roughly the same degree as indicated by the ex- 
perimental measurements. The calculated Cy magnitude and minimum flow angle £1 relative to the spanlines 
are shown in Figure 8.6.13. A swept separation line is assumed to be located where 8 goes to zero, and the 
nonisotropic calculation agrees much better with the separation location inferred in this way from the measure- 
ments. Velocity magnitude and direction ($s relative to the outer-flow direction) profiles are compared in Figure 
8.6.14. Here also, the nonisotropic model moves the calculations in the right direction, just not enough to match 
the measurements very well. The main conclusion from these comparisons is that the decreased cross-flow eddy 
viscosity measured in the experiment can have strong effects on the development of a swept-wing boundary 


layer in an adverse pressure gradient, especially as it approaches separation. Thus for the flow on the aft upper 
surface of a swept wing, we should not expect high accuracy from CFD calculations using the usual isotropic 


eddy-viscosity models. 


Figure 8.6.13 Cymagnitude and minimum flow angle 1 relative to the spanlines for the “swept wing” experi- 
ment of van den Berg and Elsenaar (1972), compared with 2.5D boundary-layer calculations by the author 
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Figure 8.6.14 Velocity magnitude and direction profiles for the “swept wing” experiment of van den Berg and 
Elsenaar (1972), compared with 2.5D boundary-layer calculations by the author (McLean, 1977) 
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Now let's look at a real 3D wing case. As of this writing, I have seen only one example with comparisons of 
detailed velocity profiles, a comparison of RANS calculations with the experimental measurements of Brebner 
and Wyatt (1961) on a 45° swept wing in low-speed flow. The RANS code is CFL3D (Biedron and Rumsey, 
1998) with the SST turbulence model of Menter (1994), in which the eddy viscosity is isotropic. Figure 8.6.15 
shows the calculated surface streamlines (shear-stress lines), and Figure 8.6.16 compares calculated and meas- 
ured velocity magnitude and direction profiles at three stations along the chord at 50% semispan on the upper 
surface. The calculated shear-stress lines indicate that the flow is approaching separation at the trailing edge. 
Note that because of the higher sweep, separation involves less turning of the flow than it did in the van den 
Berg and Elsenaar flow. The calculated velocity magnitude and direction profiles agree much better with the 
data than the isotropic boundary-layer calculations did for the van den Berg and Elsenaar flow in Figure 8.6.14. 


Figure 8.6.15 Surface-streamline pattern calculated by CFL3D with the SST turbulence model for the experi- 
mental swept-wing case of Brebner and Wyatt (1961). (Calculation by N.J. Yu. Plot by T.J. Kao) 








Figure 8.6.16 Comparison of velocity profiles calculated by CFL3D with the SST turbulence model with the 
experimental measurements of Brebner and Wyatt (1961). (Calculation by N.J. Yu) 
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There are a couple of possible reasons for the better-than-expected agreement for the Brebner and Wyatt 
flow. The first 1s that the SST model is a modern two-equation model that does better in predicting adverse- 
pressure-gradient effects in 2D flows than does the algebraic eddy viscosity used in the old boundary-layer cal- 
culations, and some of this improvement should carry over to 3D. Another is that there is less flow turning than 
in the van den Berg and Elsenaar flow, though the difference is small. A more important reason 1s probably that 
the adverse pressure gradient in the Brebner and Wyatt flow occupies practically the whole chord and is there- 
fore relatively gentler. In the van den Berg and Elsenaar flow, there was some distance of flat-plate boundary- 
layer growth ahead of the start of the adverse gradient, so that the boundary layer was thicker at the start of the 
recovery, or in airfoil terms, the effective recovery-point location was somewhere aft on the chord (recall our 
discussion of the importance of the recovery-point location in connection with maximum lift in Section 7.4.3). 
The van den Berg and Elsenaar flow should thus be a more difficult test case for the turbulence models. It is 


also probably closer to being representative of the flow on the swept wing of a modern transport airplane than 
the Brebner and Wyatt flow is. 

Though the predictions were better than expected for the Brebner and Wyatt flow, we've seen that an isotropic 
eddy-viscosity model poses a potentially serious limitation in CFD calculations of the development of the 
swept-wing boundary layer in an adverse pressure gradient, which is precisely where the boundary layer has 
its biggest impact on profile drag and on separation behavior. This shortcoming of the turbulence models could 
significantly degrade the prediction of all aspects of swept-wing performance. 


8.6.3 Shock/Boundary-Layer Interaction on Swept Wings 


In Section 7.4.8, we discussed shock/boundary-layer interaction in transonic flow around a 2D airfoil. We noted 
that this interaction thickens the boundary layer and increases profile drag, and that the boundary-layer thicken- 
ing produces a “viscous wedge” effect that smears the shock pressure rise and modifies the downstream condi- 
tion. Instead of looking like a normal shock, the pressure jump is close to that of an oblique shock with the max- 
imum turning angle, for which the downstream Mach number is typically just below sonic. We also discussed 
shock-induced separation and noted that as the upstream Mach number Mj increases, a separation bubble ap- 
pears at M1 ~ 1.30, followed by massive separation at M1 ~ 1.40, largely independent of Reynolds number. 

Sweep complicates this picture surprisingly little. Of course, if the shock is swept relative to the local up- 
stream flow, it will turn the flow slightly in the lateral direction and produce cross-flow in the boundary layer, 
even if there was none to start with. But the turning and the cross-flow are just reflections of the fact that sweep 
introduces a velocity component parallel to the shock, in the lateral direction, and the presence of this parallel 
component has relatively little effect on the flow development that we noted in 2D. If we look only at the velo- 
city component perpendicular to the shock's footprint on the surface, we should see something close to what we 
saw in the 2D case, at least qualitatively. 

The physical argument for this relies on our observations regarding infinite-span swept-wing boundary layers 
in Section 8.6.2. Our general swept wing flow may not be much like an infinite span wing at all: The isobar 
pattern may be highly three-dimensional, and the shock may not line up with the constant x/c lines. But in the 
neighborhood of the shock, the infinite-span simplifications should apply very well. Within the region of the 
shock pressure rise, the pressure gradient perpendicular to the shock footprint 1s much, much stronger than the 
gradient in the parallel direction, and the general sweep of the isobars matches the sweep of the shock footprint 
very closely. To apply the infinite-span simplifications, all we have to do 1s take the sweep of our local “infinite- 
span wing” to be the local sweep of the shock footprint. 

Recall that in laminar, constant-property flow on an infinite-span swept wing the perpendicular (“chord- 
wise’’) velocity component develops completely independently of the parallel (“spanwise”) component, accord- 
ing to what we called the /aminar independence principle. In turbulent flow, the independence principle doesn't 
hold exactly because the Reynolds stress in the chordwise direction is affected by the flow in the spanwise direc- 
tion. But the calculations compared in Figure 8.6.12 indicated that this effect 1s not large, and that the chordwise 
velocity profile is not strongly affected by the presence of the spanwise flow. Applying this observation to the 
region of the shock pressure rise, we can argue that the velocity component perpendicular to the shock footprint 
should develop close to the way it would if the parallel velocity component weren't there (1.e., the way it would 
in a 2D case with the same incoming perpendicular velocity profile). Then given that the gradients of all flow 
quantities in the parallel direction are much smaller than those in the perpendicular direction, the displacement 
thickness should be close to what it would be in 2D, and the smearing of the pressure rise and even the separa- 
tion behavior should also be close. 


The above line of argument served as the basis for a semi-empirical model for swept-wing shock/boundary- 
layer interaction for use in CFD calculations by viscous-inviscid coupling (McLean and Matoi, 1985). 


§.6.4 Laminar-to-Turbulent Transition on Swept Wings 


In Section 4.4.1, we discussed the general physical mechanisms and the associated theories of laminar-to-turbu- 
lent transition in boundary layers. The familiar pattern is for the boundary layer to start laminar at the nose of a 
body or the leading edge of an airfoil and to transition to turbulent after some boundary-layer growth has taken 
place. Transition may be caused by unstable growth of small disturbances or by direct introduction of large dis- 
turbances by the outer flow or by interaction of the flow with surface imperfections. The boundary layer on the 
leading edge of a swept wing doesn't always follow this simple pattern. Because of the spanwise flow along the 
leading edge, which we discussed in Section 8.6.2, the attachment-line boundary layer itself can be turbulent. 
In the usual case of an aft-swept wing, the “upstream” initial condition for the attachment-line boundary layer 
is at the wing-fuselage junction, and the fuselage boundary layer at the junction is usually turbulent. Thus the 
attachment-line boundary layer often gets a turbulent start (sometimes referred to as leading-edge contamina- 
tion) and remains turbulent as far outboard as conditions will support turbulence, which may or may not be the 
full span of the leading edge. It is not unusual for the turbulent attachment-line boundary layer to transition from 
turbulent to laminar somewhere along the span, as the leading-edge radius and the resulting Reynolds number 
of the flow decrease. 

On a forward-swept wing, the attachment-line boundary layer can get a laminar start at the tip. As the 
leading-edge radius grows larger inboard, the boundary layer can become thick enough to become unstable to 
small disturbances, shortly after which it would transition to turbulent. 

So we see that the attachment-line boundary layer can undergo transition of either the laminar-to-turbulent 
or turbulent-to-laminar variety. Two thresholds thus play roles in determining the state of the boundary layer: 
one with increasing Reynolds number at which the laminar boundary layer becomes unstable and one with de- 
creasing Reynolds number at which the turbulent boundary layer becomes unable to sustain turbulence (see 
Poll, 1979, 1989). Given that the attachment-line boundary layer is almost always practically identical locally 
to what it would be on an infinite-span attachment line, the two thresholds will generally be very close to what 
we saw in Figure. 8.6.5. Under infinite-span conditions, the linear-instability threshold for the laminar boundary 


layer occurs at R = 600, above which the boundary layer will always be turbulent. The threshold below which 
the boundary layer cannot sustain turbulence, and must therefore be laminar, 1s at R = 260. Between 260 and 
600, the boundary layer can be either laminar or turbulent, depending on conditions upstream. 

This is the situation for an attachment-line boundary layer on a surface without area suction. If area suction 


is applied along the attachment line, Re will be reduced below the Roe( R ) curve shown in Figure 8.6.5. In the 
case with suction, the laminar-turbulent thresholds remain very close to the same Re values as for no suction, 


not the same R values. Thus a more general way to characterize the thresholds, valid with and without suction, 
is to say that the boundary layer is always laminar for Re < 100, and it cannot be laminar for Re > 240. 


Several different devices have been demonstrated that can divert turbulent flow from the attachment line and 
give the boundary layer a laminar start. The Gaster bump (Gaster, 1965), shown in Figure 8.6.17, stagnates and 
sheds the turbulent boundary layer coming from inboard and starts a new laminar boundary layer at the stagna- 
tion point on the nose of the bump. Gaster's experiments showed the bump to be effective only up to nominal 


R values around 420, presumably because the outboard part of the bump has increased sweep and larger local 


R . Seyfang (1987) tested an assortment of other devices at an R value of about 360. Figure 8.6.18 shows the 


minimum sizes, relative to leading-edge radius, at which they were effective. Whether they would be effective 


at higher values of R. is not known. 


Figure 8.6.17 The Gaster bump (Gaster, 1965), a device for decontaminating the attachment line and giving 
the boundary layer a laminar start 
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Figure 8.6.18 Other devices for decontaminating the attachment line and giving the boundary layer a laminar 
start, tested by Seyfang (1987). Numbers refer to the minimum sizes at which the devices were found to be ef- 
fective, relative to LER (leading-edge radius) 
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Of course, the attachment-line boundary layer provides the starting condition for the boundary layer on the 
rest of the wing. On parts of the span where the attachment-line boundary layer is turbulent, the boundary 
layer off of the attachment line will also be turbulent, unless relaminarization occurs. In relaminarization, the 
boundary-layer turbulence dies out away from the attachment line, which generally requires that the chordwise 
acceleration must become much larger off of the attachment line than it was on the attachment line itself (Laun- 
der and Jones, 1969). This tends to happen only under high-lift conditions in which the attachment line is on the 
lower surface, well off of the leading edge, in which case the stronger acceleration required for relaminarization 


takes place as the flow rounds the leading edge in going from the lower surface to the upper surface. Relaminar- 
ization generally cannot occur when the attachment line is close to the leading edge, as it usually is under cruise 
conditions. In cruise, a turbulent attachment line generally leads to a turbulent boundary layer downstream. 


A long run of laminar flow for cruise-drag reduction thus Is requires a laminar attachment line. Unless suction 
is applied on the attachment line itself, this requires that R be kept below 600, as an ideal upper limit. But 
practical limits are lower. For values of R above 260 without suction, a Gaster bump or other decontamina- 


tion device will usually be required, and the Gaster bump has been shown to work only up to i. = 420. Then, 
even if the Gaster bump is successful, the laminar attachment-line boundary layer outboard will be vulnerable 


to unintended tripping by surface imperfections or contamination as long as R is above 260. Robust design 


for a laminar attachment line thus requires that R be kept below 260 or that suction be applied to thin the 
attachment-line boundary layer and thus to keep Re below 100. 

When the attachment line is laminar, transition to turbulence off of the attachment line follows the general 
outline we discussed in Section 4.4.1: receptivity to disturbances from the outer flow or surface imperfections, 
linear growth of small disturbances, and nonlinear breakdown. The small disturbances can be either traveling 
TS (Tollmien-Schlichting) waves or stationary CF vortices. (Or, if the freestream turbulence level is high, as 
in some wind-tunnel situations, traveling CF vortices may come into play.) In the linear range, TS and CF dis- 
turbances grow independently of each other. For purposes of predicting transition, the TS and CF growth rates 
can therefore be computed separately, and transition identified when the accumulated growth of either passes a 
threshold e”. In practice, it is advisable to apply the variable-n-factor method we discussed in Section 4.4.1, in 
which the values used are determined by comparison with experiments with flow environments and levels of 
surface quality comparable to the target application. 

We see from the above that the possibility of transition by CF instability is another factor that makes design- 
ing a swept wing for laminar flow different from designing a 2D airfoil for laminar flow. As we noted in Section 
4.4.1, the growth of CF disturbances requires a substantial amount of mean cross flow of the kind illustrated 
in Figure 4.1.7b. This in turn requires a pressure gradient. In applications of laminar flow to swept wings, CF 
tends to be critical only in the region of favorable gradient near the leading edge, and it determines how large a 
leading-edge radius and how much leading-edge sweep can be present without provoking early transition. Nat- 
ural laminar flow (NLF) without surface suction is feasible on transport airplane wings, but only with leading- 
edge sweep angles somewhat lower than 1s currently typical of all-turbulent wings. If the sweep is higher than 
this, or the airplane is very large, laminar flow control by suction (see Joslin, 1998) would generally be re- 
quired to prevent early transition by CF growth. CF disturbances tend to have high receptivity to disturbances 
caused by surface imperfections (distributed or discrete 3D roughness), and as a result, maintaining laminar 
flow around a swept leading edge can require very high levels of surface quality and very low levels of surface 
contamination, such as by insects (see Saric, Reed, and White, 2003). 


§.6.5 Relating a Swept, Tapered Wing to a 2D Airfoil 


Before 3D transonic CFD became routinely available, design technologies for swept wings were typically deve- 
loped first on 2D airfoils. Today, 2D airfoil development is not nearly so important, but it is still used for early 
explorations of new ideas. So, what can we say about the correspondence of the performance of a swept, tapered 
wing to that of an “equivalent” 2D, unswept airfoil? In Section 8.6.2, we saw that turbulent boundary layers in 
the two cases behave in a qualitatively similar way, but that there are significant quantitative differences. Thus 
trends in performance should be similar, but the quantitative accuracy of comparisons will be limited. 


And boundary-layer behavior is not the only limitation on the equivalence. In the inviscid part of the flow, 
precise equivalence between 2D and 3D holds only in the case of infinite span with no taper (simple sweep 
theory). There is no rigorous way to take the effects of taper into account in a 2D context, as there is with the 
effects of sweep. Lock (1964) proposed that an approximate kind of inviscid equivalence requires the 2D airfoil 
to duplicate the perpendicular-Mach-number distribution on the surface of the 3D wing, that is, the component 
of Mach number perpendicular to the constant-percent-chord lines. The usefulness of this recipe in practice, 
however, has proved to be limited. If the freestream Mach number is chosen to correspond to the sweep at the 
quarter-chord or midchord of an aft-swept wing, the 3D perpendicular Mach number at the trailing edge is so 
high that the only way to match it on the 2D airfoil is to have an open (squared-off) trailing edge. The equi- 
valence between a wing with a sharp trailing edge and an airfoil with an open trailing edge would always be 
questionable, especially with regard to viscous drag. 

So, while 2D airfoil development may be useful for assessing general trends, its quantitative equivalence to 
a swept, tapered wing is limited. 


8.6.6 Tailoring of the Inboard Part of a Swept Wing 


The inboard part of a swept wing poses a design problem that is more 3D in nature than that of the outboard 
wing. The center plane of symmetry and the fuselage combine to restrict spanwise flow, so that the bound vortex 
lines of an aft-swept wing tend to unsweep and move aft, as shown in Figure 8.6.19. The resulting aft movement 
of the inboard lift loading is unfavorable for profile drag and can increase shock drag. It can also increase trim 
drag because the aft movement of the load requires a larger download on the horizontal tail for trim. To coun- 
teract this, the airfoil shapes of the inboard wing are usually specially tailored to move the loading forward. An 
additional consideration is that under transonic conditions the inboard wing can affect the pressure distribution 
and shock position far outboard. 


Figure 8.6.19 Illustration of the tendency of the bound vortex lines on the inboard part of a swept wing to un- 
sweep and move aft 





side of body 


Chapter 9 


Theoretical Idealizations Revisited 


In the preceding chapters, we looked at a wide range of aerodynamic phenomena, and we saw numerous ex- 
amples of how theories based on conceptual models and simplifying assumptions allow us to construct explana- 
tions and make predictions without having to solve the full equations of motion. In this chapter, we'll take another 
look at the theoretical landscape, this time from the point of view of the models rather than the phenomena, taking 
a kind of inventory of the conceptual tools at our disposal. This will lead us to cover some of the ground we've 
already covered, but this time for the purpose of putting it all in a broad theoretical perspective, rather than the 
more phenomenological one we've taken so far. 

In Section 9.1, we'll look at the quantitative theories, categorizing them according to the type of model and 
what was done to simplify the equations. In Section 9.2, we'll explore the conceptual side, the tools available to 
us for doing Mental Fluid Dynamics (MFD). 


9.1 Approximations Grouped According to 
how the Equations were Modified 


The need for ad hoc models and simplifying assumptions was originally driven by the computational intractability 
of the full equations. This need is not as great now as it once was, but it is still there. Computers and solution 
algorithms have progressed to the point that solutions to the full NS (Navier-Stokes) equations for laminar flow 
can be calculated routinely, even for complicated 3D problems. Practical problems, however, almost always in- 
volve turbulent flow. In Section 3.7, we noted that Direct Numerical Simulation (DNS) of the turbulent motions 
in practical turbulent flows is beyond our reach and will be for decades to come. Thus for practical predictions of 
flows with turbulence, we must settle for the compromised accuracy that comes with turbulence modeling. And 
in many situations, further retreats from the full NS equations are useful as well. 

Of course, with all retreats from the “full” equations come reductions in physical fidelity and range of applic- 
ability, and here we'll try to list them for a wide range of different types of theories. Though the losses in fidelity 
are easy to identify in general terms, they are not usually that easy to quantify. Theoretical arguments can some- 
times determine the order of the error incurred by an approximation (how the error should be expected to grow 
as a function of some small parameter, for example). But the practical significance of the error can generally 
be determined only by comparison with more-accurate theories or with experimental measurements. Errors can 
usually be expected to depend strongly on the particular situation, and knowing what to expect in a given case 
generally requires experience with closely related cases. 

To get a big-picture perspective on the landscape of theoretical approximations, we'll group them in tables of 
the major examples, categorizing them according to the types of things that were done to “simplify” the equa- 


tions. We'll also identify what flow physics was assumed in addition to basic equations, the range of applicabil- 
ity of the resulting theory, and the loss of fidelity, if any. 


9.1.1 Reduced Temporal and/or Spatial Resolution 


For turbulent flows at practical Reynolds numbers, full temporal and spatial resolution of the flow is simply not 
feasible, and we must resort to time-averaging of the turbulent motions, combined with turbulence modeling. 
Then there are some situations in which even time-averaged spatial resolution of the distributions of flow quant- 
ities isn't needed, and the global view provided by control-volume analysis suffices. These two major types of 
approximation are summarized in Table 9.1.1. 


Table 9.1.1 Reduced spatial and/or temporal resolution 
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9.1.2 Simplified Theories Based on Neglecting Something 
Small 


In this class of approximations, some weak effect is either neglected altogether, or is accounted for only in some 
approximate way, often just to first order. For flows at high Reynolds numbers, the effects of viscosity can be 
either neglected or approximated. For flows at very low Reynolds numbers, the effects of inertia can be treated 
in the same ways. And if a body is very slender, the flow disturbances it causes can be linearized. Approxima- 
tions of this type are summarized in Table 9.1.2. 


Table 9.1.2 Simplified theories based on neglecting something small 
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9.1.3 Reductions in Dimensions 


In many situations, the variation of the flow in time or in one or two of the spatial dimensions is so slow 
that 1t can be ignored or approximated. And in some of these situations, such as the flow along a very slender 
streamtube of a larger flow, or the local flow through a shock, the simplifying assumptions become practically 
exact in the limit, and the resulting theories are highly accurate. Approximations of this general type are sum- 
marized in Table 9.1.3. 


Table 9.1.3 Reductions in dimensions 
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9.1.4 Simplified Theories Based on Ad hoc Flow Models 


In our final category, we have theories based on ad hoc models for the structure of the flow field. Such models 
don't have to reflect reality with great fidelity, but only well enough to scale in roughly the right way and to 
yield results that are accurate enough to be useful. The main examples of such theories are summarized in Table 
9.1.4. 


Table 9.1.4 Simplified theories based on ad hoc flow models 
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9.1.5 Qualitative Anomalies and Other Consequences of 


Approximations 


The simplified theories listed in Tables 9.1.1—9.1.4 incur a variety of losses in physical fidelity relative to the 
full NS equations. But to keep this in perspective, we should recall that the NS equations themselves are not 
exact. The continuum representation that they embody is only an approximation to the integrated behavior of 
the molecules that make up the gas. 

As a result, even the full NS equations produce inaccurate predictions when the assumptions underlying 
them are violated. For example, the NS equations don't predict the correct distributions of flow quantities 
through a shock because the very high gradients violate the assumption of local thermodynamic near-equilib- 
rium. However, because the equations embody all the relevant conservation laws, the jump conditions across a 
shock are correctly represented, even though the internal details of the shock are not. 

Solutions to the NS equations also display physically anomalous behavior when boundary conditions change 
too suddenly. For example, when a solid surface has a sharp convex corner, such as at a sharp trailing edge of 
an airfoil, the surface shear stress predicted by the NS equations goes to infinity (singularly) at the corner. This 
happens on such a small scale that the grids used in most NS calculations don't begin to resolve the singular- 
ity or even to hint at its presence. The presence of the singularity has nothing to do with the breakdown of the 
continuum assumptions at very small scales, but of course the assumptions must break down close to the singu- 
larity. 

Approximating the effects of viscosity through boundary-layer theory introduces another kind of possible 
nonphysical singularity. When direct-mode solutions to the 2D boundary-layer equations predict separation 
from a smooth wall, the skin friction goes through zero with infinite slope, the Goldstein singularity depicted 
in Figure 4.2.11. In direct-mode solutions to the 3D equations, the skin friction perpendicular to the separation 
line displays the same singularity. 

Neglecting viscosity can introduce qualitative anomalies in flows over surfaces with sharp corners. In a solu- 
tion to the incompressible potential-flow equation, a sharp concave corner shallower than 90° produces a sin- 
gular point of zero velocity with infinite velocity gradient on both the upstream and downstream sides (Figure 
3.10.2c), and a sharp convex corner causes a singularity with infinite velocity (Figure 3.10.2b). 

The situation at a convex corner is more complicated if the flow is compressible. In an otherwise subsonic 
flow governed by the Euler equations, attached flow around a sharp convex corner cannot produce infinite ve- 
locity, as it would in incompressible flow. It would, however, cause an acceleration to a sonic condition at the 
corner, with a Prandtl-Meyer expansion fan (see Shapiro, 1953) centered at the corner, producing a jump to su- 
personic speed on the downstream side. The corner would thus have a singularity with infinite acceleration, not 
infinite velocity, and downstream of the corner the supersonic flow would be terminated by a shock. This kind 
of attached-flow solution can exist only if the turning angle of the corner does not exceed the maximum Prandtl- 
Meyer turning angle of 130.5°. For typical airfoil shapes with sharp trailing edges, the turning angle around the 
trailing edge is too large (typically in excess of 170°), and the zero-lift flow pattern of Figure 7.1.3a is there- 
fore impossible. This raises the interesting theoretical possibility that even an inviscid airfoil flow, provided it is 
compressible, would still have to satisfy the Kutta condition, as in Figure 7.1.3b. And even if the turning angle 
is less than 130.5°, the existence of an attached-flow solution is not guaranteed. It would depend on how much 
additional subsonic pressure recovery the flow faces downstream of the shock. If the shock terminating the su- 
personic bubble is strong enough, the resulting total-pressure deficit downstream could make it impossible for 
the flow to negotiate the rest of the pressure recovery. Finally, even when an attached-flow solution with a fan 
and a shock exists, it is only a theoretical possibility. In the real, viscous world, the boundary layer will either 


effectively fair over the corner, if it is shallow enough, or separate, eliminating the need for acceleration to su- 
personic flow. 

In 3D numerical solutions of the Euler equations, a sharp convex corner can produce separation of the open, 
free-shear-layer type. An example would be the leading and trailing edges of a sharp-edged delta wing. In these 
situations, numerical dissipation has the same qualitative effect that real viscosity would have in bringing about 
the separation and enforcing a kind of de-facto Kutta condition. The mechanics of the resulting free shear layer 
are discussed in Section 10.4.3 and illustrated in Figure 10.4.1. 

Free shear layers or shocks can also be modeled in solutions to inviscid equations as artificial discontinuities 
(artificial surfaces across which discontinuities are allowed), though even a shock has nonzero thickness in the 
real world. We discussed modeling shocks as discontinuities in Section 3.11.2. When such a shock discontinu- 
ity intersects a solid surface with convex curvature, the pressure and velocity distributions downstream of the 
shock are singular, the Zierep singularity that we discussed in Section 7.4.8. 

A discontinuity tangent to the flow (a slip surface) can be used to model a physical shear layer such as can 
occur downstream of an intersection of shocks of different strengths. And as we discussed in Section 8.1.2, the 
shear layer in the wake of a 3D lifting surface is often modeled as a slip surface, or vortex sheet. Ideally, such a 
vortex sheet should be allowed to deform as required, so as not to have flow across it. In simplified models of 
lifting-surface flows, however, this requirement is usually allowed to be violated. An undeformed shape 1s as- 
sumed for the vortex sheet, and flow is allowed to pass through the sheet, as in the lifting-line theory discussed 
in Section 8.2.3, for example, and in many of the panel methods for lifting 3D potential flow that we'll discuss 
in Section 10.2.3. 


9.2 Some Tools of MFD (Mental Fluid 
Dynamics) 


In thinking about any aerodynamic flow for purposes of basic understanding or mental prediction, idealized 
models can be very helpful, even if they don't apply with great accuracy to the situation at hand. Even a model 
that is only a crude match for the flow in question can provide valuable insight. Of course care must be ex- 
ercised in applying this kind of analysis, to avoid stretching an analogy too far or falling victim to any of the 
common fallacies. 

The physical and theoretical bases for many such models were discussed in earlier chapters. In this section, 
we'll revisit some of these models, both as a summary and with an eye toward the sorts of flow situations in 
which they can be useful. 


9.2.1 Simple Conceptual Models for Thinking about 
Velocity Fields 


There are numerous ways that simple conceptual models can be invoked in thinking about the flow field a body 
will produce. Here we list some and briefly discuss how they apply. 


9.2.1.1 Thinking of a Lifting Surface as Pushing the Flow in the Direction 
Opposite to the Lift Force 


When a lifting surface such as a wing, tail surface, or a vortex generator (VG) produces lift, it pushes the flow 
between the tips (or between the root and the tip, 1f we are talking about a half-wing mounted on a wall) in the 
direction opposite to the lift force. This fact enables a quick mental determination of the direction of rotation of 
the tip vortices and the directions of the “induced” flow elsewhere in the field, given that the cross-stream field 
must look qualitatively like Figure 8.1.2. 

This kind of mental modeling makes it easy to remember design guidelines for VG installations, such as how 
counter-rotating pairs should be grouped (Figure 4.5.2b) to delay vortex lifting, and how co-rotating VGs on 
a swept wing should be toed trailing-edge inboard to counter the outboard-directed boundary-layer cross-flow 
upstream of separation, as discussed in Section 4.5.2. 


9.2.1.2 Reflection as a Model for the Effects of a Solid Surface 


An easy way to model the effects of a solid surface such as the ground near a body is to place a reflection of 
the body on the other side of the surface. For an airfoil in 2D, 1t may suffice to image only the bound vortex as 
we did in Section 7.4.9. For wings in 3D, we must include the trailing vortex wake as we did in Sections 8.2.4, 
8.3.9, and 8.5.1-2. 

But lift is not the only effect that can benefit from being modeled by reflection. The volume displacement 
effect of a streamlined body can be modeled by a distribution of sources and sinks, which can be reflected if 
there is a nearby solid surface. The flow around a low-slung streamlined ground vehicle is strongly influenced 
by the ground, and the shape of such a vehicle can generally benefit from being designed in the presence of its 
reflection. 


9.2.1.3 Flow Directions from Simple Sweep Theory 


A quick estimate of the flow direction at the edge of the boundary layer at some location on a swept wing can 
be made if you know the local pressure coefficient and the local sweep, that is, the sweep of the local constant- 
percent-chord line. Use isentropic relations to calculate the local velocity-magnitude ratio V/Uw. Then draw a 
vector diagram like Figure 8.6.3, using U|| = UosinA. On a wing with taper, simple sweep theory applied in this 
way is of course not exact, but it is a reasonable first estimate. 


9.2.1.4 Identifying Separation in 3D Flows 


Identifying separation in 3D is more difficult than in 2D because the skin-friction field in a 3D flow offers no 
purely local criterion corresponding to zero Cf at the separation point in 2D flow. In 3D, Cr is generally zero 
only at isolated nodal points of separation and is nonzero elsewhere on a line of 3D separation. Identifying sep- 
aration in 3D thus depends more on the skin-friction direction than on the magnitude. It requires knowledge of 
the skin-friction direction field, defined with enough spatial resolution so that the separation line can be identi- 
fied according to the region-of-origin concept described in Section 4.1.4 and illustrated in Figure 4.1.16. Some 
of the kinds of patterns that might have to be deciphered are illustrated in Figures 5.2.13 and 8.6.9. 


9.2.15 Flow over a Slope Discontinuity (Crease) or Curvature 
Discontinuity in a Body Contour 


Practical body shapes often have creases, either concave or convex, along which there is a discontinuity in the 
slope of the surface (or the direction of the surface normal). If the flow is parallel to the crease, the crease should 
have little effect except for changes in boundary-layer physics, which will be significant only if the crease is 
strongly concave or convex. But if the flow has a component perpendicular to the crease, there will be an effect 
on the pressure field. In the near field of the crease, the same assumptions behind simple sweep theory for a 
swept wing should hold, and the effect on the pressure should depend only on the perpendicular component of 
velocity. Then a good starting point for understanding the flow is the conformal-mapping solutions for the 2D 
potential flow past concave and convex corners illustrated in Figure 3.10.2b—d. These solutions are singular, 
with zero velocity and infinite velocity gradient if the corner is concave and infinite velocity if it 1s convex. 
The singularities are examples of the kinds of nonphysical anomalies that we discussed in Section 9.1.5. Of 
course such singularities don't materialize in the real world, but it is helpful to know that the inviscid version 
of the flow would see a large pressure disturbance at the crease. The boundary layer in a real flow tempers 
this behavior and smears out the pressure disturbance. If the angle change is large enough, the crease can cause 
boundary-layer separation, either at a convex corner, as in Figure 4.1.3c, or upstream (in the cross-flow sense) 
of a concave corner, in a situation analogous to Figure 4.1.3b (Imagine a crease at the bottom of the dip). 

A similar situation occurs when there is a curve in the surface along which there is a discontinuity in surface 
curvature, as when a flat (zero curvature) portion of the surface abruptly adjoins a curved portion. This situation 
also produces a singularity in potential flow, one at which the velocity gradient is logarithmically infinite. In 
real life, a viscous boundary layer smears out the discontinuity, but the tendency toward an almost step change 
in velocity can sometimes actually be seen. This is why good aerodynamic design usually targets continuous 
surface curvature as well as slope. 


9.2.1.6 A Source as the Far Field Effect of a Body with Viscous Drag 


Viscous dissipation in the boundary layer and near-wake of a body produces a total-pressure deficit in the wake 
flow, and there is thus a velocity deficit in the wake. In simple situations where the viscous wake is not part of 
the core of a strong vortex wake, the velocity deficit is relative to the adjacent outer inviscid flow, as in Figure 
6.1.3. In the wake vortex of a lifting wing, there is usually a jet in the core, as in Figure 8.1.9, and the velo- 
city deficit due to viscosity is then relative to a stronger jet that would be there in the absence of viscosity. In 
either case, the presence of this deficit means that some mass flux that would otherwise be present in the space 
occupied by the wake must be compensated by greater mass flux outside the wake. It's the same mechanism as 
the displacement effect of a boundary layer (Section 4.2.3), extended to include the displacement effect of the 
viscous wake downstream of the tail of the body. To model the effects of this displacement on the outer flow, 
we can construct an equivalent inviscid flow that has the same outer-flow behavior as the real flow by placing 
sources and sinks on the body surface. These would be positive (sources) where the displacement thickness 1s 
growing and negative (sinks) where it 1s decreasing, as when the near wake “necks down” just aft of the tail 
(see Figure 5.3.4). Outside the near field, the displacement thickness of a viscous wake is generally positive, 
reflecting the fact that the source effect (growth) has dominated over the sink effect and that the net effect is that 
of a source. Far from the body and outside the viscous wake, the net effect is as if a single source were placed 
on or near the body. 


9.2.1.7 A Vortex as the Far Field Effect of a Lifting Airfoil or Wing 


According to the Kutta-Joukowski theorem (Section 7.2), the lift of any high-aspect-ratio lifting surface must be 
accompanied by circulation and bound vorticity. In the far field, only the net vorticity (the total, integrated over 
the chord of the surface) matters. In 2D, the far field thus looks like uniform flow with a vortex superimposed, 
with the vortex usually assumed to be located at the quarter-chord as in Figure 7.2.1d. If there is viscous drag, 
there would also be a source component as described in the previous item. In 3D, the bound vortex cannot end 
at the wingtips, and a horseshoe vortex system, including a trailing vortex wake, must be assumed, as in Figure 
8.1.3. 


9.2.1.8 The Kinematics of Vorticity 


For thinking about the kinematics of flows with distributed vorticity, several ideas from Chapter 3 can be help- 
ful. 

The first applies very locally, and that 1s that the signature of vorticity in the deviations in velocity in the 
neighborhood of a point can be expressed as a solid-body rotation with angular velocity @/2 (Helmholtz's first 
theorem, see Section 3.3.5). If you've convinced yourself that there's no rotation, then you've probably shown 
that there's no vorticity. 

Since most of the flows we're interested in are at reasonably high Reynolds numbers, vorticity diffuses 
slowly, and Helmholtz's third and fourth theorems (see Section 3.8.3) provide helpful guidance, even though 
they apply strictly only to inviscid flow. The third tells us that a vortex tube tends to be convected with the flow 
and that the only migration of vorticity into or out of the tube will happen slowly through viscous diffusion. The 
fourth tells us that the strength of a vortex tube tends to remain nearly constant, changing only slowly through 
diffusion. 

In many examples of physical shear layers, vorticity is concentrated in a relatively thin sheet. An effective 
conceptual model for such flows is an ideal vortex sheet whose strength is equal to the velocity jump across the 
shear layer, as in Figure 3.3.8a. Remember that the velocity jump and the vorticity vector are perpendicular to 
each other, as in Figure 3.3.8c. 


9.2.2 Thinking about Viscous and Shock Drag 


Viscous drag generally has both a skin-friction part and a pressure part (the “viscous pressure drag”). For a 
change in the shape of a body to change the drag, 1t must change either one of these or both. The skin-friction 
part usually behaves in a straightforward way in that increasing the wetted area usually increases the skin-fric- 
tion drag. The viscous pressure drag is usually not so simple. In Section 6.1.6, we discussed how misleading it 
can be to think of the viscous pressure drag in terms of the local surface pressures. For thinking about viscous 
pressure drag, there are two key concepts to keep in mind: 
1. d'Alembert's paradox, that a body in inviscid flow without shocks or vortex shedding has zero drag. 
This is relevant because when the shape of a body is changed, much of the resulting change in the sur- 
face pressure distribution often originates in the outer inviscid flow, and unless there is a change in the 
lift distribution and induced drag, such changes inherently cause no change in pressure drag. For the vis- 
cous pressure drag to change, there must be a change in the displacement effect of the boundary layer and 
wake, which is difficult to estimate mentally. For intuitive purposes, it 1s often more helpful to think in 
terms of: 


2. The fact that the total viscous drag is proportional to the volume integral of the viscous dissipation rate 
in the field, as discussed in Section 6.1.4.6. So an increase in the total viscous drag requires an increase 
in the integrated dissipation rate. 


Thus if you want to predict or understand a change in the viscous drag, avoid thinking in terms of the surface 
pressure distribution and ask instead “where's the dissipation?” 

Likewise, thinking about shock drag in terms of the surface pressure distribution is risky, especially in tran- 
sonic flow. The pressure increments associated with shock drag are typically spread very diffusely over the body 
surface, and again it is more helpful to think in terms of dissipation in the field, 1n this case the Oswatitsch for- 
mula we discussed in Section 6.1.4.8. 


9.2.3 Thinking about Induced Drag 


Questions often arise as to how an increase in span, the addition of a tip device, or a change in spanload will 
affect induced drag. Thinking about such questions requires special care because the physics of induced drag 
is not easy to deal with intuitively and because there are such entrenched popular misconceptions about it. Not 
only is induced drag intuitively difficult, it 1s theoretically uncertain. In Section 6.1.3 we observed that induced 
drag cannot be defined precisely as a separate part of the drag except in terms of idealized approximate theories, 
and in Section 8.3.4 we noted that Trefftz-plane theory is the only practical option for inferring an approximate 
value for the induced drag from the geometry and spanload of a wing. Of course we can't do Trefftz-plane cal- 
culations in our heads, so for purposes of mental estimation we must rely on known results. Two rules of thumb 
that are easy to remember and provide a basis for useful estimating are: 

1. For fixed lift and airspeed, the induced drag of a planar wing goes as the inverse square of the span 

(Equation 8.3.1) and 


2. The effectiveness of a vertical winglet is the same as that of a horizontal span extension with about 
55% of the Trefftz-plane span of the winglet (Figure 8.4.9). 


These are ideal-induced-drag results, valid for spanloads optimized for minimum induced drag. Results for other 
tip configurations were shown in Figures 8.4.6 and 8.4.7. 

When you encounter claims regarding the induced-drag performance of a change to a wingtip configuration, 
you should always ask yourself, “are the claims consistent with the Trefftz-plane-theory rules of thumb? If not, 
is there a plausible reason for a significant deviation from the theory?” This is a high hurdle. No tip configura- 
tion I'm aware of has ever conclusively demonstrated performance better than what Trefftz-plane theory would 
predict. 


9.2.4 A Catalog of Fallacies 


9.2.4.1 One-Way Causation of Pressures or Forces 


According to Newton's second law, the direct cause of any acceleration, or change in the velocity vector, of a 
fluid parcel must be a net force applied to the parcel. In the effectively inviscid flow outside of boundary layers 
and wakes, a pressure gradient 1s the only source of such forces. Thus we can say that the pressure field causes 
the accelerations in the velocity field (though some of the explanations of lift we discussed in Section 7.3.1 
imply causation in the other direction). But as we noted in Section 3.5, it is a fallacy to see the causation as go- 
ing only one way. The cause-and-effect relationship between the pressure field and the velocity field is mutual, 


or circular. The pressure gradient causes the accelerations, and the accelerations sustain the pressure gradients. 
Likewise, the cause-and-effect relationship between the flow field and the integrated pressure force on a body 
is mutual. 


9.2.4.2 Vortices as Agents of Causation 


In Section 3.3.9, we discussed the Biot-Savart law, which allows us to infer part of the velocity field from the 
vorticity distribution. In the Biot-Savart law, the vorticity is the “input,” and the “induced velocity” is the “out- 
put,” which makes it all too easy to think of the vorticity at one location as “causing” the velocity at another. 
Such thinking is of course fallacious. There is no action-at-a-distance going on in ordinary fluid mechanics, and 
there is no way that vorticity can cause anything to happen remotely. 


9.2.4.3 Trying to Influence a Global Flow Field by Tinkering with Vorticity 
Locally 


This one is related to the previous item in that a vortex cannot physically influence what happens at another loc- 
ation. But there is more to it than that. Not only is it not possible to influence things remotely just by changing 
vorticity locally, it is not possible to significantly change the integrated vorticity in a local region by tinkering 
with the vorticity in a local streamtube. If you change the vorticity in one streamtube, compensating vorticity 
must appear adjacent to it, by kinematic necessity. This is why the erroneous wingtip device ideas discussed in 
Section 8.4.1 don't work. Another example, from Section 6.1.10, is that it's not possible for a propeller to “in- 
duce” circumferential velocities outside of its own slipstream. 


9.2.4.4 Surface “Streamlines” (Skin-Friction Lines) as Indicators of Flow 
Direction 


The surface “streamlines” visible in oil-flow photos like Figures 4.1.11 and 4.5.1b give the impression that they 
indicate the general direction of the flow over the part of the body in question. This impression is often mislead- 
ing. Flow direction can change drastically within a short distance off the surface in a 3D boundary layer. The oil 
streaks indicate only the direction of the surface shear stress. A short distance off the surface the flow direction 
can be very different. 


9.2.4.5 Zero Cf as the Universal On-the-Surface Signature of Separation 


In steady 2D flow, the skin-friction coefficient Cf must go through zero at a separation point. In 3D, however, 
the magnitude of C¢ is zero only at isolated singular points and is nonzero at typical points along a separation 
line. Identifying a separation line in 3D therefore requires looking at the global pattern of the skin-friction dir- 
ection field and applying the region-of-origin concept as described in Section 4.1.4. 


9.2.4.6 The Boundary-Layer Displacement Surface as an Effective “Solid 
Wall” 


In Section 4.2.3 we defined the concept of a boundary-layer displacement surface as the effective location of 
the surface for an equivalent outer inviscid flow matching the actual flow far from the body. However, the idea 
that a displacement surface represents an effective “solid wall” is correct only in a limited way. A displacement 


surface 1s defined by a particular boundary-layer flow field consistent with a particular outer flow. The displace- 
ment surface so defined represents an effective solid wall only for that particular flow. If something is done to 
change the flow situation, the original displacement surface is no longer an effective solid wall. For example, 
if the displacement surface is defined for the wall boundary layers of a wind tunnel with an empty test section, 
it represents an effective solid wall only when the test section is empty. It is not an effective solid wall for the 
flow around a model placed in the test section. 


9.2.4.7 Pressure is Constant across a Boundary Layer 


One of the assumptions of first-order boundary-layer theory (Section 4.2.1) is to ignore the pressure gradient 
normal to the surface and thus to assume that the pressure 1s constant in the normal direction. It is therefore 
natural to adopt this assumption into our intuition and to think of the pressure as constant across the thickness 
of a boundary layer. This is not so bad if the wall is flat, but it isn't very accurate if the wall has pronounced 
curvature. In the real world, there must be a normal pressure gradient even within the boundary layer, consistent 
with the local mean velocity and streamline curvature. 


9.2.4.8 The “Laminar Sublayer” 


In the older literature, the sublayer of a turbulent boundary layer was referred to as the “laminar sublayer,” and 
many commentators at the time apparently thought of the flow in the sublayer as actually being laminar. The 
flow in the sublayer is like laminar flow only in the sense that the Reynolds stress vanishes as the wall is ap- 
proached, and the shear stress becomes all viscous, as we discussed in Section 4.4.2. Otherwise, in terms of the 
presence of substantial velocity fluctuations, the flow in the sublayer is anything but laminar, as can be seen in 
flow visualizations such as Figure 2.1f and in DNS calculations. Thus it is much more accurate to refer to the 
sublayer as the viscous sublayer. 


9.2.4.9 Pressure Drag or Thrust of a Portion of a Complete Body 


We sometimes like to assess the aerodynamic performance of a vehicle by looking at the drag-producing and 
thrust-producing parts separately. Such an assessment is always ambiguous because it requires making an ar- 
bitrary decision as to how to divide up the surface of the vehicle. Furthermore, we found in Section 6.1.3 that 
dividing the surface of a complete (closed) body into portions raises serious questions about the meaning of 
the pressure “drag” or “thrust” on each portion. The integrated pressure force on a portion of a body generally 
contains “spurious” contributions that are not properly seen as either drag or thrust. 


9.2.4.10 Reducing Pressure Drag by Increasing Pressure on Aft-Facing 
Surfaces 


In Sections 6.1.6 and 9.2.2 we discussed how misleading it can be to think of pressure drag in terms of the local 
surface pressures on parts of the surface and to think that the pressures can be manipulated in an essentially 
inviscid way by design of the surface shapes so as to influence the pressure drag. These kinds of changes to the 
surface pressure distribution usually produce little or no change in the viscous pressure drag. 


9.2.4.11 Inviscid “Interference” Drag 


As we noted in Section 6.1.9, the mutual inviscid buoyancy effect that occurs when two bodies with volume 
are “flown” in proximity is not properly considered interference drag. In a shock-free inviscid flow, such effects 
would always add up to zero for the total configuration. The total drag can change only if one or more of the 
drag-producing mechanisms in the field is affected. Changes to induced drag and shock drag are the only inter- 
ference mechanisms that are effectively “inviscid” in the near field and can change total drag. 


9.2.4.12 Base Drag and Hoerner's Jet Pump 


In Section 6.1.7, we noted that the concept of “base drag” of bodies with squared-off tails is ambiguous at best. 
We also discussed Hoerner's correlation showing that the pressure on the blunt base of a body becomes less low 
as the body fineness ratio increases, and concluded that this is more likely a potential-flow effect than a result of 
Hoerner's proposed “‘jet-pump” mechanism in which the separating boundary layer “insulates” the wake from 
the jet-pump effect of the outer flow. 


9,2.4.13 Explanations of 2D Airfoil Lift 


There are many misconceptions in wide circulation in physical explanations of lift. These are discussed in detail 
in Section 7.3.1 and summarized in Section 7.3.3 under “popular misconceptions.” 


9.2.4.14 The “Dependence” of Induced Drag on Aspect Ratio 


In Sections 8.3.1 and 8.3.4, we found that in both lifting-line theory and Trefftz-plane theory the induced drag 
depends on the span of a wing, but not on the chord or area. However, when we nondimensionalize induced 
drag in the usual way, dividing by qS so as to be consistent with the way we nondimensionalize other forces, 
aspect ratio appears in the formula for the induce-drag coefficient. This gives the false impression that induced 
drag depends on aspect ratio. 


9.2.4.15 2D Biplane Lift Reduction as a Loss of “Efficiency” 


In Section 8.3.12, we discussed how biplane pairs of airfoils incur a loss of lift at constant angle of attack, but 
we found that the loss of sectional “efficiency” is not due to the lift loss but is due instead to an induced-thick- 
ness effect. The loss in maximum sectional L/D is much smaller than what would be inferred from the lift loss. 


9.2.4.16 Integrated Downward Momentum in the Trefftz-Plane “due to” 
the Trailing Vortices 


A classical analysis of the flux of vertical momentum in the Trefftz plane behind a lifting wing looked at it 
in terms of velocities “induced” by the trailing vortices, which were assumed to be infinitely long. The result, 
Equation 8.5.2, indicated an integrated downward momentum corresponding to the lift. However, this turns out 
to be wrong because the double integral that was evaluated is nonconvergent. Assuming the vortices are finite 
in length, and taking the limit as the length goes to infinity yields the correct answer of zero integrated vertical 
momentum “induced” by the trailing vortices. There is an integrated vertical momentum in planes behind the 
wing, but it is in the velocities “induced” by the bound and starting vortices, not the trailing vortices. Details, 
including the effects of a ground plane, are discussed in Section 8.5.3. 


9.2.4.17 An Expectation of High Accuracy in Calculations That Depend on 
Turbulence Modeling 


In Section 3.7, we discussed the nature of turbulence and the strategies we have available for modeling it in 
computational fluid dynamics (CFD) calculations. Given the complexity and non-locality of turbulent motions 
and the way that models must gloss over much of this complexity, we should not expect high accuracy in calcu- 
lations using turbulence models. 


Chapter 10 


Modeling Aerodynamic Flows in Computa- 
tional Fluid Dynamics 


Since the late 1960s, the capabilities of computers and computational fluid dynamics (CFD) codes, and our know- 
ledge of how to use them, have all grown tremendously. Over that time, CFD-based analysis, design, and optim- 
ization methods have revolutionized the practice of aerodynamics. 

In some applications (e.g., the design of the cruise configuration of a transport airplane), knowledgeable use 
of CFD can now routinely produce good designs that don't need further refinement in the wind tunnel, only veri- 
fication. For such applications, the role of wind-tunnel testing has changed from the screening of many candidate 
designs to the detailed evaluation of just a few. For applications where the geometry and/or the flow physics are 
more complicated (e.g., design of high-lift systems, or the analysis of off-design flight conditions), the wind tun- 
nel still plays a major role in design development, but it will diminish as CFD capabilities advance. 

CFD offers decided advantages over the wind tunnel in several ways. With CFD, we can explore a much larger 
number of design geometry variations than it is practical to test in the wind tunnel. And of course CFD can be run 
without interference from wind-tunnel walls or model supports. CFD can model full-scale Reynolds numbers, 
which in wind-tunnel testing 1s possible only in very expensive cryogenic pressure tunnels. With some additional 
computational effort and coupling with a structural model, CFD can simulate something closer to the real aer- 
oelastically deflected shape of a vehicle than is possible in the wind tunnel, where it is usually possible at best to 
match the shape at only one flight condition. Last but not least, CFD has design and optimization capabilities that 
are now essential to our design practices. 

With these advantages come costs, however. Defining the surface geometry and generating the grid for a cal- 
culation can represent a significant investment in engineering time. For 3D Reynolds-Averaged Navier-Stokes 
(RANS) calculations requiring high spatial resolution, computer time 1s still a significant cost, though it is di- 
minishing rapidly as computers and algorithms improve. Then there is the personnel cost: Effective use of CFD 
is a specialized skill that requires time and practice to develop. Effective users must often spend a good part of 
their careers learning the mechanics of lofting surface geometry, generating grids, and running codes as well as 
developing the judgment needed for effective use of the results. 

Judgment is needed because the simulation of reality that CFD can provide is usually far from perfect. A fun- 
damental physical limitation on accuracy comes from turbulence modeling, which we discussed in general in 
Section 3.7 and for swept wings in particular in Section 8.6.2. Inadequate grid resolution also often degrades ac- 
curacy. A user may inadvertently use a grid that is too coarse, or, in many cases the densest grid one can afford to 
use isn't dense enough to provide the resolution needed for an accurate simulation. 

The rapid advances in CFD capabilities in recent years have made it all too easy to forget that grid resolution 
can still be such a limitation. Capabilities that were on the cutting edge not long ago are now available at little 
cost. But no matter how much the capabilities advance, it seems that there are always applications of engineering 
interest that are at or beyond practical grid-resolution limits. Thus, for the foreseeable future, there will be a mar- 


ket for the most advanced codes available. Such capabilities at the cutting edge will always be costly and will 
be available only to those with deep pockets. 


These are some of the costs and limitations inherent in CFD. Something else we should be aware of 1s a major 
psychological factor that influences how CFD is used and how the results are perceived. This is the tendency 
toward an overly optimistic perception of the accuracy of CFD among the entire community of code developers, 
code users, and customers (those who use or rely on the results). Much of this is just basic human nature, as I 
know from my own experience. When we see any sort of theory that has a reasonable physical basis and predicts 
the right trends, we have a tendency to forget the limitations and give the theory more credence than it deserves. 
In general, resisting this tendency requires vigilance. In the case of CFD, however, the problem 1s more serious 
than it was for “simpler” theories, and we tend not to be vigilant enough. A major contributor to this problem is 
the digital nature of the CFD “medium.” CFD results are now almost universally viewed using advanced digital 
graphics, and the effects are powerful. These days it is common to see a complicated flowfield, predicted with 
all the right general features, and displayed in glorious detail that looks like the real thing. Results viewed in 
this way take on an air of authority out of proportion to their accuracy. In this regard, modern CFD is a very 
seductive thing. 

With this general overconfidence comes a tendency, usually unintentional, to downplay the limitations of 
CFD's physical fidelity, to show and publish biased samples of results, and thus to oversell the codes' capabil- 
ities. Overconfidence induces us to think that the best comparisons with experimental data are the most repres- 
entative and that the not-so-good comparisons probably reflect problems with things other than the code, such 
as shortcomings in the experiments. Of course experimental data are never perfect, but the CFD is at fault more 
often than its believers would like to think. The user community seems to be almost as prone to this kind of 
subtle self-deception as the developers are. 


This tendency toward overconfidence, combined with the “high-tech” aura surrounding CFD, leads to a kind 
of overuse, in the sense that CFD 1s often used in situations where other options would be more appropriate. 
In many cases where a simpler theory or even a simple desktop experiment would provide an answer with less 
effort, or even a better answer, there is a temptation to use CFD anyway. A similar syndrome is in play in the 
overuse of other expensive tools such as the wind tunnel: the urge to hit a problem with the biggest hammer 
available. The tendency is understandable. If something goes wrong later in the program, you're less likely to 
be criticized if you used the “best” tool available. 

So CFD is far from perfect, its capabilities are often overrated, and it can mislead us if we let it. But it 1s such 
a powerful tool that there is no way that the modern practice of aerodynamics could get along without it. In the 
rest of this chapter, we'll take a broad overview of CFD from a perspective similar to that of the rest of the book; 
that is, we'll focus on the basic physics and conceptual issues that CFD raises. We'll look at how CFD works, its 
strengths and weaknesses, and at some of the things users need to know to be effective. 


10.1 Basic Definitions 


¢ CFD: Using computer codes to generate numerical solutions to equations of motion for fluid flow, for 
purposes of making qualitative or quantitative predictions of flow behavior, or designing aerodynamic 
shapes that produce desired flow characteristics. 

¢ CFD analysis: Computing the flow field produced by a given aerodynamic shape at a given flow condi- 
tion. Outputs typically include flow quantities throughout the field, pressures and shear stresses at bound- 
ing surfaces, and integrated forces and moments on bounding surfaces. 


¢ CFD design or optimization: This is similar to analysis in that the same flow equations are solved, but 
different in that a new (“designed”) surface geometry is one of the outputs. Surface pressures or some 
other objectives, such as drag, are imposed, along with a starting (“seed’’) surface geometry. 

¢ Preprocessing: Generation of geometry definitions for surfaces and in some cases the generation of the 
computational grid. 

¢ Postprocessing: Generation of force and moment data for performance or loads analysis, and solution 
visualization (plots of quantities on surfaces or in the field). 


10.2 The Major Classes of CFD Codes and 
Their Applications 


Practically any fluid-flow model that is amenable to numerical solution can become the basis for a CFD code. 
From the full Navier-Stokes (NS) equations to the simplest lifting-line model, the possibilities run the gamut 
of the theoretical approximations summarized in Chapter 9. In this section, we'll look at the main categories of 
CFD codes and the applications for which they're currently suited. In the future, the range of things that can be 
considered practical will of course expand. 


10.2.1 Navier-Stokes Methods 


10.2.1.1 Direct Numerical Simulation (DNS) 


The full unsteady NS equations are solved without a turbulence model, generally for the purpose of studying 
some aspect of turbulence physics. Even if the mean flow is 2D, a 3D unsteady calculation is required because 
turbulence is always 3D and unsteady, and spatial and temporal resolution must be fine enough to resolve the 
smallest details of the turbulence. This kind of simulation is so computationally demanding that it is generally 
limited to incompressible flow, simple flow geometries, and Reynolds numbers much lower than those typical 
of practical applications. 

The typical Direct Numerical Simulation (DNS) calculation is still a major undertaking, and applications 
of DNS have been limited to fundamental physics studies. Turbulent boundary layers, wakes and other shear 
layers, and laminar-to-turbulent transition have been investigated in DNS studies. A DNS solution can provide 
much greater detail in a given flow than it 1s possible to record in an experiment. For some aspects of turbulence 
structure, especially for quantities like pressure-velocity correlations or vorticity fluctuations that are difficult 
or impossible to measure experimentally, DNS has provided much of our knowledge. As an example, Figure 
10.2.1 shows instantaneous isocontours of turbulence quantities in the wall region of a flat-plate boundary layer, 
calculated by Robinson, Kline, and Spalart (1988). Compare the streaky structure visible here with that of the 
experimental visualization in Figure 2.1f. 


Figure 10.2.1 An example of the kinds of turbulence physics that can be simulated in DNS calculations: in- 
stantaneous contours of constant u’ and u'v’ in a flat-plate boundary layer. Calculation by Robinson, Kline, and 
Spalart (1988). (Used with permission from Spalart) 
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10.2.1.2 Large-Eddy Simulation (LES) 


The full unsteady NS equations are solved so as to resolve the larger-scale motions of the turbulence that can 
be resolved by the computational grid, while the effects of sub-grid-scale motions are modeled by a turbu- 
lence model. Sophisticated algorithm technology is required to blend the calculation smoothly and unobtrus- 
ively between the resolved scales and the modeled effects (see Sagaut and Germano, 2002). Large-Eddy Simu- 
lation (LES) can handle much more complicated geometries and higher Reynolds numbers than DNS, and it is 
therefore used for some practical applications in addition to fundamental physics studies. Still, practical applic- 
ations are extremely demanding computationally, and LES is used only sparingly. An example of instantaneous 
quantities from a DNS solution was seen in the isovorticity surfaces in the wake of a circular cylinder in Figure 
o.3.26. 


10.2.1.3 Detached-Eddy Simulation (DES) 


This is similar to LES, except that the large-scale motions that are resolved by the solution are limited to separ- 
ated parts of the flow, and turbulence modeling is applied for everything else. The algorithm issues are discussed 
in Spalart (2009). Results of a Detached-Eddy Simulation (DES) calculation for the flow around a circular cyl- 
inder were shown in Figure 5.3.3. For practical applications, DES is much more economical than LES, but it is 
still very demanding computationally. DES is sparingly used in studies of noise generated by separated flow. 


10.2.1.4 Unsteady Reynolds-Averaged Navier-Stokes (URANS) 


The full unsteady NS equations are solved with a turbulence model with the intention of resolving unsteadiness 
only on length and time scales much longer that those associated with “turbulence.” The grids and time steps 


can therefore be much coarser than for either LES or DES, and the calculations are thus much less expensive. 
However, Unsteady Reynolds-Averaged Navier-Stokes (URANS) is still expensive enough that applications are 
much less common than those of RANS. URANS has obvious applicability in cases where flow unsteadiness 
is forced by changes 1n the onset flow or by unsteady motion of the body. It can also simulate some kinds of 
spontaneous unsteadiness that arise from within a flow with steady boundary conditions, as, for example, buf- 
feting associated with unsteady movement of separation, but this capability is limited. The damping imposed 
on such motions by the turbulence model is not generally realistic, with the result that the motions are not often 
realistically predicted. 


10.2.1.5 Reynolds-Averaged Navier-Stokes 


The full RANS equations with a turbulence model are solved with the intention of predicting a steady mean 
flow. The unsteady equations are usually used, and the solution is advanced in time from some initial state until 
a steady state is reached. Because only the final steady solution 1s of interest, not the time history, the solution 1s 
advanced in a nonphysical pseudo-time in which the time step varies form point to point in the domain, taking 
advantage of the fact that numerical stability allows longer time steps in some places than in others. Pseudo- 
time-stepping is generally combined with other convergence-acceleration schemes (see various articles in VKI, 
1995). This allows a steady solution to be reached much faster than it could be if time-accurate URANS were 
used. 

By their nature, RANS codes seek steady solutions to the equations. But what happens if the real flow under 
the conditions in question is unsteady on length and time scales longer than those of ordinary boundary-layer 
turbulence? That depends on the details of the particular solution algorithm. But before we look at what the 
codes actually do, let's speculate briefly about what we'd like them to do. 

Ideally we might think we'd like a code to predict the onset of real unsteadiness by failing to reach a steady 
solution. Even as an ideal, however, this is problematic: At what level of unsteadiness would we like the code to 
fail? Besides, practically speaking, we'd really like the code to continue giving usable results in situations where 
the real flow has significant unsteadiness, as, for example, at maximum lift of an airfoil or wing. How accurate 
or useful the results would be under such conditions would have to be evaluated (This kind of evaluation is part 
of what 1s referred to as “CFD validation,” which we'll discuss further in Section 10.5), but having a code “quit” 
early 1s a practical nuisance. 

Of course what we'd like the codes to do and what they actually do are two different things. The behavior of 
RANS codes in situations where the real flow is unsteady varies widely depending on the details of the solution 
algorithm and on the particular flow situation. It ranges from failing near the onset of real unsteadiness to plow- 
ing onward and getting steady solutions well past the onset. The Boeing GGNS code, for example, produces 
steady solutions for 2D airfoils at 90° angle of attack (see Allmaras et al., 2009). What such solutions mean, 
physically speaking, is an interesting question. 

A common assumption or option in many RANS codes is to solve Thin-Layer Navier-Stokes equations in 
which the viscous and turbulent diffusion terms have been omitted for one or two of the coordinate directions, 
as described in Table 9.1.2. This is akin to a boundary-layer-theory assumption in that diffusion is neglected in 
one or two directions parallel to viscous shear layers, but it is unlike first-order boundary-layer theory in that 
the pressure variation perpendicular to shear layers is still accounted for. In principle, the thin-layer assumption 
entails some loss of fidelity, but it can be offset by an improvement in numerical accuracy. Grids used 1n 3D cal- 
culations are often not fine enough to resolve the viscous terms in directions parallel to solid surfaces, with the 
result that when the full NS equations are solved, numerical dissipation produces exaggerated surface-parallel 
diffusion, leading to nonphysical effects. For example, some participants in a NASA drag-prediction workshop 


(see Mani et al., 2010) observed that a 3D separation in a wing-body junction was unrealistically large in their 
full NS solutions but not in their thin-layer solutions. 

RANS codes account for an overwhelming portion of all NS calculations in practical applications. The most 
natural applications are in analysis, in which the body geometry and onset flow conditions are given as inputs, 
and the flowfield quantities are outputs. RANS is also used in inverse design, where the objective is to determ- 
ine the body shape that produces a particular pressure distribution, and in optimization, in which there can be 
several kinds and combinations of objectives. But design and optimization are not the natural forte of RANS 
codes in their current form. 

There is currently no direct way to solve the inverse design problem in RANS. One way of attacking inverse 
design is by the ad hoc CDISC scheme, in which the body shape is updated based on a formula that uses sur- 
face curvature and the mismatch between the current pressure solution and the target (see Campbell and Smith, 
1987). It works after a fashion, but it is cumbersome. The other approach 1s called “modal” inverse design, 
which uses a number of shape-change modes whose amplitudes are to be determined, as we discussed in Section 
7.4.10 in connection with 2D airfoil design. Doing modal inverse design with a RANS code requires using an 
external optimizer to determine the modal amplitudes so as to minimize some measure of the deviation between 
the actual pressure distribution and the target. 

Doing other kinds of optimization (e.g., minimizing drag) using RANS codes is currently very expensive and 
time consuming. In the usual approach, an external optimizer is used to guide the optimization, and the RANS 
code is run many times to evaluate design candidates, thus providing function evaluations, usually a large num- 
ber of them in the course of a typical optimization. External optimizers can be of either the global type, which 
can in principle find a global optimum, or the gradient-based type, which can find only a local optimum but 
can home in on it very precisely. Sometimes both types are used 1n sequence, a global optimizer to identify the 
neighborhood of the global optimum, followed by a gradient-based optimizer to home in closely on the optim- 
um. LeDoux et al. (2004) describe a system that can use both types. 

Gradient-based optimizers of course require derivative information, that is, rates of change of the objective 
with respect to the design variables (shape-change modes, for example). If the RANS code is used only to 
provide function evaluations, these derivatives must be calculated by finite differences between flow solutions, 
and each derivative thus requires an additional flow solution. When finite-difference derivatives are used, ap- 
plications are typically limited to small numbers of design variables and single-point optimization (optimization 
at a single flow condition). An alternative approach for obtaining derivative information is to use a suitably 
modified version of a CFD code to solve an adjoint problem (see Elliot and Peraire, 1996). A single adjoint 
solution can provide derivatives for any number of design variables simultaneously, thus greatly reducing the 
number of flow solutions required. 

Most practical aerodynamic optimization problems really require multipoint optimization, 1n which the ob- 
jective function takes into account more than one operating condition (more than one lift coefficient or Mach 
number, for example). This is currently prohibitively expensive and time consuming to do in RANS with finite- 
difference derivatives, and 1s still quite expensive even if adjoint derivatives are used because a separate ad- 
joint solution is required for each operating condition. Most multipoint aerodynamic optimization is currently 
done with coupled viscous/inviscid codes (Section 10.2.2), which incur much lower expense and flow time than 
RANS. 


10.2.2 Coupled Viscous/Inviscid Methods 


Coupled viscous/inviscid codes take advantage of the basic inner/outer structure of many high-Reynolds flows 
that we discussed in Section 3.6 and in the introduction to Chapter 4. The boundary-layer equations are used in 


the inner region, inviscid equations in the outer region, and the two solutions are matched through the displace- 
ment thickness or surface transpiration, as discussed in Section 4.2.3. Such methods are nearly always limited 
to steady flows and have been implemented in both 2D and 3D. 

The main advantage of such methods is that they require roughly an order of magnitude less computing than 
RANS. They are thus more practical than RANS for multipoint aerodynamic optimization or modal inverse 
design. A further advantage of viscous/inviscid coupling 1s that the inviscid portion of the method can represent 
small changes in surface shape by transpiration from the surface instead of through the modification of the grid 
that is required in RANS methods. TRANAIR (Melvin et a/., 1999) uses this approach to reduce the number of 
times the grid must be modified and thus to speed up modal design and optimization in 3D. The 2D airfoil codes 
XFOIL and MSES (Drela, 1989, 1993) do not use transpiration, but they do provide the option of solving the 
inverse-design problem directly, that 1s, defining the new surface shape grid point-by-grid point and not relying 
on shape-change modes. 

For flows that are attached all the way to the rear of a body or have only shallow separated regions, the 
physical fidelity of coupled viscous/inviscid codes is comparable to that of RANS. Coupled viscous/inviscid 
codes typically fail to converge for flows with massive separation, but RANS also has problems in such flows. 
Because of the limitations of the boundary-layer equations, coupled viscous/inviscid codes miss viscous-flow 
details such as junction flows (e.g., necklace-vortex separation). 

The inviscid component of a coupled code can be any method that models the body geometry with sufficient 
fidelity and produces a sufficiently realistic and smooth prediction of the pressure or velocity distribution on the 
surface (an Euler, full-potential, or panel method, as discussed in Section 10.2.3). The boundary-layer compon- 
ent can be either an integral method or a field (differential) method. In earlier codes, coupling was often just 
sequential iteration back and forth between an inviscid module and a boundary-layer module, usually with some 
under-relaxation of the displacement thickness or surface transpiration to reduce oscillations. Convergence was 
not very robust, and flows with separation were typically off limits. More-recent codes solve the inviscid and 
boundary-layer equations in more simultaneous ways and use convergence-acceleration schemes such as New- 
ton's method (see Drela and Giles, 1986). These newer methods can often converge for flows with some separ- 
ation. 

Coupled viscous/inviscid codes are available for a variety of 2D and 3D applications. For low-speed airfoils 
XFOIL (Drela, 1989) is probably the most versatile in that it handles laminar separation bubbles with reason- 
able accuracy. For single-element and multielement airfoils and transonic flow, MSES (Drela, 1993) offers a 
full range of analysis, design, and multipoint optimization options. For 3D subsonic, transonic, and superson- 
ic flow (potential-flow plus integral boundary layer), TRANAIR (Melvin et al., 1999) handles analysis and a 
broad range of design and optimization applications. 


10.2.3 Inviscid Methods 


Codes for inviscid flows cover a wide range in physical fidelity from the full Euler equations to incompressible 
potential flow with small perturbations. At any of these levels, the flow can be steady or unsteady, though codes 
for steady flow are by far the most common. Inviscid methods by themselves generally apply only to flows 
for which neglecting viscous effects is justified, and many entail further restrictions, as we'll see next. Inviscid 
codes encompass field methods for the nonlinear full-potential or Euler equations, and panel methods for linear 
potential flow. 


10.2.3.1 Euler Methods 


The Euler equations are just the NS equations with the viscous and heat-conduction terms omitted. The equa- 
tions apply to flows at any Mach number, as long as the equation of state that 1s used remains valid, and viscous 
and heat-conduction effects are negligible. 


Discontinuities (shocks and slip surfaces) can appear in solutions to the Euler equations. In numerical solu- 
tions, these are generally “captured” by the numerical method and are therefore “smeared” over distances of at 
least several grid cells. Shocks numerically captured in this way tend to approximate the “real” discontinuous 
ones that would occur in exact solutions to the equations. Smeared slip surfaces tend to emanate from sharp 
edges, where the artificial dissipation associated with the numerical method causes the flow to separate much 
like a viscous flow would. In effect, artificial dissipation enforces something like the Kutta condition that we 
discussed in connection with the potential flow around airfoils in Section 7.1. The smeared slip surface that 
emanates from a sharp edge is thus similar to a real physical shear layer, except that its diffusion (smearing) 1s 
artificial and not realistically predicted. We'll consider these issues further in Section 10.4.3. 

The Euler equations are applicable to flows with or without vorticity, but given the lack of viscosity, there 
are only limited ways that vorticity can arise. It must either be fed into the domain through the inflow boundary 
conditions, be generated by passage of the flow through a shock of nonuniform strength, or result from “separ- 
ation” from a sharp edge. The equations themselves have no mechanism for vorticity to diffuse, only for it to be 
convected with the flow, but in numerical solutions there will be some diffusion due to artificial dissipation. 


A major advantage of Euler over RANS is that Euler doesn't need to resolve boundary layers and therefore 
doesn't require the dense grids that RANS requires in such regions. Grid generation is therefore easier, and 
computing times are much shorter. However, Euler methods by themselves have limited applications. Transonic 
airfoil and wing flows, for example, are quite sensitive to the displacement effect of the boundary layer, and ac- 
curate simulation requires RANS or coupled viscous/inviscid methods. In Figure 7.4.29, we saw an example of 
the large effects viscosity can have on the pressure distribution and shock location on a transonic airfoil. On the 
other hand, in some supersonic flows the boundary layer remains much thinner than in typical transonic flows, 
and thus supersonic flows can be reasonable applications for Euler methods. Some flows involving separation 
from sharp edges, such as the leading-edge-vortex flow over a delta wing, can also be appropriate applications 
for Euler, though assuring adequate grid resolution of the separated shear layer is not always easy. 


10.2.3.2 Nonlinear Potential (Full-Potential) Methods 


In the full-potential equation, the velocity is represented as the gradient of a potential, as it is in incompressible 
potential flow, but the “full” equation is nonlinear because it effectively incorporates the “exact” 1D is entropic 
relation (Equation 3.11.3) for the density as a function of the local Mach number (see Garabedian and Korn, 
1971). Solutions therefore cannot be constructed by superposition as in the panel methods, and a field grid is 
required. 

Slip surfaces cannot be “captured” as they are in Euler solutions, so that to model the vortex wake of a 3D 
wing in full-potential, there must be a grid surface behind the wing, on which a wake boundary condition is ap- 
plied. We'll discuss what this means in terms of the numerics and physical modeling in Sections 10.3 and 10.4. 
Shocks can be captured in full-potential as they are in Euler. The pressure jump across a shock is reasonably 
predicted, as long as the shock is not too strong, but the shock drag imposed on the body surface is erroneous, 
and shock drag must be determined by means of the Oswatitsch formula applied in the field. We'll discuss this 
issue in some detail in Section 10.4.1. 

Full-potential methods are applicable to subsonic, transonic, and supersonic flows up to low supersonic Mach 
numbers. Strong shocks are not accurately represented, but the weak shocks typical of many transonic and su- 


personic cruise applications are adequately predicted. Like Euler methods, full-potential methods by themselves 
have only limited applications. In many of the applications where we might like to use such methods, we really 
should include viscous effects, either through a coupled viscous/inviscid code or RANS. 


10.2.3.3 Linear Potential (Panel) Methods 


Panel methods solve either the incompressible potential-flow equation (Laplace's equation 3.10.2) or one of the 
versions applicable to compressible flow with small disturbances (see Liepmann and Roshko, 1957, Section 
10.2). Because the potential equation is linear, solutions can be constructed by superposition. In panel methods, 
the basic solutions that are superimposed are those for flows around elementary singularities that are generally 
distributed on panels placed on the contours of the body and on other surfaces where the flow simulation re- 
quires a discontinuity 1n the inviscid flowfield. The singularities automatically satisfy the potential equation, so 
the equation itself isn't explicitly used. Instead, a surface integral equation that represents the boundary condi- 
tions is solved for the strengths of the singularities. No field grid 1s required. This is an example of a general 
class of methods referred to as boundary integral methods. 

Unlike Euler and full-potential, panel methods do not allow shocks. Slip surfaces are allowed and can be 
used to model shear layers such as boundaries of propeller slipstreams or vortex wakes of lifting surfaces in 3D. 
Propeller discs can also be modeled in an average sense (averaged over the disc, ignoring the discrete propeller 
blades) by surfaces of discontinuity that produce the appropriate velocity jumps. When surfaces of discontinuity 
divide the flow domain into regions with different total-pressure, as in the case of a propeller disc and slipstream 
boundary, special care is needed to ensure that the formulation 1s consistent. All such surfaces of discontinuity 
must be paneled, so that the velocity jumps across them can be “produced” by the appropriate singularities. 

Of course a “real” slip surface is a stream surface across which there is no flow, and to model one with a 
paneled surface ideally requires solving for the location and shape of the paneled surface along with the distri- 
bution of singularity strength, which is a nonlinear problem. Some 3D panel codes have the option to determine 
the shapes of slip surfaces by iteration in a process called “wake relaxation,” but the process rarely converges 
and must usually be truncated after a few iterations. One reason for this 1s that vortex wakes tend to roll up at 
their edges, and when they are modeled as thin sheets they form such tight coils that no reasonable paneling can 
resolve them. 

Given the difficulty of iterating for real wake shapes, most panel calculations are done with fixed wakes that 
are fixed in space and must therefore generally allow flow through them. What this means to the physical mod- 
eling 1s discussed in Section 10.4.3. 

Historically, panel methods provided the first practical CFD capabilities for complex 3D configurations, and 
they are still useful for some applications, such as providing rough estimates of the effectiveness of high-lift 
systems. But linear compressibility has a very limited range of applicability, and, as with Euler and full-poten- 
tial, panel methods without viscous coupling have limited applications. 


10.2.3.4 Lifting-Surface Methods 


Lifting-surface methods are essentially 3D panel methods that model only the camber lines of lifting surfaces, 
not the thickness. Vortex wakes must of course be paneled, just as in general panel methods. 

Relative to general panel methods, physical fidelity and range of applicability are reduced. In return, lifting- 
surface methods provide ease of geometry definition, and short computing times. Versatility is also a strong suit, 
as many lifting-surface codes have options for inverse design (designing the camber shape to produce a given 
load distribution) and optimization, in addition to analysis. Because they don't model the thickness of lifting 


surfaces, and therefore predict only load distributions, not realistic surface-pressure distributions, lifting-surface 
methods are not suitable for viscous/inviscid coupling. 


10.2.4 Standalone Boundary-Layer Codes 


In the early days of boundary-layer CFD in the 1960s and early 1970s, most boundary-layer calculations were 
done in a “one-shot” mode. The outer-flow boundary conditions for the boundary layer were taken from a single 
inviscid calculation (usually potential) or from pressures measured in an experiment, and the boundary-layer 
code was run once, without coupling to account for the effect of the boundary layer on the outer flow. A stan- 
dalone boundary-layer code was then a multipurpose tool that could take inputs from a variety of sources. 

Now, most viscous-flow CFD is done with coupled viscous/inviscid coupling codes or with RANS, and there 
are fewer applications for standalone boundary-layer calculations. One of the few remaining applications is 
to provide the mean-velocity field (boundary-layer velocity profiles) for laminar-flow stability calculations, as 
done, for example, by the eMalik code (Malik, 1992). 


10.3 Basic Characteristics of Numerical 
Solution Schemes 


There are many different kinds of numerical methods in use in CFD codes. In this section, we'll look at some of 
their basic characteristics, concentrating more on the operational and practical issues than the mathematical. 


10.3.1 Discretization 


The flow problems we wish to solve with CFD methods are continuous problems in that they involve flow 
quantities and boundary conditions that are at least piecewise continuous. To attack such problems numerically 
requires discretizing the spatial domain in some way such that solution quantities need to be determined at only 
a finite number of locations. Here, we'll look very briefly at some of the ways that a continuous flow problem 
can be converted into a discrete problem that can be solved numerically. The objective in such methods is gen- 
erally to define the discrete problem such that its solution spatially converges to a solution of the continuous 
problem as the discretization is refined. We'll discuss such convergence further in Section 10.3.3. 

In panel methods for linear potential equations (Section 10.2.3), a grid of panels 1s defined on all body sur- 
faces exposed to the flow and on other surfaces where the flow field is required to be discontinuous, such as 
vortex wakes. Solution quantities defined on these panels are parameters defining the distribution of singularity 
strength to be solved for. The number of parameters per panel depends on the order of the mathematical repres- 
entation of the singularity distribution. When there is more than one parameter per panel, continuity conditions 
between panels constrain the parameters so that there is only one independent parameter to be determined per 
panel. 

The boundary conditions in a panel method are evaluated and imposed at a single control point on each panel. 
For a non-through-flow condition on a solid surface, for example, the component of velocity in the direction of 
the panel normal vector is evaluated at the control point. The boundary value that is to be constrained on each 
panel is influenced by the singularity parameters on all the panels through a matrix of influence coefficients. 


Imposing the boundary conditions on all the panels results in a set of linear algebraic equations to be solved for 
the unknown independent singularity parameters. In principle, no iteration is required because the problem is 
linear, but for large numbers of unknowns, an iterative method of solving the system is often the fastest. Once 
the values of the singularity parameters are known, other influence coefficients are used to calculate flow out- 
puts such as velocity vectors and pressures at the control points. Calculating flow quantities at points other than 
the control points requires calculating influence coefficients for each point where the solution is to be evaluated. 

In methods for any of the nonlinear field equations (NS, Euler, full-potential, or boundary-layer), the spatial 
domain must be filled with a grid consisting either of points or of cells and their bounding surfaces. In cell-based 
schemes, the flow variables can be defined and stored at either the cell centers or the vertices. The equations 
can be used in differential form (finite-difference methods) or control-volume form (finite-volume methods), 
and within each of these categories there are a variety of ways the equations can be discretized. Different meth- 
ods represent flow quantities to different orders and converge to the exact solution at different rates with grid 
refinement. We'll discuss grids and spatial convergence further in Sections 10.3.2 and 10.3.3. 


10.3.2 Spatial Field Grids 


A spatial field grid for solving nonlinear equations typically consists of a space-filling volume grid, and a grid 
surface that lies on the surface of the body, the surface grid. 

Grids can be either structured or unstructured. The simplest kind of structured grid is one that can be morph- 
ed into a rectangular mesh with ordered rows, columns, and planes (in 3D) of points or cells. Such a grid can be 
thought of as defining a curvilinear coordinate system in which locations are defined by their coordinates a in 
grid parameter space. Figure 10.3.la shows a 2D example, a grid wrapped around an airfoil in an arrangement 
called a C-grid, with grid parameters denoted by s and t. An unstructured grid usually consists of triangular or 
tetrahedral cells not arranged in ordered rows, as shown in Figure 10.3.1b for a 2D airfoil. An unstructured grid 
doesn't have a multidimensional grid parameter space. 


Figure 10.3.1 Examples of structured and unstructured grids for a 2D airfoil. (a) A single block of structured 
grid wrapped around the airfoil in an arrangement called a C-grid. (b) An unstructured grid of triangular ele- 
ments around the same airfoil. 
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A grid can consist of a single block, as in Figure 10.3.1, or an arrangement of multiple blocks. The blocks of 
a multiblock grid can cover the spatial domain in a non-overlapping fashion, in which each point in the domain 
is included in only one grid block, and adjacent blocks abut each other precisely, as in Figure 10.3.2a. An altern- 
ative is that grid blocks can overlap, as in Figure 10.3.2b. In the case of abutting grid blocks, special conditions 
must be applied across block boundaries to ensure proper continuity of the flow solution. In overlapping grids, 
continuity of the solution 1s enforced by interpolation algorithms that can be quite complicated to implement. 


Figure 10.3.2 Examples of multiblock grids. (a) Non-overlapping. This is an extreme blowup of a 2D grid 
around the blunt trailing edge of an airfoil, where a separate grid block has been used to resolve the region be- 
hind the trailing edge. (Plot by N. J. Yu.) (b) Overlapping. This view shows two grid surfaces extending down- 
ward from the surface of a wing in a 3D volume grid. The one with the straight upper edge is part of the gener- 
al grid around just the wing and abuts the wing lower surface, and the one with the curved boundary is part of 
a collar grid to resolve a small fairing protruding from the wing surface. (Plot by E. R. Setiawan.) 
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Structured grids can be morphed around body contours in many different ways, which raises interesting 1s- 
sues of grid topology. A sharp corner or crease in the body contour 1s called a lost corner if part of a single grid 
block wraps around it as in Figure 10.3.3a. Solution accuracy tends to suffer in the neighborhood of a lost corner 
because of the high distortion of the grid. A physical corner is more naturally represented if the grid parameter 
space also has a corner, so that the surfaces on opposite sides of the corner represent different directions in para- 
meter space. For a concave corner in the computational domain, this can be done with the corner of a single grid 
block, as in Figure 10.3.3b. A convex corner must generally be represented by a corner between grid blocks, as 
in Figure 10.3.3. 


Figure 10.3.3 Grid topology at a sharp corner of a boundary. (a) A lost corner where part of a single grid block 
wraps around. (b) A corner of a single grid block fitted to a concave corner. (c) A corner between grid blocks 
fitted to a convex corner. 





(b) (c) 


Even with the flexibility afforded by multiple grid blocks, it is not always easy to avoid lost corners, and they 
are tolerated in many multiblock grids. For example, the H-grid topology is often used on wings and airfoils, 
with lost corners at the leading and trailing edges, as in Figure 10.3.4. For complex configurations, especially in 
3D, overlapping grids and unstructured grids offer great flexibility and are seeing increasing use. 


Figure 10.3.4 An H-grid for an airfoil, with a 90° lost corner at the leading edge and a subtle lost corner at the 
trailing edge 
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When a viscous boundary layer is part of the flow solution, the grid must be able to adequately resolve the 
details of the boundary-layer velocity profile. For a turbulent boundary layer, this requires resolving the viscous 
sublayer. In practice, codes often provide an assessment of how well this requirement was satisfied after the 
calculation is completed, by displaying the thickness of the first grid interval off the wall in turbulent-boundary- 
layer wall units (units of y" as defined by Equation 4.4.4). A good value for this thickness is in the neighbor- 
hood of one or two. This of course means having a normal-direction grid spacing at the wall that is very small 
compared to the boundary-layer thickness, which has two serious consequences, if unreasonably large numbers 
of grid points are to be avoided: 

1. The spacing along the surface must be much larger than the near-wall normal-direction spacing, mak- 
ing for grid cells with very high aspect ratios, which can degrade accuracy or cause numerical instability. 


2. The normal-direction spacing must stretch away from the surface, quickly increasing the spacing out- 
side the viscous sublayer. Strong stretching also degrades accuracy. 


An appreciation for the typical velocity gradient that must be resolved at the bottom of a turbulent boundary 
layer can be gotten from the velocity profiles plotted in Figure 4.4.5. And the situation becomes increasingly 
difficult as Reynolds number increases. A typical grid for resolving a turbulent boundary layer is illustrated in 


Figure 10.3.5. 


Figure 10.3.5 Typical grid for resolving a turbulent boundary layer, including the viscous sublayer. This is a 
crop of a small portion of the airfoil C-grid in Figure. 10.3.1la 





Recall that in panel methods for potential flow the paneled geometry has to include known shear layers such 
as the vortex wakes of wings and the boundaries of propeller slipstreams. The same requirement applies to 
methods for the nonlinear full-potential equation: the “surface grid” must include surfaces in the field where any 
discontinuity, such as a vortex wake, is to appear, and the flow solver must be set up to apply the appropriate 
boundary condition there. 


10.3.3 Grid Resolution and Grid Convergence 


Recall from Section 10.3.1 that our objective is to have the solution to the discrete problem spatially converge 
to a solution of the continuous problem as the discretization is refined. The discrepancy between a discrete solu- 
tion and a continuous solution is called the spatial truncation error, and it should decrease as the grid 1s refined. 
In the limit of small grid spacing, the truncation error in principle should vanish in proportion to the grid spacing 
to some power called the order of convergence, which can generally be deduced by an analysis of the formu- 
las used in discretizing the equations. I said “in principle” because in practice a given computer carries only so 
many digits in its arithmetic calculations, and before spatial truncation error reaches zero, numerical round-off 
error Will begin to increase. 

In practice, we don't generally know the continuous solution (if we did, we wouldn't need to be carrying out 
the discrete calculation), and we therefore don't know the actual spatial truncation error. But we can estimate it, 
based on the grid spacing and the gradients in the discrete solution. Then there are various ways of averaging 
the estimated error to define different kinds of error norms. A rapidly developing technology called adaptive 
gridding uses such information to automatically and selectively refine the grid and to achieve a minimum level 
of error for a given number of grid points. 


Whether or not adaptive gridding is used, there are practical limits to the grid resolution that can be used, 
imposed sometimes by the amount of computer memory available, but more often by the amount of computing 
time one is willing or can afford to spend on the solution. In practice, the degree of spatial convergence that is 
achieved by the finest grid the user decides to use varies greatly, depending on the details of the flow solution 
and on the user's skill in setting the problem up, particularly with regard to gridding. 

Following the better known gridding rules doesn't always guarantee a level of accuracy the user would like 
to see. For example, meeting the requirement for a small value of y" at the first grid point off the wall doesn't 
guarantee that the grid is adequate to resolve the whole boundary layer. In many grids, the stretching factor 
away from the wall is so severe that the grid becomes too coarse to resolve the outer part of the boundary layer 
accurately. Providing grid to adequately resolve wakes and other shear layers off of the body is difficult when 
their locations are not known a priori. The same thing applies to the resolution of shocks. Viscous effects that 
involve flow gradients in directions other than surface-normal can also be a source of difficulty, as grid spacings 
in these other directions are often chosen with only inviscid effects in mind, something we'll touch on again in 
Section 10.4.2. These are all areas in which robust adaptive gridding could be very helpful. 


10.3.4 Solving the Equations, and Iterative Convergence 


Solving the equations in a CFD code involves different processes depending on the equations and the particular 
solution scheme: 
1. Panel methods for potential flow involve the solution of a set of linear algebraic equations, which in 
principle requires no iteration, only the inversion of a matrix. 


2. In unsteady flow problems, the solution must be marched forward in time from a known initial condi- 
tion. At each new time step, the solution at each point in the grid is calculated based on values on a small 
stencil of surrounding points at one or more previous time steps. 


3. Nonlinear steady-flow problems are generally solved by time marching, line relaxation, or global re- 
laxation with Newton's method. All of these involve repetition, often generically referred to as iteration. 
Time-marching schemes use the unsteady equations and march the solution forward in time until it settles 
to a steady state. Line-relaxation methods typically start from an initial guess and then repeatedly sweep 
through the grid line-by-line, improving the solution iteration-by-iteration until it satisfies the discrete 
equations to within some tolerance. Global relaxation schemes use Newton's method to update the solu- 
tion for the entire grid at once, a step that generally requires inverting a matrix. 


4. Viscous/inviscid coupling methods in the early days iterated back and forth between the inviscid and 
viscous flow analyses. In newer methods, the coupling is done simultaneously with the iteration for the 
nonlinear flow solution, and Newton's method 1s often used to update the solution at each iteration. 


Time-marching methods use numerical schemes that are either explicit or implicit. In explicit schemes, the 
solution at each new time step is explicitly calculated point-by-point in the spatial grid, with the spatial deriv- 
atives in the equations calculated from values at neighboring points that are already known from previous time 
steps. In implicit schemes, the spatial derivatives are represented in terms of values at the new time step. Thus 
in an implicit method, the new solution at each point depends on a set of implicit relations that must be solved 
simultaneously, either by matrix inversion or iteration. Explicit methods require less computation per time step 
than implicit methods but have the disadvantage that numerical stability imposes limits on the size of the time 
step that can be taken. For implicit methods, on the other hand, there is no stability limit on the time step. 

The maximum allowable time step for an explicit method depends on local conditions in the grid and gen- 
erally varies from point to point. For overall stability in an explicit time-accurate unsteady-flow calculation, 
the time step used globally must not exceed the shortest of the local maximum time steps. When only a final 


steady-flow solution is sought, the solution need not be time-accurate, and nonphysical pseudo-time marching 
is often used, in which the time step varies from point to point according the local stability criterion, rather than 
the global one. The time history in such a calculation is not realistic, but the same final steady state 1s reached, 
generally in far fewer time steps than a time-accurate calculation (a calculation with constant global time steps) 
would have required. 

Settling down to a final steady-state solution, whether by time marching, pseudo-time marching, or relaxa- 
tion, is called convergence. Of course, the changes in solution quantities from one iteration to the next decrease 
as convergence 1s approached, but small changes are not a reliable indicator of convergence. The real meas- 
ure of convergence is how well the discrete equations for the steady problem are satisfied. Take any one of the 
steady discrete equations and arrange it with all the terms on one side. You now have an expression whose value 
should be zero for a solution to the steady problem. At any given point in the grid, at any given iteration of the 
solution procedure, the discrepancy from zero is called the residual. The goal of the iterative solution process 1s 
to drive the residuals to zero for all of the equations for every point in the grid. 

How well real CFD codes meet this goal in practice varies widely. Convergence is typically assessed accord- 
ing to the maximum residual or an average, usually rms. Many of the earlier RANS codes can reduce the av- 
erage residual only 3 or 4 orders of magnitude, at which point they “stagnate.” (The residual stops decreasing.) 
The better new codes can drive the residual to machine zero. Remember, however, that no matter how low the 
residual is, the residual measures only how well the discrete equations are satisfied. A low residual says noth- 
ing about how well the discrete solution approximates the continuous solution (1.e., a low residual says nothing 
about the spatial truncation error). 


10.4 Physical Modeling in CFD 


How well a CFD code models the physics of a particular flow depends on the applicability of the equations 
and boundary conditions to the flow in question. In this section, we'll address some of these physical modeling 
issues. 


10.4.1 Compressibility and Shocks 


As we pointed out in Section 3.11, there are small-disturbance regimes, both subsonic and supersonic, but not 
transonic, in which the effects of compressibility can be linearized. However, this kind of treatment of com- 
pressibility has very limited ranges of applicability and is no longer used much. Predicting real compressibility 
effects in most applications requires nonlinear equations: NS, Euler, or full-potential. 

In transonic and supersonic flows, shocks nearly always appear. In principle, a solution to the NS equations 
predicts the correct jump conditions across a shock and nearly the correct shock thickness, but the detailed dis- 
tributions across the shock are not accurate because the simple representation of the viscosity in the NS formu- 
lation is inadequate, as we saw in Section 3.1. In practice, this is never an issue because grids are never fine 
enough to produce spatially converged solutions through shocks. With practical grid densities, there is always 
numerical dissipation such that a shock is smeared over at least several grid intervals and is therefore orders of 
magnitude thicker than a real shock. Fortunately, the jump conditions across the shock can still be accurately 
predicted because they are independent of the details of the dissipative process, whether real or numerical. Of 
course the jump conditions lose accuracy if the degree of artificial shock smearing is such that the smearing 
encroaches significantly on flow gradients upstream or downstream. 


Shocks can also be approximated in numerical solutions to inviscid equations, either by shock fitting or shock 
capturing. Shock fitting explicitly addresses the mathematical problem of representing shocks as discontinuities 
in solutions to multidimensional inviscid equations: The geometry of the surface of discontinuity is explicitly 
determined as part of the solution process, and the appropriate jump conditions are explicitly imposed. Shock 
capturing avoids the problem of having to explicitly define the shock surface. Given the right kind of numer- 
ical dissipation, a smeared approximation to the shock forms naturally within the numerical grid. Although the 
smeared structure is nonphysical, the conservation properties embodied in the equations result in the correct 
jump conditions across the shock without requiring explicit imposition of jump conditions. As with a shock in 
an NS solution, the jump conditions lose accuracy if the degree of shock smearing is such that the shock en- 
croaches significantly on flow gradients upstream or downstream. Shock capturing is by far the more common 
of the two strategies because it is much simpler to implement, especially in 3D. 

When a shock appears in an Euler solution, artificial dissipation produces the same loss of total-pressure 
as would occur in a real shock or a numerically smeared NS shock. In the full-potential equations, the flow 
is assumed irrotational, and the vorticity and loss of total pressure that appear downstream of a real shock are 
not allowed. As a result, when a shock is modeled as a discontinuity or captured in the grid in a potential-flow 
solution, it is not possible to enforce all of the conservation laws across the shock. Either conservation of mass 
or conservation of momentum, or a combination of the two, must be waived. Most full-potential methods use 
differencing schemes that capture shocks automatically in such a way that mass is conserved, and conservation 
of momentum is waived. Such differencing schemes are referred to as conservative. The spurious momentum 
produced at a shock in a conservative scheme is mirrored in a nonzero contribution to the pressure drag on the 
body. In the weak-shock limit, this pressure drag is equal to the “real”? shock drag that would have occurred in 
a solution to the Euler equations, but as shock strength increases, 1t becomes seriously in error. A formula that 
quantifies this error was derived by Steger and Baldwin (1971). 

The error is significant for anything but the very weakest of shocks. Thus a reasonable prediction of shock 
drag in a potential-flow method requires an approach other than surface-pressure integration. The only real 
choice is to identify streamtubes that pass through the shock in the flowfield, apply either the Oswatitsch for- 
mula (Equation 6.1.11) or Tognaccini's (2003) higher order formula to each one, and to integrate (sum) the res- 
ults. This is not a simple thing to do, and algorithms that can do it robustly can be quite complicated. It requires 
accurately determining the location and the upstream flow condition in each streamtube where the flow enters 
the shock, even though the shock is smeared in the streamwise direction over several grid cells. And the up- 
stream Mach number that is required is the shock-perpendicular Mach number, which means that the direction 
of the shock normal must be accurately determined. The shock location on adjacent streamtubes often doesn't 
define a smooth enough “shock surface” to support an accurate determination of the normal direction. Instead, 
the shock normal direction is usually assumed to coincide with the direction of the pressure-gradient vector, 
which is usually well-behaved. 


Heat-transfer effects can be modeled in solutions to the NS equations or the boundary-layer equations, 
provided the appropriate temperature or heat-flux boundary conditions are specified, and the turbulence model 
is extended to calculate the turbulent heat transport. 


10.4.2 Viscous Effects and Turbulence 


As I claimed in Chapter 2 and we discussed further in Section 3.1, the continuum formulation embodied in the 
‘full’ NS equations represents molecular viscous stresses and heat fluxes in a way that is nearly exact, for nearly 
any practical purpose. Most practical applications in aerodynamics, however, involve turbulent flow, for which 
we must resort to RANS equations with turbulence modeling, as we saw in Section 3.7. This entails a substan- 


tial loss in physical fidelity, and for swept wings (Section 8.6.2) and many other practical applications we must 
settle for CFD predictions that are far from perfect. Turbulence modeling also increases computing time and 
can reduce the robustness of convergence of many numerical schemes. And as we saw in Sections 10.3.2 and 
10.3.3, turbulent boundary layers impose difficult demands on the grid, requiring very small spacing near the 
wall to resolve the viscous sublayer. 

Other complicating factors that impact the fidelity of viscous-flow CFD are separated flow, laminar-to-turbu- 
lent transition, and laminar separation bubbles. The fidelity of turbulence models, not perfect even for attached 
boundary layers, deteriorates further in separated regions. Flows with long runs of laminar boundary layer can 
be very sensitive to the location of transition. Even if the CFD code has a built-in transition prediction, such 
flows are difficult to predict accurately because transition is sensitive to environmental factors: surface quality 
(part of the “environment” seen by the flow), freestream turbulence, and acoustic disturbances. Even if such 
factors are known, the sensitivity of transition to them 1s often not. And most current viscous-flow codes don't 
have built-in transition prediction anyway. Laminar separation bubbles with turbulent reattachment are handled 
by very few CFD codes, the MIT MSES (Drela, 1993) code being one of the few. Laminar separation bubbles 
play an important role in stall behavior in many inlet and airfoil flows, and the inability to handle them is one 
of the more serious shortcomings of current RANS codes. 

The “full” NS equations provide for all of the components of the molecular viscous stress that are likely to 
be of any consequence in any flow of a simple fluid in the continuum regime. Of course predicting the turbulent 
counterparts of these stresses accurately is problematic, but the “full” RANS equations at least have provisions 
for all of them. That said, there are many practical situations in which only some of them are significant, and 
we needn't account for them all. The thin-layer NS equations neglect the streamwise viscous (and turbulent) 
diffusion terms and all terms involving cross derivatives, thus saving computing time and reducing artificial 
dissipation, allowing the use of grids that are too coarse to resolve the neglected terms accurately. Many RANS 
codes have the option of running “full” or “thin-layer.” Finally, coupled inviscid/viscous methods make use 
of the boundary-layer approximation (Section 4.2), in which only the viscous (and turbulent) transport normal 
to the surface is accounted for, and the normal-direction momentum equation is omitted, which is adequate in 
many flows in which the boundary layer remains thin. 


10.4.3 Separated Shear Layers and Vortex Wakes 


Separated shear layers and vortex wakes are naturally captured in solutions to the NS equations. The funda- 
mental limitation on the accuracy of prediction is the fidelity of the turbulence modeling, though in practice grid 
resolution is often just as serious a limitation, especially in 3D. 

Surprisingly, shear layers (vortex wakes) emanating from sharp edges can also arise in numerical solutions 
to the Euler equations. First, note that the Euler equations themselves, before they are discretized, allow contact 
discontinuities (slip surfaces across which there is zero normal velocity and a jump in tangential velocity) as 
weak solutions to the equations. Such slip surfaces are the Euler-equations analog to the vortex sheets we saw 
in idealized models of lifting flows around 3D wings (Sections 8.1.2 and 8.3.4). Thus in principle, a shear layer 
can be modeled in the Euler equations as a discontinuity and fitted, just as a shock can be. But in practice, shear- 
layer fitting is even less practical than shock fitting because shear layers often have edges (as would come from 
the tips of a lifting wing) where they tend to roll up into tight spirals, as we saw in Section 8.1.2. Such rolled-up 
shapes are all but impossible to deal with by shear-layer fitting. Therefore, just as with shocks, it is much more 
common to allow smeared versions of vortex wakes to be captured in the numerical grid. 

Shear layers captured in Euler solutions have interesting features, as analyzed and described out by Powell et 
al. (1987). Like real shear layers, they are of course characterized by a “jump,” across the layer, in the velocity 


vector. And, like real shear layers, they tend not to support much of a pressure jump across the layer. Thus when 
an Euler shear layer is wet on both sides by fluid from the freestream, there tends to be not much of a jump 
across it in velocity magnitude, only in direction. The velocity component perpendicular to the velocity jump 
tends to be nearly constant through the layer, as illustrated in Figure 10.4.1. There is thus a deficit in velocity 
magnitude inside the layer, which is accompanied by a deficit in total-pressure, since the static pressure varies 
relatively little. 


Figure 10.4.1 [Illustration of how the velocity vector varies through a shear layer captured in a solution to the 
Euler equations, as explained by Powell et al. (1987). Vi and V2 are the velocity vectors on the two sides of 
the shear layer, V2 — V1 1s the velocity jump across the layer, and the dashed vectors are velocities at interme- 
diate levels within the layer. The component perpendicular to V2 — V1 is nearly constant, and at intermediate 
levels there are therefore deficits in velocity magnitude and total-pressure 








View. 


When a shear layer or wake is captured in an NS solution or an Euler solution, inadequate grid resolution 
often degrades accuracy because the location of the shear layer is not usually known a priori, and it is usually 
not practical to provide sufficient grid density to resolve a shear layer throughout the region where one might 
be expected to appear. 

In panel-method codes based on potential-flow theory, the wakes of lifting surfaces must be paneled. In Sec- 
tion 10.2.3, we discussed wake “relaxation” schemes that attempt to determine a wake configuration that 1s also 
a stream surface (no through flow). These schemes, however, are not typically robust because they come up 
against the same problem we discussed above in connection with shear-layer fitting in Euler solutions: Vortex 
wakes tend to roll up into tight spirals. In a panel method, no reasonable panel spacing would be able to resolve 
a tightly wound spiral. 

Thus it is much more common to fix the wake in space and model it in a way that is equivalent to the assump- 
tions of Trefftz-plane theory. A wake sheet is defined that extends back from the trailing edge of each lifting 
surface, usually in such a way that its cross-section in a Trefftz-plane view remains constant. In surface panel 
methods, the sheet is paneled with doublet singularities whose strengths are distributed so as maintain a velocity 
jump across the sheet that is equivalent to having vortex lines oriented straight back, at least in plan view. A typ- 
ical wake paneling is shown in Figure 10.4.2. In finite-difference field methods for the full potential equation, 
the sheet is identified with a particular grid surface, and a velocity-jump boundary condition is imposed across 
the sheet that has the same effect as the singularity distribution used in panel methods. 





Figure 10.4.2 Typical paneling of a wake sheet behind a lifting surface in a potential-flow panel method. (Plot 
by M.F. Smith.) 





In both panel methods and field methods for 3D potential flow, the spanwise distribution of effective shed 
vortex strength is determined as part of the flow solution so that the upper- and lower-surface pressures are 
equal at the trailing edge, which is equivalent to imposing the Kutta condition for a 2D airfoil (Section 7.1). For 
high-aspect-ratio wings, the distribution of effective shed vortex strength works out to be consistent with the 
spanload on the lifting surface, just as in Trefftz-plane theory. 

A potential-flow wake-vortex sheet defined in this way has approximately the right distribution of vortex 
strength where it leaves the trailing edge, but it reflects reality less accurately downstream because in a real 
wake the vortex lines migrate generally outboard, and the sheet rolls up at the outer edge. Well downstream of 
the wing, some details of the overall flowfield, such as the downwash field at the location of an aft tail, will not 
be accurately predicted as a result. 

In Section 8.3.4, we discussed how a wake sheet in which the vortex lines are aligned in the freestream dir- 
ection is not force-free, but is drag-free. We noted that this doesn't guarantee that the induced drag calculated 
by Trefftz-plane theory based on the distribution of vortex strength in the wake is correct, but only that in a 
shock-free potential flow it 1s consistent with the pressure drag on the configuration. Still, short of a “relaxed” 
stream-surface wake, a drag-free wake with vortex lines in the freestream direction would seem to be the best 
option we have. A drawback in practice 1s that if solutions are desired at more than one angle of attack, a dif- 
ferent wake definition is needed for each one. Of course it is easier to define just one wake anchored to the 
configuration and to allow it to tilt relative to freestream when the configuration is analyzed at different angles 
of attack. But how much is accuracy degraded if we do so? 


Our discussion so far might lead one to think that tilting a potential-flow wake sheet up or down in this way 
would be a very bad thing to do, because then the wake would not be drag-free. However, counterintuitive as it 
might seem, a wake that is tilted by a modest amount will usually have very little effect on the results that usu- 
ally matter. Tilting the wake up or down will usually have almost no visible effect on plots of pressure distribu- 
tions and very little effect on the integrated lift and moments. The spanload shape will also typically be changed 
very little, and as a result, the induced drag calculated by Trefftz-plane theory will not be significantly changed. 
The Trefftz-plane drag calculation gives the total induced drag on the entire paneled configuration, including 
the wake sheet. So presumably when a wake is tilted and takes on a drag force, the integrated pressure drag on 
the configuration surfaces will change by roughly the same amount in the opposite direction. But this usually 
involves subtle changes to the pressure distributions that are practically invisible on a pressure-distribution plot 
and have almost no effect on lift and moments. So as long as we follow the usual practice of inferring induced 
drag from the Trefftz-plane calculation and not from pressure integration, a tilted wake will have little effect on 
the results we actually use. 


10.4.4 The Farfield 


In most applications in external aerodynamics, the flow approaches a uniform condition in the farfield. When 
we simulate such a flow in a panel method, this condition is automatically enforced by the superposition of a 
uniform flow and the singularities that are used to simulate the body and its wake. The contributions from the 
singularities die off with distance in just the right way to simulate a free-air flow in an infinite domain. 

In methods with field grids, getting the farfield “right” is a more complicated problem. For an outer boundary 
at a finite distance, there is no practical set of boundary conditions that can provide a perfect simulation of flow 
in an infinite domain, and we must generally settle for something less than perfect. A minimum requirement 
should be that the solution converges toward the right behavior as the outer boundary is moved farther away. 
For a given set of flow equations (full-potential, Euler, or NS), there are several choices of farfield boundary 
conditions that are acceptable in this sense. For boundary conditions of this kind, the farther away we put the 
outer boundary, the less impact imperfections in the boundary conditions will have. But the bigger the domain 
gets, the more grid points or cells we will have to use. So regarding the impact of the farfield boundary condi- 
tions, there is a trade to be worked between fidelity and cost, just as there is regarding spatial truncation error. 

What the “right behavior” is at the outer boundary is complicated by the fact that vortex wakes associated 
with lift in 3D persist over long distances downstream (forever, 1n the case of a gridded wake in full-potential). 
We must therefore allow some strong flow disturbances to pass through the downstream boundary. However, 
even if we didn't have these disturbances, enforcing uniform flow at the farfield boundary wouldn't be an option 
because it requires more boundary conditions than we're allowed to specify in any of our equation sets (potential 
flow, Euler, or NS). 

In full-potential methods, because there is only one unknown, only one boundary condition can be imposed. 
Thus only one component of the velocity vector can be specified, and the others must be allowed to be determin- 
ed by the solution. Specifying uniform flow is not allowed because it would require specifying two components 
in 2D or three components in 3D. 

The situation with the Euler or NS equations is more complicated. Recall from Section 3.4.4 that we have six 
unknowns in Euler, and eight in NS. The combinations we are allowed to specify depend on whether the flow is 
entering or leaving the domain, whether the flow is subsonic or supersonic, and whether the equations are Euler 
or NS. These issues are discussed in Hirsch (2007). 

For internal flows in ducts, including solid-wall wind tunnels, the appropriate boundary condition at the wall 
for any equation set would be no through-flow (normal component of velocity equals zero). For the NS equa- 
tions, a no-slip condition and a thermal boundary condition would also be required. Wind tunnels with holes 
or slots in the walls are more difficult. The holes or slots can be modeled individually, but conditions that are 
easy to impose on the flow through them cannot always be relied on to model the situation accurately. Model- 
ing of slotted walls is discussed, for RANS by Bosnyakov et al. (2008), and for viscous/inviscid coupling by 
Krynytzky (2004). There are also models available that area-average the effects of the holes or slots, but with 
some further loss of fidelity, and these have been largely superseded by models that treat the slots or holes as 
discrete. Internal flows also require boundary conditions upstream and downstream, the usual combination be- 
ing stagnation temperature and pressure upstream, and static pressure downstream. 


10.4.5 Predicting Drag 


In previous chapters, we discussed many aspects of the physics of drag: The drag of streamlined and bluff bodies 
in general in Section 6.1, the profile drag of airfoils in Section 7.4.2, the shock drag of airfoils in Section 7.4.8, 
and lift-induced drag in Section 8.3. A common thread in all of this is that drag depends on subtle flow details 


that CFD cannot be expected to predict perfectly. Turbulence modeling, with all its imperfections, will always 
compromise the prediction of viscous drag, even in attached flows. Separated flow is not predicted well by CFD 
unless extraordinary measures are taken (DNS, LES, or DES), and this compromises prediction of drag in any 
flow involving separation. Shocks and vortex wakes must be captured in grids that are often not as fine as they 
should be, compromising the prediction of shock drag and lift-induced drag. 

To these difficulties we must add that drag is particularly sensitive to any lack of iterative or spatial conver- 
gence in the flow solution. A lack of spatial convergence in particular shows up as artificial dissipation, produ- 
cing spurious drag that looks like additional viscous drag or shock drag. 

Although CFD drag predictions are generally far from perfect, there is a widespread assumption that pre- 
dicted drag increments due to small changes in the configuration or the flow conditions are much more accurate 
than predicted absolute levels. There is some justification for this, in that the increments we are usually inter- 
ested in are small fractions of the total drag, and the error in an increment 1s therefore likely to be smaller than 
the error in the absolute level. But there is no other theoretical reason to expect an advantage in dealing in in- 
crements, and we're depending on the assumption that the bias in the absolute level doesn't change so much 
between the two calculations as to swamp the increment we're looking for. For this assumption to be justified, 
we must be diligent about making the two calculations as comparable as possible, avoiding unnecessary changes 
to the grid, and minimizing truncation errors and residuals. 

CFD can certainly produce useful drag predictions. We just need to remember that they are not infallible. 


10.4.6 Propulsion Effects 


In CFD modeling of full airplane configurations it is not usually practical to resolve propeller blades or the in- 
ternal details of turbojet or turbofan engines. Such systems are usually modeled by inflow and outflow surfaces 
in the grid, where boundary conditions can be applied to simulate the time-averaged effects of the propulsion 
device. Boundary conditions used for this purpose can be fixed or can be formulated to respond in some real- 
istic way to the surrounding flow conditions, so as to model the response of the device. This kind of modeling 
of propulsion is similar to the basic actuator disc theory we discussed in Section 6.1.10 in connection with the 
physics of propellers, but it is often extended to include compressibility and heat addition. 


10.5 CFD Validation? 


“CFD validation” is a term that is heard frequently among users of CFD codes in industrial environments, 
among their customers, and in the engineering literature. But what does it mean? If we take the words literally, 
there are two relevant definitions to consider (Webster's, 1976): 


¢ Validate: To confirm the validity, and 
¢ Valid: Seen to be in agreement with the facts or to be logically sound. 


According to these definitions, successful CFD validation requires that the code in question be found to agree 
with “the facts,” which I would take to mean experimental measurements covering the whole range of flows for 
which the code is to be used. However, for most practical applications, CFD codes tend not to agree with the 
facts as well as we would like, over as wide a range of flows as we would like, and we must conclude that CFD 
validation in any literal sense is usually an unattainable goal. 

This is not to suggest that the enterprise we call “CFD validation” is hopeless, just that it is misnamed. It can 
actually serve a very necessary purpose. When we compare a code against a wide range of test cases, we learn 


how physically accurate the solutions are likely to be and what kinds of biases to expect in particular situations. 
Then when we use the code in an application, we have a basis for making uncertainty estimates and applying 
adjustments to the results. Looked at in this light, the purpose that CFD testing serves is really more calibration 
than “validation.” 

Of course the knowledge gained in CFD testing helps us in our application work only if we remember to 
actually use it. Even when we know the limitations of a code, we have a natural tendency to forget them and to 
be overly optimistic about the accuracy of the results, something we noted in the introduction to this chapter. 


10.6 Integrated Forces and the 
Components of Drag 


In principle, determining total integrated forces and moments from a solution of the NS equations is simply a 
matter of carrying out the integrations over the body surface called for in Equation 5.4.1 for the lift, Equation 
6.1.1 for the drag, and corresponding integrals for moments. For NS codes, these forces and moments from 
surface-pressure and surface-shear integration are the primary force outputs. 

But in the case of the pressure drag, surface-pressure integration yields only the total and doesn't determine 
the “components”: Shock drag, induced drag, and the viscous contribution to pressure drag. Recall from Section 
6.1.3 that these drag “components” are not rigorously separable, and that they can be determined only through 
the appropriate theoretical idealizations. Estimating shock drag requires finding the shocks in the flowfield, 
identifying streamtubes that pass through them, applying the Oswatitsch formula (Equation 6.1.11) to each one, 
and integrating (summing) the results, as we discussed in Section 10.4.1. Estimating induced drag requires a 
Trefftz-plane calculation (Equation 8.3.10). 

Other ways of integrating flowfield quantities can also be used with solutions to the NS equations. Vorticity 
and total-pressure data on a single cross-plane downstream of the configuration can be used, as described in 
Section 6.1.4.7, or more-general formulas involving volume integration of the viscous dissipation can be used, 
as described by Tognaccini (2003). An advantage claimed for the latter method 1s that drag determined in this 
way converges faster with grid refinement than does drag determined by surface-pressure integration. 

For viscous/inviscid interaction methods, surface-pressure integration doesn't work at all for determining the 
pressure drag. Instead, it requires building up estimates of the “components”: shock drag and induced drag as 
described above, and viscous profile drag from one of the variants of the Squire-Young formula described in 
Section 6.1.4.5, applied either at the wing trailing edge or somewhere in the viscous wake downstream. 

When propulsion inlets and exhaust streams are represented in CFD calculations, special care is required to 
ensure that thrust/drag bookkeeping 1s done in a physically consistent way. Drela (2009) discusses the physical 
issues that arise when propulsion effects are present in the flowfield. 


10.7 Solution Visualization 


A CFD field solution defines the flowfield everywhere and can therefore tell us a great deal more about a flow 
than just the integrated forces. On solid surfaces, there are surface distributions of pressure, temperature, and 
shear stress as well as patterns of skin-friction lines. Away from the surfaces there are volume distributions of 


scalar quantities: pressure, total-pressure, density, temperature, turbulence kinetic energy, and eddy viscosity. 
And there are volume distributions of vector quantities: velocity and vorticity. The velocity field defines stream- 
lines, streaklines, particle paths, and timelines; and the vorticity field defines vortex lines. Understanding such 
multidimensional information requires graphic visualization, which can range from simple 2D Cartesian plots 
to elaborate 3D perspective presentations, both still and animated. In this section, we'll look at how the various 
sorts of displays serve different purposes in illuminating different aspects of a flowfield. 

Graphic visualization is the most powerful tool we have for gaining any intuitive understanding of complex 
quantitative information, and the computer has greatly expanded the range of visual displays that it 1s practical 
to produce. But before we get into the specifics of applications in aerodynamics, it's worth reflecting on the 
general nature of the graphic medium and how it limits what we can actually do. 

First, the nature of the visual sense itself imposes limitations on our ability to view real 3D information. Our 
binocular vision provides only very low resolution in the depth dimension and works only at short distances. So 
for quantitative purposes, we see little more than a 2D projection of what's in front of us. Much of what we per- 
ceive about depth comes from prior knowledge and context: We know the tree is closer to us than the mountain 
because we know something about the relative sizes of trees and mountains and because the tree blocks part of 
the view of the mountain. So blockage of one object by another helps us with depth perception, but it is also 
a limitation: Along any given direction from our eyes, we can usually see only one thing at a time. Unless an 
object in our field of view is nearly transparent, it blocks our view of anything behind it. 

Next are the limitations imposed by the physical media we typically use. We have no practical alternative to 
viewing graphics on computer screens and paper surfaces, which are basically 2D displays that take no advant- 
age of our binocular vision. A depth dimension can be conveyed only through perspective, shading, and context. 
As aresult, displays that depict 3D geometry generally incur some degree of spatial ambiguity. 

A 2D display medium offers us only five basic graphical constructs for displaying spatial distributions of 
quantitative data: 

1. A Cartesian plot of a scalar quantity versus one coordinate. 


2. A contour plot of a scalar quantity versus two coordinates. A contour plot can be plotted in the 2D 
plane, or it can be constructed on a curved surface in space, such as the surface of a body, and viewed in 
perspective. 


3. A shaded isosurface showing how a single value of a scalar quantity is distributed in space in three 
dimensions. 


4. An array of arrows showing the distribution of a two-component vector in two spatial coordinates. 
5. Space curves, viewed either in 2D or 3D, representing a direction field. 


We've used examples of all of these to illustrate flowfield phenomena in earlier chapters. Now let's look at 
how we can best represent different aspects of flowfields, using these constructs. 


In 3D flow, the overall pattern of pressure on the surface of a body is most clearly visualized in an isobar 
contour plot using either colors or lines, as in Figure 10.7.1. In this plot of the pressure distributions on the up- 
per surface of a swept wing at two different angles of attack, the changes in the position and the sweep of the 
shock are visible at a glance. However, subtle details of the local chordwise pressure distributions, such as the 
magnitude of the shock pressure rise, the slope of the “rooftop” ahead of the shock, and whether or not there is a 
re-expansion aft of the shock are not easy to see in a contour plot and are visualized better in Cartesian pressure 
plots for station cuts along the span, like airfoil Cp plots, of which there are several examples in Section 7.4. 
Putting data for more than one spanwise station on a single plot tends to be more confusing than helpful, and it 
is usually better to show multiple stations as an array of plots, as in Figure 10.7.2. This kind of display shows 
subtle details with much greater pressure resolution than a contour plot, but it doesn't provide nearly the same 
feel for the overall pattern. 


Figure 10.7.1 Isobar contour plots of the pressure distribution on the upper surface of a swept wing at two dif- 
ferent angles of attack. The shock 1s visible as the concentration of contours traversing most of the span. Eta 
values indicate spanwise stations for which Cartesian plots of the pressure distribution are shown in Figure 
10.7.2. CFL3D solutions and plots by B.J. Rider. Wing geometry is the Common Research Model developed 
by Vassberg et al. (2008) 
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Figure 10.7.2 Cartesian plots of the pressure distributions at stations along the span of the wing shown in Fig- 


ure 10.7.1. Such a plot shows subtle local details of the pressure distribution not easily seen in the contour plot 
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Though showing multiple spanwise stations on a single Cp plot typically results in a jumble, a sequence of 
conditions (angles of attack or Mach numbers) at a single station on a single plot tends to form a “family” of 
curves that are easily readable and can be quite informative. Examples for 2D airfoils were shown in Figures 
7.4.26 and 7.4.27. 

Skin-friction lines are often used to visualize the distribution of shear-stress direction on body surfaces in 3D 
flow. Skin-friction lines plotted from CFD solutions are analogous to the oil-flow visualization pictures from 
experiments, and they have the same strengths and weaknesses that we discussed in Section 4.1.2. They are 
useful mainly for showing the surface signatures of the kinds of 3D separation we discussed in Chapter 5, and 
an example of a complicated separation pattern on a swept wing is shown in Figure 10.7.3. Note that the shock- 
induced separation with reattachment looks very similar to the pattern sketched in Figure 8.6.9a and that the 
trailing-edge separation is similar to the pattern in Figure 8.6.9b. 


Figure 10.7.3 A plot of computed skin-friction lines on a swept wing. RANS calculation and plot by N.J. Yu 
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As pointed out in connection with Figure 4.1.11, skin-friction lines have the disadvantage that they can give 
the misleading impression that they represent the general flow direction over the surface. We must keep in mind 
that skin-friction lines represent only the surface-shear-stress direction, and that the flow direction can be very 
different only a short distance off the surface, as seen, for example, in the flow direction plotted for a swept- 
wing boundary layer in Figure 8.6.13. 

Visualizing 3D flow structure out in the field poses interesting challenges because of the essentially 2D 
nature of the graphic constructs we have available. We've already noted that in a 2D representation of 3D space 
the depth dimension is ambiguous, which can make interpretation difficult and error-prone. Animation in which 
the 3D configuration represented by the graphic rotates continuously, or a screen display in which the viewer 
can rotate the configuration at will, can make things much easier to understand. 

It is often effective to show distributions of flow quantities on planar cuts in the field, where the cuts are dis- 
played as part of a perspective view of a 3D configuration. In Figure 10.7.4, isobars are plotted in a longitudinal 
cut in the flow around a wing, clearly showing the shock that caused the separation visible in Figure 10.7.3. Of 


course, the relatively low pressure resolution of an isobar plot is still a disadvantage, as it was in the case of 
surface pressures. 


Figure 10.7.4 Isobars in a longitudinal cut of a 3D flow around a wing, seen in a perspective view. Same flow- 
field as in Figure 10.7.3. RANS calculation by N.J. Yu. Plot by A.M. Malone 





Streamlines, streaklines, or particle paths displayed as space curves in a perspective view of the field can also 
be quite useful. The effectiveness of such a plot can vary widely, depending on how many curves are plotted 
and whether they are distributed effectively, aspects that are controlled by the code user. Each curve must be 
defined by the coordinates of a point through which it passes, and graphics software generally requires these to 
be chosen by the user. Some skill and trial and error are usually required to make an effective streamline plot. A 
plot needs to have a sufficient number of curves, as curves too sparsely spaced can miss important details of the 
flow. And the curves need to be positioned so as to mark the details that are of interest, particularly vortices, as 
we'll see next. 

Vorticity is often a key to understanding the structure of a flow, and concentrated vortices in particular are 
indicators of interesting things happening in the flow. In steady flows, concentrated vortices can be seen in 3D 
streamline plots, but visualizing a vortex this way is a hit-or-miss proposition if one doesn't know ahead of time 
where the vortex will form. Figure 10.7.5 illustrates this problem using the tip vortex behind a wingtip as an 
example. In (a), the streamlines are released too far inboard and fail to show evidence of the vortex. In (b), the 
streamlines are released close enough so that some of them orbit the vortex, indicating its location. 


Figure 10.7.5 Streamlines in the flowfield around a wing indicating the location of the tip vortex. Same flow- 
field as in Figures 10.7.3 and 10.7.4. RANS calculation by N.J. Yu. Plots by A.M. Malone. (a) Streamlines re- 
leased too far inboard of the tip fail to mark the vortex. (b) Streamlines released close enough to the vortex or- 
bit the vortex, marking its location 
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There are several kinds of displays that can make a vortex visible, but depend less on prior knowledge of the 
location. Figure 10.7.6 shows three examples using flow quantities in a cross-stream plane behind a wingtip, in 
the same flowfield as Figures 10.7.3—10.7.5. In (a), streamlines constructed from velocity vectors projected in 
the cut clearly orbit the vortex and make it visible. For this kind of plot to work, however, the cutting plane must 
be roughly perpendicular to the vortex. In (b), contours of constant vorticity magnitude mark much of the vortex 
sheet behind the trailing edge, not just the concentrated vortex rolling up at its edge. In (c), contours of constant 
pressure clearly indicate the location of the vortex core and don't show the vortex sheet inboard. A drawback to 
these three types of display is that a weak vortex would likely be difficult to see. 


Figure 10.7.6 Visualizing the tip vortex in a cross-stream cut in the field behind a wingtip. Same flow field as 
in Figures 10.7.3, 10.7.4, 10.7.5. RANS calculation by N.J. Yu. Plots by A.M. Malone. (a) Streamlines from 
velocity vectors projected in the cut. (b) Contours of constant vorticity magnitude in the cut. (c) Contours of 
constant pressure in the cut. 
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Other approaches to visualizing vortices without prior knowledge of their locations have been proposed. 
Saderjoen et al. (1998) identify two main classes of telltales: 


1. Point-based flow quantities, of which the vorticity magnitude = |V x V| is the most widely familiar 
but has the disadvantage that it doesn't distinguish between shear layers and concentrated vortices, as we 
saw in Figure 10.7.6b. Because concentrated vortices in steady flows must generally be aligned with the 
prevailing local flow direction, the normalized helicity V - w/(|V| « |@|) may be a better choice than the 
vorticity magnitude. 


2. Curve-based quantities based on nonlocal properties of the streamline pattern. The basic idea was put 
forward by Robinson (1991): “[A] vortex exists when instantaneous streamlines mapped into a plane nor- 
mal to the vortex core exhibit a roughly circular or spiral pattern, when viewed from a frame of reference 
moving with the center of the vortex core.” Saderjoen et al. (1998) give two examples of automatic vor- 
tex telltales based on this idea, both applied to projections of the streamlines into a plane. One involves 
calculating the locations of centers of curvature at many points along many streamlines and then calculat- 
ing the density of these center points in the plane, called the curvature center density. The other involves 
following streamlines and calculating the winding angle. A vortex is identified when a winding angle of 
2m 1s reached, and the end point is near the starting point. 

Complicated unsteady vortex structures in wakes and turbulent boundary layers can be clearly visualized in 

terms of isovorticity surfaces, as in Figures 5.3.3a and 10.2.1. 


10.8 Things a User Should Know about a 
CFD Code before Running it 


Running a CFD code without knowing what's in it is risky. It is too easy to make poor choices of input paramet- 
ers and get a solution that is not as accurate as it should be, and not understanding the limitations of a code can 
lead one to misinterpret the results. This section provides a brief check list of the things a user should know. 


What equations does the code solve? Do these equations adequately model the physics of the problem? 


This includes not just the basic equations, but the turbulence model and the provisions, if any, for modeling 
laminar-to-turbulent transition. These considerations determine the applicability of the method to the problem 
at hand, and the physical fidelity of the results. 


What order of spatial convergence does the numerical method provide, and how fine a grid resolution can 
be used on your particular problem? Can adequate grid density be concentrated where it is needed? 


These determine whether adequate numerical accuracy can be obtained. 

What provisions does the code have for monitoring grid convergence and iterative convergence? 
These are important for monitoring the numerical accuracy of the solution. 

Where did the code come from? 


¢ In-house developed? 
¢ Vendor supplied? 
¢ Vendor supplied and in-house modified? 


This affects where you may have to go to get questions answered knowledgeably. 
What is the code’s state of development? 


¢ Experimental? 


¢ Well developed and documented, but as a code only? 
¢ Well developed and documented as a code and integrated into the process in which you are using it? 


The code's status in this regard will affect: 


¢ The reliability of execution and convergence. 
¢ The user-friendliness of the inputs and outputs. 
¢ The degree of automation of pre- and post-processing steps. 


Has it been used for similar applications, and how good were the results? 


If this is a pioneering application, be prepared to spend considerable time in trial-and-error adjustment of 
input parameters and in assessing the quality of the results. 


Are consultation and support available? 
If not, arriving at good results could prove to be very time consuming. 


In light of all of the above, is the code really applicable to the problem at hand? 
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ABSTRACT 


In order to realize the ballistic control of the railgun and the flight stability of the projectile, a new 
type of railgun is designed, which can control the muzzle velocity and rotation rate. The method of 
the muzzle velocity and overload control is to adjust the voltage or other parameters of pulse power 
supply. It would be easy to change velocity accurately in large wide. Another widespread concern 
problem is launching the spinning stability projectile by railgun. This paper designed a new 
structure of additional rails to generate an unsymmetrical magnetic field to produce rotational 
torque in armature. The structure is simple and can control the rotation rate by linear changing the 
barrel parameters. The calculation formulas of interior ballistic are derived by Biot-Safar law. The 
important parameter is the deflection angle of the additional rails relative to the symmetry plane of 
main rail. The larger the angle, the greater the rotation torque generated in the armature. To 
maintain the flight stability of the projectile, the barrel structural parameters should be proportional 
to the projectile structural parameters. When changing the muzzle velocity, the rotation rate will 
also be the equal proportion change. So that the gyro stability is the same. The experiment proves 
that the railgun designed in this paper can launch the projectile to rotate. And the rotational pro- 
jectile may not cause the transition or much arcs. This method expands the application of the 
railgun. 

© 2018 Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http:// 


creativecommons.org/licenses/by-nc-nd/4.0/). 





1. Introduction 


With the development of electromagnetic launch technology, a 
series of problems are gradually solved [1], so that the railguns are 
close to practical. Electromagnetic launch has some advantages [2], 
in addition to the high muzzle velocity, the control of the ballistic is 
also an important advantage. Just control the current waveform, the 
ballistic is easy to adjustment [3]. The accurate control of the bal- 
listic is a large improvement to conventional weapons. In the 
future, the railguns may automatically design the best ballistic ac- 
cording to the launch conditions of environment and shooting 
angle. No need to master the knowledge of adjusting the sight or 
firing table (Fig. 1). 

The control of ballistic has lots of application demand [4]. For 
example, in the police firearms, there are often occur accidentally 
injures. When face to a suspect, whether to use the gun is often 
confusing the policeman, as it always leads to a mis-shot. On the 





* Corresponding author. 
E-mail addresses: tangbo90@126.com (B. Tang), Li_haiyuan@163.com (H.-y. Li). 
Peer review under responsibility of China Ordnance Society 


https://doi.org/10.1016/j.dt.2018.07.022 


other hand, if the policeman using non lethal weapons, they 
could not fight back with the armed criminals. This problem will 
be solved after using the ballistic controlled railgun. In the face of 
unarmed suspects, the ballistic could be regulated as a non lethal 
model. Good ballistic control will be significance (Fig. 2). 

The control of muzzle velocity also requires the projectile 
satisfying the flight stability. At present, the most common in 
electromagnetic launch is the tail wing projectile [5]. Because the 
current electromagnetic gun now cannot make the projectile 
rotating at high speed. It most uses the tail wing stable mode. This 
method uses armature to push the projectile, and the armature is 
separated from the projectile after the exit. It is disadvantageous as 
the armature will cause additional damage. Therefore, this paper 
proposes an integrated projectile without the separation of arma- 
ture and projectile. For the stability of the flight, the projectile must 
be rotating at high speed [6]. It may be used in various projectiles, 
such as high explosive shells, small caliber projectiles and kinetic 
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Fig. 1. Concept map of small caliber railgun and projectile. 
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Fig. 2. The application of controllable ballistic railgun. 
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Non rotating 


Fig. 3. Railgun launching Spin-Stabilized Projectile. 


energy projectiles (Fig. 3). 

The method of driving the projectile to rotate is using elec- 
tromagnetic force. And the armature moving forward is also 
driven by electromagnetic force. By controlling the electromag- 
netic force, it can control the muzzle velocity and the rotation 
rate of the projectile. A new design method of railgun is proposed 
in this paper. The muzzle velocity and rotation rate are propor- 
tional to the structure of the barrel, projectile and the parameters 
of the pulse power supply (PPS). Besides, the muzzle velocity is 
proportional to the rotation rate, if a velocity satisfies the sta- 
bility condition. When adjusting the velocity, it can still satisfy 
the stability condition. Experiments show that the method can 
stabilize the small caliber projectile and achieve the purpose of 
ballistic control. It provides a novel idea for the development of 
railgun. 


2. Theoretical analysis 
2.1. Design principle 


The principle of electromagnetic launch is simple. Taking railgun 
as an example, the current in rail generates magnetic field and 
action with the armature to generate electromagnetic force. The 
innovative idea of this paper is adding additional rails to control the 
interior ballistics. It is simple to generate rotating torque in arma- 
ture and enhance the launch efficiency [7]. 

As shown in Fig. 4, the additional rails are located above and 
below the main rails and can be shifted laterally, which is different 
from the usual series structure. The center of the additional rails 
forms a certain angle relative to the symmetry plane of the main 
rail. The additional rails of this structure will generate the x direc- 
tion magnetic field in the armature, thus forming a rotational tor- 
que to drive the armature rotation. The main influence od 
rotational torque is dh which is the distance between the additional 









Additional 
rail 





Main rail 


_ 





Fig. 4. Railgun structure and the parameters. 


rails and the center symmetry plane. Or saying the offset angle. The 
rotation rate can be regulated by adjusting the offset position of the 
additional rails. 

According to Biot-Safar law, the magnetic induction intensity in 
the y direction at the center of the armature is 


Uolline COS a COS 


Aur - (1) 


By = 2 //By,-dQ = 2 f/f 
Q Q 
In the formula, Iijne represents the current in a line in the rail, 
Biine represents the magnetic induction intensity produced by the 
line current [8]. The a indicates the angle between the line and the 
armature center. The @ and rcan be seen in Fig. 4. The magnetic field 
in the y direction acts with the current in the armature to generate 
the forward electromagnetic force. 
The magnetic field in the x direction generated by the additional 
rails at the armature tail arm is a formula which produces rotational 
torque in the armature. 


By = [Biine2d@ + [Biinesd2 


——— “)a0 (2) 
IQ r3 








7 uctiine COS a (= 65 
fa Arr 

In Formula (2), subscript 2 and 3 represent two additional rails 
seen in Fig. 4. The magnetic fleld in the x direction is mainly 
generated by the additional rails and forms the rotational torque. 
Because the main rails are symmetrical relative to the armature, the 
force generated by x direction magnetic field is also symmetrical, 
and there is no rotation torque. So it may just calculate the addi- 
tional rails. 


2.2. Principle of muzzle velocity control 


The control of muzzle velocity depends mainly on the current 
waveform, as the voltage of PPS. For the multi module, it can 
generate the square current waveform to control the overload. The 
power supply system consists of N modules, each module capaci- 
tance is Cy, the initial voltage is U,,, the Inductance value is Ly, the 
equation group is [9] 
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dl 
—UcK lr Lear oe Up =0 UK >0 
dl 
Lat Up =0 UK <0 
dU, 
I — no rs UK >0 (3) 
Uc =0 Uc <0 
n 

I=Sok 

k=1 
k=1,2,---,n 


If Up is used as the voltage at ends of the rail, the circuit equation 
of the barrel system is written as follows 


Up = (Lo +L’) o + IRa + (Ro + R’x)I +IL'V (4) 


The magnetic induction intensity of two main rails in the y di- 
rection is 


0.5s+wy, 
By, = 5h / eT ee a (5) 
Me eee x4/x2 +22 + h4 


The Formula (5) is difficult to write specific expressions, and the 
final solution can be obtained by numerical calculation. 


The magnetic flux density of two additional rails in the y di- 
rection is 


S2 
__ Hol _ 
Byo = no | (An — Ar)dx (6) 
Si 


Add the magnetic field generated by the additional rails to the 
main rails 


Where 

Ay _ arctan 2a ta /2) (8) 
xy/x2 +22 + (dh — hy /2)? 

Ap = arctan Zant ta /2) (9) 
x4/x2 +22 + (dh + hp /2)? 

S; = 0.5s + w, + ds (10) 

Sp = 0.55 + w, + ds + wp (11) 


All parameters in the formula are the structural parameters of 
the barrel seen in Fig. 4, they can be regarded as constants when 
calculating. 


2.3. Principle of rotating speed control 


The principle of rotating torque can be simply understood as 
deflection of two additional rails. Which generate magnetic B, in 
the tail arm of the armature, as shown in Fig. 5. The B, interacts 
with current in the armature tail to generate torque. After the 
armature is rotated, the current is still in its original position, thus 





Fig. 5. Principle of the rotational torque generation. 


continue accelerating armature rotation. 
The magnetic flux density of the additional rails in the x direc- 
tion generated one side of the armature tail is 


/ 


So S, 
_ Mol _ _ 
Be = go [we Li)dx+ f (La L,)dx (12) 
Si S 
Where 


= 21 y2 
L Pee (13) 


(dh + hy /2)? + x2 


—————, (14) 
S, =s+w,+ds (15) 
S, =S+W, +ds+ wy (16) 


According to the ampere force formula, the electromagnetic 
force and torque are obtained 


F = Byls (17) 


The muzzle velocity and rotation rate of armature or projectile 
can be obtained by 


t 
F 
v= | oat (19) 
0 
> M 
p= | Sat (20) 
O 


The m and C represents the mass and moment of inertia of 
armature and projectile. The v is muzzle velocity, r is rotation rate. 
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2.4. Ballistic stability 


For the small caliber railgun, the useful ballistic of a projectile is 
mainly the straight line. First consideration is gyroscopic stability 
represented by gyro stability factor Sz [10] 


Cry 
Se= (a) a ae 


The A means the moment of inertia of the projectile equator, kz 
means the coefficient of projectile turning moment. To be stable, it 
needs S,> 1. 

For the relationship of muzzle velocity and rotation rate 


f_2A 
vy C 


If it want to be stable, it needs S, > 1, so 


kzS¢g (22) 


r . 2A 
v C 

When the parameters of the launcher and the projectile are 
determined, muzzle velocity and rotation rate can be regarded as 


the product of the additional rails deflection coefficient and current 
integration 


i=2 / Madt = k; (4) / dt (24) 


y= = | Fae = k,(0) f Pat (25) 


In Formulas (24)—(25), the coefficient can be obtained from the 
previous formula, which is related to the structural parameters of 
the barrel. Therefore, if we want to satisfy the flight stability of the 
projectile, we need to satisfy the following formula 


r kr (4) 2A 


—_= = > 
ae (4) C 

The item on the left side of “>” is related to the structural pa- 
rameters of the barrel, on the right side is related to the structural 
parameters of the projectile. Which means muzzle velocity and 
rotation rate are fixed proportion relation. This indicates that when 


change the muzzle velocity of the projectile, the rotation ratio will 
automatically follow the change. The flight will be stable 


kz (23) 


m M 
Vkzxe E (26) 





hd? Po 2 2 
— — x ai Kmz(M)0 = Akzv“ 6 (27) 
On 


Mz 


The k, can be estimated by formula. kjmz (M) can be obtained by 
checking the tables [11], M; is the turning moment, h is distance of 
pressure center and centroid of projectile, d is diameter, 6 is angle of 
attack, p is air density, H(y) means height. 


3. Discuss 
3.1. Controlling of muzzle velocity 


The muzzle velocity of the projectile is related to the value of 
current. For the capacitive energy storage pulse power supply the 
current is proportional to the voltage of PPS. Therefore, adjusting 
the voltage can obtain the required current. Then the muzzle ve- 
locity may be controlled. Fig. 6 shows four kinds of muzzle velocity 
and four corresponding current waveforms in the same linetype. 
The velocity curve is with circle mark. The faster the speed, the 
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Fig. 6. Muzzle velocity controlled by current wave. 


shorter the launching processes time. The muzzle velocity of pro- 
jectiles can be controlled very accurately in theory. But at present, 
for the electrical contact and power system errors such reasons, it is 
not easy to realize. 


3.2. Controlling rotating speed 


When the two additional rails are shifting from the main rail 
symmetry, the magnetic induction intensity is obtained as shown 
in Fig. 4. The magnetic field is deflection, and the magnetic field has 
downward direction component at the tail of the armature shown 
in Fig. 7. That generates rotational torque and driving the armature 
rotation. 

The direction of current in the armature is shown in Fig. 8. The 
current flows into the central part of the armature from the tail arm, 
which produces rotating torque with the magnetic field. In order to 
keep this direction of current, there should be some slots in the rear 
of the armature. This structure also has another advantage, the tail 
arm has the outward expansion electromagnetic force. It can make 
the armature contact with the rail closely, increasing the electrical 
contact [12]. 

The influence of the deflection angle of the additional rails to the 
symmetry plane of the main rail on the forward electromagnetic 
force and rotation torque is shown in Fig. 9. The forward electro- 
magnetic force decreases slightly with the increase of the deflec- 
tion angle when the current is constant, the influence of the 
deflection angle on the forward electromagnetic force is not large. 





Fig. 7. Asymmetrical magnetic field generated by additional rails. 
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Fig. 9. The influence of the deflection angle of the additional rails. 


The rotation torque increases linearly with the deflection 
angle. According to the formula, to meet the condition of the gy- 
roscope stability, the rotation torque relative to the electromag- 
netic force should be more than a certain value. In Fig. 9, the 
torque value on the left of the stable point is smaller than the 
certain value, it can not be spin stability. On the right the projectile 
would be sable. 

The rotation rate of the projectile is also proportional to the 
initial voltage of the PPS, as shown in Fig. 10. The higher the voltage, 
the greater the deflection angle, the higher the rotation rate. The 
adjustment range of the rotation rate is large, high voltage and little 
angle is suitable for the low rotation rate fin-stabilized projectile. 
Which means the method proposed in this paper is suitable for 
rotation rate control. 


Rotation rate/ 
(rad +s"') 


1.4105 
1.2*10° 
a 1.0« 10° 
Os, 
3 810" 
fa" |] 
£ 
< 6* 10! 
4x 104 
2x10! 
0 








500 1 000 1 500 2 000 
Voltage/V 


Fig. 10. Rotation rate controlled by deflection angle and vlotage of PPS. 


3.3. Relationship of muzzle velocity and rotating speed 


As shown in Fig. 11, when the parameters of the barrel are 
determined, the muzzle and rotation rate are increasing with the 
voltage of the pulse power supply. The stable factor Sg is basically a 
constant value. Which means that the flight stability is satisfied at 
one muzzle velocity, the other muzzle velocity will meet the sta- 
bility conditions too. Do not need to match the parameters of the 
barrel and projectile again. It is proved that electromagnetic launch 
is suitable for the control of muzzle velocity. 


4. Experiment analysis 


In previous papers, there have been many analyses of the effects 
of pulse power parameters on muzzle velocity [13,14]. This exper- 
iment did not repeat again. For the experiment of rotation projectile 
launched by rail is seldom to see. In the experiments of Challita, 
many times of projectiles rotation were observed in the round bore 
railgun [15]. It is believed that it was caused by electromagnetic 
force. But they did not want the rotation projectile, as some solu- 
tions to avoid rotation was proposed. 

In this experiment, it is mainly to study the feasibility of the 
rotating of the projectile driven by electromagnetic force. The 
structure of the launcher, as the principle of this paper, has been 
designed for the experiment, shown in Figs. 12—14. 

This launcher has a detachable structure. It can change the 
structure of the rails. The arc control device is installed at the muzzle. 
The internal structure is like the principle described in Fig. 4. 

Using the capacitive PPS as the energy source, the total current is 
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Fig. 11. Relationship between muzzle velocity and rotation rate. 





Fig. 12. Launcher for the experiment. 


B. Tang et al. / Defence Technology 633 


busbar 








Fig. 13. Pulse power supply model. 
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Fig. 14. Experiment results. 


about 140kA. At the beginning of the muzzle voltage, there is a 
negative current due to induced eddy current. The voltage always 
keeps a low amplitude during the launching, which indicates that 
the projectile has no transition and proves that the projectile has 
good electrical contact properties. 

The movement of the projectile is photographed by high 
speed camera. From Fig. 15, the projectile has good attitude and 
flight stability and the projectile has rotated during the flight. It 
can be read from the high-speed camera pictures, the rotating 
angle is about 30°, the experiments have got the rotation rate is 
over 100000°/s, more than tens of thousands rounds per min- 
ute. This proves that the electromagnetic force can drive the 
projectile to rotate. Besides, there is no arc around the projectile 
just like conventional projectile flight. Which means that the 
rotational projectile did not cause the transition or produce 
much arcs. This proves the rotational projectile is feasible for 
railgun. 


Black mark 









5. Conclusions 


The ballistic control of the railgun has a good application pros- 
pect. In the future, the railgun will be able to set the optimal bal- 
listic automatically and achieve the firing accuracy. For the muzzle 
velocity and rotation rate control, this paper designs a new type of 
railgun. Which produces asymmetrical magnetic by additional rails 
and generates rotation torque in armature. The structure is simple 
to realize, and the rotation rate can be controlled conveniently by 
changing the deflection angle. Which can be accurately controlled 
within a large range. In principle, it is feasible to adjust the muzzle 
velocity and rotation rate control before or during the launch. 
Changing current wave or the barrel parameters can achieved by 
electronic control or mechanical action. 

According to the analysis, to maintain the flight stability of the 
projectile, the structure parameters of the barrel and the projectile 
are required to satisfy a proportional relationship. If the flight 
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Fig. 15. Spin-stabilized projectile launched by railgun. 
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stability is satisfied at one muzzle velocity, the other muzzle ve- 
locity will meet the stability conditions too. Using the voltage of PPS 
and the deflection of the additional rails will easy to control the 
muzzle velocity and rotation rate. The experiment proves that the 
railgun designed in this paper can launch the projectile rotating 
and ensure the flight stability. Also proves that the rotational pro- 
jectile may not cause the transition or produce much arcs. 
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Modeling Multirotor Aerodynamic Interactions 
Through the Vortex Particle Method 


Eduardo J. Alvarez* and Andrew Ning" 
Brigham Young University, Provo, Utah, 84602 


Distributed electric propulsion and vertical take-off and landing has recently opened a 
new design space for urban air mobility. However, the use of multiple rotors operating 
in close proximity introduces complicated aerodynamic interactions that are not well un- 
derstood, are not captured through conventional design tools, and need to be addressed 
in the conceptual design stage. This study investigates the accuracy of the viscous vortex 
particle method (VPM) in modeling rotor-on-rotor aerodynamic interactions in a side- 
by-side configuration as encountered in tilt-rotor, quadrotor, and distributed propulsion 
aircraft. The VPM approach has the potential to enable the use of mid/high fidelity mod- 
els capturing multirotor interactions during conceptual design. Validation of the individual 
rotor is presented in both hovering and forward-flight configurations at both low and high 
Reynolds numbers. Validation of the hovering multirotor is then presented, followed by 
a detailed parametric study of rotor-to-rotor interactions during hover and forward flight, 
constructing the response surface of thrust, torque, and propulsive efficiency as a function 
of operational parameters. 


I. Introduction 


Technology convergence in the past ten years has recently opened a new design space in electric aircraft, 
enabling the use of distributed propulsion and electrical vertical take-off and landing (eVTOL) for urban air 
mobility./* The concept of eVTOL in an urban setting is forecasted to evolve into a $1.5 trillion “flying car” 
industry by the year 2040.° However, this unconventional aircraft configuration poses technical challenges 
that still remain to be solved.° A strong noise signature’® and a complicated transition maneuver” !° 
are examples of the challenges encountered in eVTOL aircraft, both stemming in some degree from the 
aerodynamic interactions between rotors and lifting surfaces.!' As an example of eVTOL configuration, Fig. 1 
shows NASA’s GL-10 tilt-rotor prototype aircraft in hovering mode during take off, prior to transitioning 
into wing-borne forward flight. 





Figure 1: NASA GL-10 eVTOL prototype aircraft in hover mode during take off. Credits: NASA Lang- 
ley/David C. Bowman. 


*Doctoral Candidate, Department of Mechanical Engineering, ealvarez@byu.edu, AIAA Member. 
T Assistant Professor, Department of Mechanical Engineering, aning@byu.edu, AIAA Senior Member. 
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The use of multiple rotors operating in close proximity introduces complicated aerodynamic interactions 
that are not well understood, are not captured through conventional design tools, and need to be addressed 
in the conceptual design stage.'* 1+ For instance, Yoon et al.!° investigated the effects of separation 
distance between rotors on a quad tilt-rotor aircraft in hover, discovering a myriad of beneficial and counter- 
productive interactions between rotors and airframe. Using an unsteady detached-eddy Reynolds-average 
Navier-Stokes (RANS) solver, they simulated the rotors operating at a high Reynolds number with and 
without the aircraft body. As they varied the separation between rotors, they observed a drop in thrust as 
low as 4% at a separation 10% of the rotor diameter, and observed an ingestion of recirculating vorticity 
from above the rotors into the wake caused by rotor interactions. When the fuselage and wings were included 
in the simulation, the rotors were benefited from body-on-rotor interactions increasing the thrust by about 
3%. However, both wing and fuselage create a large download caused by the rotor downwash impinging 
on them, resulting in a net loss of vertical force of about 16% due to the accumulation of rotor-on-rotor, 
rotor-on-body, and body-on-rotor interactions. ‘These complicated dynamics between rotors and the aircraft 
body illustrate the need to better understand the aerodynamic interactions incountered in eVTOL aircraft 
and address them in the early stages of design. 

Coaxial and tandem rotor interactions have been studied for decades in the rotorcraft community, mean- 
while the side-by-side multirotor configuration has remained relatively unexplored. Recent experimental 
works!® 2° have studied the counter-rotating configuration where neighboring rotors spin in opposing di- 
rections (clockwise versus counterclockwise), however, no attention has been given to the co-rotating con- 
figuration where neighboring rotors spin in the same direction (both clockwise, or both counterclockwise). 
Also, these studies only considered the hovering case, meanwhile the effects of multirotor interactions in 
forward flight remain unexplored. Numerous studies of quadrotor and helicopter forward flight are found in 
the literature, but forward flight in a distributed propulsion eVTOL configuration differs from a quadrotor 
or helicopter configuration. In the latter, the flight velocity produces an edgewise or tilted flow since the 
rotors provide both the lift and thrust of the vehicle. In contrast, distributed propulsion eVTOL aircraft 
are typically wing-borne during cruise, and the axes of rotation of the rotors are roughly aligned with the 
direction of flight, with the velocity of the aircraft adding directly to the axial inflow and advance ratio 
of the rotors. Even though previous studies have shown that multirotor interactions could drop the thrust 
performance by only a small amount??'%?° (~4% relative to the isolated rotor), they also show that noise 
could be increased by as much as 3dB,!'*!° corresponding to an acoustic signature two times more intense, 
which is a critical factor to consider in the design of urban mobility aircraft. Experimental results indicate 
that this increase in noise is directly related to unsteady loading during aerodynamic interactions. Hence, ac- 
curately modeling these aerodynamics interactions will lay the groundwork for future aeroacoustic modeling 
and noise reduction during the conceptual design stage. 

In this study we will investigate the accuracy of the viscous Vortex Particle Method??:?? (VPM) in mod- 
eling rotor-on-rotor aerodynamic interactions in a side-by-side configuration as encountered in tilt-rotor, 
quadrotor, and distributed propulsion aircraft. The viscous VPM is a meshless method for solving the 
vorticity form of Navier-Stokes equations in a Lagrangian scheme. The VPM is a direct numerical simula- 
tion of vorticity-governed flows that accurately preserves vortical structures and is absent of the numerical 
dissipation associated to conventional mesh-based approaches like RANS and large-eddy simulation (LES). 




















Figure 2: Low-fidelity VPM simulation of eVTOL aircraft (left) and high-fidelity VPM simulation of multi- 
rotor wake mixing?’ (right). 
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This method has been shown to achieve a continuum of fidelity levels, from low fidelity apt for conceptual- 
design?* *! to high fidelity apt for research-quality computational fluid dynamics,°** °° while offering a con- 
siderable speed up over a RANS or LES approach. Fig. 2 shows a low-fidelity simulation of a full-aircraft 
configuration and a high-fidelity simulation of wake mixing of two counter-rotating rotors from previous 
work by the authors using VPM.?! In recent work, VPM has been coupled with a noise code based on the 
Ffowcs Williams-Hawkings equation, successfully predicting rotor acoustics associated to unsteady loading.*© 
Furthermore, the method has been shown to be CPU and GPU parallelizable, and scalable in heterogeneous 
systems for high performance computing.*’ °° Thus, the VPM approach has the potential to enable the use 
of mid/high-fidelity aeroacoustic models capturing multirotor interactions during conceptual design. 

In Section II, the theory of the VPM will be summarized and the propeller model will be presented. 
In Section III, validation of the individual rotor will be presented in both hovering and forward-flight con- 
figurations at both low and high Reynolds numbers. Finally, validation of the hovering multirotor will be 
presented in Section IV, followed by a detailed parametric study of rotor-to-rotor interactions during hover 
and forward flight, constructing response surfaces of thrust, torque, and propulsive efficiency as a function of 
operational parameters. The parameters will be tip-to-tip separation distance s, Reynolds number Rep (as- 
sociated with the RPM), advance ratio J (associated with the flight speed), and direction of rotation (counter 
or co-rotation). The response surfaces will be constructed from mid-fidelity VPM simulations sweeping in 
all four operational parameters, resulting in a total of 1152 simulations. 

















II. Modeling Methodology 


In this section, the theory of the viscous vortex particle method is summarized detailing the implemen- 
tation used in this study (Section II.A). A propeller model based in VPM and blade elements is presented in 
Section II.B, and the test configurations along with respective rotor geometries are described in Section II.C. 


II.A. Viscous Vortex Particle Method 
IT.A.1. Governing Equations 


A viscous, incompressible flow dominated by vorticity can be resolved by formulating the Navier-Stokes 
equations into their vorticity form. This is done by taking the curl over the original momentum equation 


1 
V x (Getta Vu) =Vx (~ivp + Vu) 


leading to an expression that is not dependent on the pressure field: 
Dw 
Dt 
The material derivative expressed in Eq. (1), and the material-conservative nature of the vorticity makes 


the w field especially fit for a Lagrangian description. In order to do so, the field is discretized into vortex 
particles?” as a radial basis function interpolation of basis ¢, and coefficients Le 


w(x,t) © > Tp (t)¢o(x — xp(t)). (2) 


— (w V)u + yV7wW. (1) 








Each particle represents a volume of fluid that travels with the local velocity as in Eq. (3), where x, is the 

position of the p-th particle. The coefficient [,,, termed vectorial circulation or vectorial vortex strength, is 

defined as T, = [| wdV and approximated as T, ~ w,vol,, where w, is the vorticity associated with the 
vol, 

p-th particle of volume vol,. Applying this particle discretization to Eq. (1), it is obtained that the vectorial 

circulation of each particle evolves as in Eq. (4), where the first right-hand-side term is the vortex stretching, 

while the second represents a scheme for modeling the viscous diffusion vV2w. 





d 

Sxp(t) = u(xp(t),t) (3) 
d d 

<T,(t) = (p(t) Vue (®),t) + ST p(t) (4) 
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IT.A.2. Velocity Kernel 


By the Helmholtz Decomposition Theorem, the incompressible velocity field u dominated by vorticity is 
defined by some vector potential field w as u(x,t) = V x (x,t). Then, from the definition of the vorticity 
field (w = V x u), we get the three-dimensional unbounded Poisson’s problem 


Ve = —W, 


which can also be rewritten using the particle approximation from Eq. (2) as 
V°a(x) © — »— PCa (X — Xp). 
p 


The Poisson’s problem yields that the velocity induced by the field of vortex particles is calculated as 


u(x) =) go (x —x,) K(x —x,) xP), (5) 


where gz is a normalized smoothing function calculated from the chosen basis function ¢,, and K (x) = 
ear is the singular Newtonian kernel resulting from the three-dimensional Green’s function of the un- 
bounded Poisson’s problem. In this study we implement the Gaussian basis function 


1 2 
bo (8) = ag exp ( - SEP). 





II.A.3. Fast Multipole Method 


The governing equations, Eqs. (3) and (4), coupled with the velocity kernel Eq. (5) require the calculation 
of all particle interactions with a computational complexity of order O(N?), where N is the number of 
particles. ‘This complexity is prohibitive since the number of particles readily escalates to the order of the 
hundreds of thousands in mid-fidelity simulations. In order to ease the computation in this study, the fast 
multipole method (FMM) is used for the computation of the governing equations. The FMM approximates 
pairwise interactions of particles through spherical harmonic functions,*?»*! reducing the original compu- 
tational complexity to a problem of order O(N). The computation of the vortex stretching is performed 
through an efficient complex-step derivative approximation as explained in previous work,?! implemented in 
a modified version of the open-source, parallelized FMM code ExaFMM.??:49 


IT.A.4. Viscous Diffusion 


In this study, the viscous diffusion term in Eq. (4) is solved through the core spreading method** coupled with 
the radial basis function (RBF) interpolation approach developed by Barba,?*:4° “8 which avoids the need 
for particle splitting while allowing the viscous VPM to be a truly meshless method. In the core spreading 
method, a Gaussian radial basis allows the viscous diffusion term in Eq. (1) to be solved by spreading the 
smoothing radius as 


T= Dn (6) 
however, this approach is a convergent approximation to the Navier-Stokes equations only for small core 
sizes.“4 This introduces the need to resize all cores to their initial size after they grow larger than an 
arbitrary threshold. A popular solution to this problem is the splitting of overgrown particles, but this 
increases the number of particles exponentially in time. Barba introduced an alternative method for core 
resizing by resetting all core sizes o to an initial value after a certain time, and estimating the new vectorial 
circulations [’,, that preserves the same vorticity field w through an RBF interpolation. This method not 
only takes care of the core-overgrowth problem, but also allows for spatial adaptation, and the RBF can be 
efficiently implemented in the FMM evaluation through the generalized minimal residual (GMRES) method. 
In our implementation, we have assumed a uniform core size throughout the particle field, allowing us to 
solve the RBF through the conjugate gradient method. 
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®@ Lifting surface particles 

® Trailing circulation 
Unsteady loading 
Blade element 





Figure 3: Blade and wake deployment in the propeller model. Particles are colored by their respective source 
of vorticity; arrows indicate the direction of vortex strength I,. 


II.B. Propeller Model 


In this study, each blade of the propeller is modeled as a rotary lifting surface where the physics of interest 
have been broken down into three aspects: load distribution, blade-induced velocity, and wake-induced 
velocity. The load distribution is calculated through two-dimensional blade elements, and is used for deriving 
the circulation along the lifting surface. In turn, the blade-induced velocity is obtained from the circulation 
distribution by embedding vortex particles along the surface that preserve such circulation, as shown in 
Fig. 3. As the blade moves, the trailing vorticity is shed off the trailing edge as free-particles from both the 
unsteady loading and shed circulation. ‘Thus, blade and wake-induced velocities are all computed through the 
FMM, allowing for efficient scaling of the fidelity of the simulation to the order of millions of particles, and 
all unsteady dynamics are resolved as the VPM steps in time. Fig. 4 shows the computation flow diagram 
of the simulation. 

In the simulation setup step (blue block in Fig. 4), the user specifies the rotor geometry and blades 
are discretized into elements. ‘Two-dimensional aerodynamic airfoil characteristics of each blade element are 
precomputed through XFOIL at the corresponding local Reynolds number, and a Prandtl-Glauert compress- 
ibility correction is applied to the lift curve, thus capturing both viscosity and compressibility effects. Both 
lift and drag curves are adjusted to capture three-dimensional drag and stall-delay effects*? encountered in 
rotor blades, and the Viterna method”? is applied to obtain post-stall +180° extrapolations of these curves. 

In the first time step of the simulation, the inflow at every blade element is calculated from freestream 
and rotational velocity and used for calculating the effective angle of attack (AOA) of the element. From 
the precalculated airfoil lift and drag curves, the effective angle of attack is used to look up the lift and drag 
at every element, and the load distribution of every blade is thus determined. In all subsequent time steps, 
the propellers are rotated and vortex particles are shed off trailing edges. ‘The Navier-Stokes equations are 
solved in their vorticity form (Eq. (1)) calculating the convection, stretching, and viscous diffusion of the 
vortex particles (Eqs. (3) and (4)). The velocity induced by the wake and lifting surfaces is then added to 
the freestream and rotational velocities in order to calculate the new effective angle of attack, and the load 
distribution is updated (gray block in Fig. 4). The process is then iterated until the end of the simulation, t, 
as shown in the green block. This algorithm is similar to the method developed by Jo and Lee,?*'3! except 
that we omit the minor iterations and let load distributions convergence in time as the wake is deployed. 
Also, we model the lifting surfaces through embedded vortex particles in order to perform all computation 
through the VPM, which allows us to scale the computation efficiently in order O(N) through the FMM. 

In a previous study?! we have shown that the VPM is able to fully characterize the evolution of the 
wake, from near field to far field and transition into turbulent breakdown. Furthermore, the VPM is a 
direct-numerical simulation approach, requiring an increased spatial resolution in the turbulent breakdown 
regime to ensure convergence. In order to ease computation in this study, the wake will be partially removed 
once in the breakdown regime as described in Alvarez and Ning.?! 
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INPUTS 
Blade geometry, 
number of rotors, 


separation distance, 
RPM, advance ratio 


SIMULATION 





Rotate rotors and shed wake 







PRECOMPUTATION BLADE ELEMENT 


Update x, and I, of particles 
from Eqs. (3) and (4) Calculate induced velocity 
(WPM) 


Discretize blades into elements 


2D airfoil properties (XFOIL) 


Update load distributions 


Calculate effective AOA 
3D airfoil corrections 


Look up c; and cy, 
from effective AOA 





OUTPUTS 


VPM wake, 
thrust, and torque 





Figure 4: Computation flow diagram of the propeller model. 


II.C. Test Cases 


In order to simulate both hovering and forward-flight multirotor configurations at both low and high Reynolds 
numbers, two different rotor geometries are used. The hovering case uses a quadcopter rotor matching the 
geometry of the DJI Phantom II rotor described by Zhou et al.1® and Ning,'? and the low-Reynolds 
simulation of side-by-side hovering rotors is compared to reported particle image velocimetry (PIV) and 
load cell measurements. ‘This rotor is 240mm in diameter, and its geometry approximates the optimum 
rotor design obtained by imposing a constant effective angle of attack along the blade. The geometry was 
digitized assuming the twist distribution 6(r) = 5.3°+6.3°+ (with r the non-dimensional radial position) and 
an uniform E63 airfoil shape transitioning to an E856 airfoil towards the hub. The chord and leading edge 
distributions are generated as described by Ning.!” All simulations are run at 4860 RPM and no freestream 
velocity, matching the experimental configuration resulting in a diameter-based Reynolds number Rep of 
6.5 x 10° and chord-based Reynolds of Re, of 6.2 x 10*. Rep and Re, are calculated at the 70% blade span 
as 





Vr9%D Vr9m%C 
Rep = 0% and Re, = EN 
y y 





where ¢ is the mean chord, and Vzoy is the effective speed resulting from both freestream and local rotational 
velocity at 70% the blade span. 

The forward-flight configuration uses the APC thin-electric 10x7 rotor operating at a high Reynolds 
number. This rotor is a hobby-grade model commonly used on small unmanned aircraft, it is readily 
available, and its 10-inch diameter makes it a good fit for mid-size wind tunnel testing. Moreover, its 
performance has been measured and indexed in the UIUC Propeller Database,°?! and verified by McCrink 
and Gregory” both experimentally and through blade-element momentum theory. The blade is generated 
assuming an uniform NACA 4412 airfoil shape, using the chord and twist distributions digitized by McCrink 
and Gregory. Note that forward flight in distributed-propulsion aircraft implies a fully axial inflow, instead 
of the edgewise flow encountered in rotorcraft. The performance of the individual rotor is validated against 
experimental data across a cD of advance ratio J at a constant Rep of 1.5 x 10° and Re, of 1.2 x 10°, 

3 Mes RPM 


where J is calculated as J = —s, with n = ~~. 
n 60 


The thrust coefficient C'r, torque coefficient Cg, and propulsive efficiency 7 hereby reported are defined 
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as 





T Q TV 6 
Cr = ——,, Co=-—-> ; 
. pn? D4 “ 
In all validation cases, temporal integration is performed in a third-order Runge-Kutta scheme at a time 
step equivalent to a 5° blade rotation; blades are discretized into 50 elements; and spatial resolution of the 
wake is dictated by imposing a smoothing radius twice the distance between contiguous particles shed off 
the blade tip, shedding two particles at every time step. Particles are shed all along the blade trailing edge. 


III. Single-rotor Validation 


Prior to multirotor modeling, the propeller model is first validated in a single-rotor configuration at both 
low and high Reynolds number. In this section the single rotor is simulated in both hover and forward flight, 
and results are compared to experimental measurements reported by Zhou et al.,'® Ning,'!? and McCrink 
and Gregory.°” 


III.A. Hover Case 


The hover configuration uses the DJI Phantom II rotor operating at a low Reynolds number (Re, = 6.2 x 10*) 
and constant 4860 RPM. The thrust coefficient Cr of the rotor throughtout the simulation is shown in 
Fig. 5 along with the experimental mean thrust and fluctuation. The wake starts to develop and convect 
downstream during the first two iterations, and the near field becomes fully developed after only three 
iterations as indicated by the convergence of thrust. The periodic spikes observed at six, ten, and fourteen 
revolutions correspond to every instance when the turbulent breakdown region has been trimmed. The wake 
after eighteen revolutions is shown in Fig. 6 (left), where features of the topology can be clearly identified: a 
well-defined structure in the near field (z < 0.5D), an onset of leapfrogging at z + 0.5D, and a transition into 
turbulent breakdown at z ¥ 1.0D. The simulation converges to a mean C'y of 0.1013 and a steady fluctuation 
(standard deviation) of 0.001, meanwhile Zhou et al.1® and Ning’? report an experimental mean C'p of 0.1007 
and a fluctuation of 0.008. This results in an error of only 0.5% between the simulation and experimental 
mean C'r, meanwhile the fluctuation is largely underpredicted. We conjecture that the fluctuations observed 
in the experimental measurements of the individual rotor are caused by interactions with the test stand not 
captured in our simulations (e.g., mounting pole, hub, and mechanical vibrations). 

Fig. 7 compares the ensemble-average velocity in the single rotor (left figures) observed experimentally 
and predicted at a plane located a distance 0.1D downstream. ‘The simulation shows a streamtube slightly 
more contracted than what is observed experimentally. Fig. 8 shows the in-plane vorticity of both the 
experimental PIV and the simulated single rotor (left figures). The vorticity distribution is obtained at a 

















Thrust coefficient Cr 





0 2 4 6 8 10 12 14 16 18 20 
Number of revolutions 


Figure 5: Convergence of thrust coefficient Cy (blue dots) in hovering single-rotor simulation. Solid black 
line corresponds to the experimental mean thrust and dashed lines enclose the standard deviation as reported 
in Zhou et al.!® and Ning.'? 
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Figure 6: Wake of single-rotor and multirotor hover simulations. Arrows scaled by the vortex strength of 
every particle. 


phase-locked angle of 120°, where the phase is defined as the angle between the vertical yz-plane and the 
position of the rotor blade. It can be seen that the VPM simulation correctly predicts an onset of tip-vortex 
leapfrogging at a downstream distance of about z = 0.5D. Thus, the propeller model seems to captures all 
the physics of interest at low Reynolds number in a hovering single rotor configuration. 


IIIl.B. Forward-flight Case 


The forward-flight configuration uses the APC 10x7 rotor operating at a high Reynolds number (constant Re, 
of 1.2 x 10°) across a sweep of advance ratio J. Fig. 9 compares the predicted thrust, torque, and propulsive 
efficiency to the experimental values reported by McCrink and Gregory.°? It is observed that both Cr 
and Cg match the experimental values at low and moderate advance ratios, meanwhile they start to drop 
earlier than what is seen experimentally at high advance ratios. ‘This is also observed in the blade-element 
momentum predictions by McCrink and Gregory, which hints that this may be an issue with the geometry 
description (7.e., inaccurate airfoil shape or twist distribution differing from the actual rotor) rather than the 
modeling method. The discrepancies in C’r and Cg cancel out as the propulsive efficiency 7 is calculated, 
resulting in a good agreement across all advance ratios up to J & 0.725. Thus, the VPM propeller model 
seems to be valid at high Reynolds number across low and moderately-high advance ratios in forward-flight 
single-rotor configuration. 


IV. Maultirotor Results 


In this section, the hovering multirotor case is compared to the experimental measurements reported by 
Zhou et al.'® and Ning,!® validating the VPM multirotor model. Finally, a parametric study of rotor-on-rotor 
interactions across all advance ratios and Reynolds numbers is presented. 


IV.A. Hover Case 


The hovering single-rotor case of Section III.A is now extended to multirotor configuration by adding an 
identical rotor in counter-rotation at a tip-to-tip distance s of 0.05D. Counter-rotation means that one 
propeller rotates clockwise, meanwhile the other rotates counter-clockwise. The simulation shows that the 
multirotor wake mixing onsets a topological instability that moves the turbulent-breakdown region closer 
to the plane of rotation, as shown in Fig. 6 (right). Fig. 8 compares the in-plane vorticity field between 
single-rotor and multirotor configuration. Both simulation and experiment show that tip vortices lose their 
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Figure 7: Velocity field at plane z = 0.1D of single rotor (left) and multirotor (right, s = 0.05D) in hover 
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configuration. Top row shows the experimental ensemble average (retrieved from Zhou et al.*>), middle row 
shows the simulation ensemble average, and bottom row shows the simulation phase-locked field at 120° (see 
blade positions). Colormap corresponds to axial velocity, while arrows indicate swirl velocity. 
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Figure 9: Comparison between simulation and experimental thrust, torque, and propulsive efficiency in 
forward-flight single-rotor configuration. 
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structure as wakes mix in-between the rotors (x = —0.5D region, compare left and right figures). 

Next, the separation distance s was varied to determine its effects on thrust. Fig. 10 shows the mean 
thrust and fluctuations compared to what Zhou et al.!® observed experimentally, normalized by the values 
obtained in the single-rotor configuration. Every time that blades pass through the region of wake mixing, the 
loading of the blade drops, decreasing the mean thrust and increasing the fluctuation, which is accentuated 
as the tip-to-tip distance becomes small. It is especially important that this unsteady loading is accurately 
captured since the noise augmentation observed experimentally stems from these fluctuations. Here we see 
that the VPM propeller model is able to capture these rotor-on-rotor interactions with reasonable accuracy*, 
making it a good candidate for a future application in noise prediction. 

Zhou et al. observed an accentuated upwash and a recirculation region in the near field between the rotors 
(shown in Fig. 7, top right), and conjectured that the flow separates along the blades every time they pass 
through this region, leading to the observed drop in blade loading. The simulation (Fig. 7, middle right) 
shows the same accentuated upwash observed experimentally, however the recirculation region is absent. The 
phase-locked field (Fig. 7, bottom right) shows a more accurate topology, yet no recirculation is seen. Since 
the drop in thrust shown in Fig. 10 agrees well between simulation and experiment in spite of not capturing 
such recirculation region, we conclude that the effects of the recirculation pocket are negligible and that the 
induced upwash is the main contributor to the drop in blade loading. 




















IV.B. Hover and Forward-flight Parametric Study 


Previous studies on side-by-side multirotor interactions available in the literature have only explored the 
hovering configuration since fully characterizing the response surface beyond a hovering configuration poses 
a great challenge due to its multidimensionality. Leveraging the computational speed of the VPM, we have 
swept across all ranges of advance ratio J, Reynolds number Rep, separation distance s, and direction of 
rotation (counter and co-rotation). This resulted in a total of 1152 simulations? requiring a wall-clock time 
of 4 days in a desktop computer Intel(R) Xeon(R) CPU E5-2699 v3 @ 2.30GHz. 

Fig. 11 shows the response surface of propulsive efficiency 7 and thrust fluctuation (standard deviation 
of Cr) in counter-rotating APC 10x7 rotors at low, moderate, and high Reynolds numbers. Efficiency values 
have been normalized by the corresponding value obtained in single-rotor configuration. The solid surfaces 
correspond to radial basis function interpolations fitted to data points, whereas each data point corresponds 
to one VPM simulation. In the efficiency response surface (left figures), it is observed that multirotor 
interactions are detrimental across all advance ratios and Reynolds numbers, dropping the performance as 
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Figure 10: Effects of rotor-on-rotor interactions on thrust as the separation between rotors is decreased, 
normalized by their respective values in the single-rotor configuration. 








“Fluctuations are underpredicted as the experimental measurements include interactions inherent to the experimental setup 
(mounting pole and hub) not captured in the simulation, as mentioned in Section II].A. However, simulation and experiment 
show good agreement after normalizing both by their respective single-rotor values. 

>These are mid-fidelity simulations with 25 elements per blade and 18 time steps per revolution, shedding four particles 
per step with a smoothing core overlap twice the distance between contiguous particles, simulating a total of six revolutions in 
moderate/high advance-ratio cases, and 13 revolutions in low advance ratio cases (J < 0.2). 
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Figure 11: Response surfaces of propulsive efficiency (normalized by corresponding single-rotor value) and 
thrust fluctuation (standard deviation) in counter-rotating APC 10x7 multirotor. Solid surfaces correspond 


to radial basis function interpolations fitted to data points (circles). Each data point corresponds to one 
VPM simulation. 
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the distance between rotors become small. Furthermore, this drop in performance becomes more accentuated 
at low advance ratios. This is better observed in Fig. 12, which shows slices of the response surface at multiple 
Reynolds numbers at s = 0.05D, which evidences a trend of decreasing performance as the advance ratio 
decreases. Another observation from Figs. 11 and 12 is that a higher Reynolds number accentuates the 
negative interactions. The opposite was observed in an experimental study by Shukla et al.,4® +!” which may 
indicate that this trend on Reynolds number is dependent on the specific rotor design being studied (the 
APC 10x7 is designed for high-Reynolds operation with a loading distribution gradually decreasing towards 
the blade tip, whereas Shukla et al. used an untapered, untwisted blade highly loaded at the tip, designed 
for low-Reynolds testing and strong tip vortices). 

The response surface of thrust fluctuation (Fig. 11, right figures) shows that fluctuations are accentuated 
at hover and near-hover configurations (J < 0.2) across all Reynolds numbers . This means that hover and 
near-hover can be expected to be the configurations of greatest noise associated to unsteady loading. It 
is also observed that Reynolds number has the effect of increasing thrust fluctuations accordingly. ‘This is 
explained by the fact that a higher Reynolds number correlates to a higher RPM, which leads to an increased 
blade loading and stronger blade-tip vortices that accentuate rotor-on-rotor interactions in the APC 10x’. 
All trends already mentioned are consistent between counter and co-rotating configurations. Finally, Fig. 13 
compares the effects of rotation direction (counter or co-rotation) on propulsive efficiency across Reynolds 
numbers and advance ratios at s = 0.05D, showing that a counter-rotating configuration (e.g., one propeller 
rotating clockwise while the other counter-clockwise) has a slight penalty over co-rotation at moderate and 
high advance ratios (J > 0.25) that is accentuated at a high Reynolds number, droping the efficiency as 
much as 0.5% lower than the co-rotating configuration at Rep = 1.5 x 10°. 





V. Conclusion 


In this study we have shown the capability of the viscous vortex particle method to capture multirotor 
aerodynamic interactions in both hover and forward-flight configuration as encountered in distributed propul- 
sion eVTOL aircraft at both low and high Reynolds numbers. Single-rotor validation has been presented, 
showing an error of only 0.5% between experimental and predicted thrust coefficient in hover, and good 
agreement on predicted propulsive efficiency across all advance ratios in forward flight configuration. Wake 
dynamics in multirotor hovering configuration showed a good qualitative agreement between simulation and 
particle-image velocimetry measurements. ‘The simulation indicates that the accentuated induced upwash 
in between the rotors is the main contributor to the drop in blade loading during rotor-on-rotor interaction. 
The multirotor VPM model was shown to capture the thrust drop observed experimentally as the tip-to-tip 
distance between rotors became small, along with the associated thrust fluctuation. 

A parametric study of rotor-on-rotor interactions during hover and forward flight in the APC 10x7 rotor 
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revealed the following: 


e Multirotor interactions are detrimental across all advance ratios and Reynolds numbers, dropping the 
propulsive efficiency as the distance between rotors becomes small. 


e The performance drop becomes accentuated at low advance ratios. 


e The largest thrust fluctuations (and potentially highest noise signature) are encountered in hover and 
near-hover configurations. 


e At moderate and high advance ratios (J > 0.25), a counter-rotating configuration is slightly less 
efficient than a co-rotating one. 


e A higher Reynolds number accentuates the interactions encountered in a rotor with lightly loaded 
blade tips, as is the APC 10x7. 


These tentative conclusions require further verification with higher-fidelity simulations reducing numerical 
uncertainty along with more exhaustive experimental validation, which will be performed in future work. 

The aerodynamic validation of the VPM multirotor model hereby presented lays the groundwork for 
the future development of an aeroacoustic method for predicting the noise signature associated to unsteady 
loading in multirotor configurations, with the potential of integrating these analyses in the conceptual design 
stage of eVTOL aircraft. 
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Evolutionary Optimization of a Morphing Wing with Wind 
Tunnel Hardware-in-the-Loop 
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Active wing morphing is a fertile technology in the realm of unmanned and micro air 
vehicles: low flight loads and flexible wing materials allow substantial deformation with 
relatively low actuation power. Proper shape management of these wings for maximum 
performance is challenging: numerical optimization methods may struggle to accurately 
predict the three-dimensional flow separation, transition, and reattachment expected at the 
low operating Reynolds numbers. Conventional experimental optimization techniques (a 
response surface fit through a designed experiment) may be unable to represent the 
discontinuous objective functions commonly seen in aerodynamic applications, and the 
number of function evaluations can become overwhelming with high-fidelity actuation 
mechanisms. This paper outlines an effective experimental technique for optimizing a 
morphing airfoil: a genetic algorithm with wind tunnel hardware (strain gage sting balance) 
in the loop. Optimal wing shapes are found to maximize the measured lift or efficiency 
through a sweep of angles of attack, for a wing with a single actuation point (camber control) 
and two actuation points (camber and reflex control). Aerodynamic and electrical hysteresis 
limits the repeatability of sting balance measurements, complicating the convergence of the 
genetic algorithm, but not preventing adequate location of optimal designs. 


Nomenclature 
01 = angle of attack 
C = wing chord 
Cp = drag coefficient 
CL = lift coefficient 
L/D = lift-to-drag ratio 
sp = servo position 
Oe = free-stream velocity 
X, Z = chordwise and vertical positions 


I. Introduction 


APID advancements in actuator and material technologies have enabled the use of practical wing morphing 

platforms: aircraft are no longer confined to fixed wing geometries that can only suit a single flight regime. 
Such a capability allows for optimal wing shapes for a disparate array of flight maneuvers. Active wing morphing is 
a biologically-inspired concept: birds constantly change there wing shape to successfully undertake glides, descents, 
and aggressive maneuvers’. Along this analogy, a versatile morphing aircraft can achieve high lift for maximum 
payload carrying capacity, high speed to quickly move from waypoint to waypoint, or high efficiency during cruise 
flight*. Utilizing a morphing structure with asymmetric deformations can provide highly agile maneuvering through 
complex urban environments. Such a vehicle can also maintain stability through the most volatile of flight 
conditions, such as erratic and gusty airflow among buildings and structures. Active morphing is a particularly 
attractive option for unmanned and micro air vehicles (UAVs and MAVs): low flight loads and highly flexible wing 
structures allow a relatively small amount of actuation energy to inflict a substantial shape change over the wing. 
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Furthermore, poor efficiency characteristics (as necessitated by the low Reynolds numbers inhabited by such 
vehicles: 10° - 10°) and strenuous weight management challenges for static stability dictates a need for the 
benevolent aerodynamics that can be provided by active morphing. 

With the ever-increasing complexity of morphing mechanisms comes the challenge of proper shape 
management to achieve desirable, real-time flight characteristics. The work presented in this paper is one approach 
to such a challenge. A closed-loop wind tunnel with a strain-gage sting balance is used as a hardware-in-the-loop 
objective function for optimizing a multiple design variable morphing wing structure. The optimal airfoil shape 
(found by varying the camber and/or reflex) is located to maximize lift or efficiency for a range of pre-stall angles of 
attack, using a conventional evolutionary genetic algorithm (GA). Such a strategy must be thought of as a more 
efficient means of experimental optimization (when compared to conventional response surface-based techniques) 
as the number of design variables increases. While only two are considered in this work (actuation at two locations 
along the wing root), the general technique is successfully demonstrated, permitting future research in a larger 
design space. Furthermore, the complex and poorly understood flow fields about low aspect ratio MAV wings at 
low Reynolds numbers also limits the usefulness of conventional numerical optimization schemes. 

Several examples of efficient wing morphing mechanisms can be found in the literature. Abdulrahim et al.° 
demonstrate the use of a set of torque rods imbedded in a membrane micro air vehicle wing: actuation of the rods 
asymmetrically twists the wing for roll control. Similar mechanisms were used to control the Wright Flyer, with 
active wing twisting facilitated by a series of cables. Higher fidelity methods can also be considered. The 
electroactive polymer actuator* (EAP) is similar to muscle tissue with regard to stress and force output, and is 
capable of large-scale linear motion. Macro fiber composites (MFC) are a thin compliant actuator constructed of 
orthogonal layers of uni-directional piezoceramic fibers and copper electrodes encased in layers of acrylic and 
Kapton’. Strelec et al.° discuss the use of shape memory alloys (SMA) for wing morphing, noting a specific need to 
prevent variations in temperature for successful implementation. 

Early numerical airfoil optimization work is found in the work of Hicks et al.’, who use a gradient-based 
feasible direction method to optimize low drag transonic airfoils. Further gradient-based applications are presented 
by Lian et al.*, who conduct shape optimization of a low aspect ratio micro air vehicle wing. A sequential quadratic 
programming optimizer is used in conjunction with a Navier-Stokes solver to maximize the lift-to-drag ratio by 
varying a series of control points over the wing, with constraints on the wing shape’s convexity. Obayashi and 
Tsukahara’ discuss the use of three distinct schemes for airfoil optimization: gradient-based, simulated annealing, 
and evolutionary genetic algorithms. The genetic algorithm is found to be superior, despite requiring a larger 
number of function evaluations. This is thought to be due to a smaller dependence on the initial design, as well as 
the presence of potentially discontinuous aerodynamic objective functions. A genetic algorithm is implemented by 
Namgoong et al.'° for airfoil optimization with strain energy (for the actuators to morph from one shape to the next) 
as an objective function, along with lift and drag. XFOIL is used to construct a Pareto trade-off curve between 
aerodynamic and actuator cost. Higher fidelity genetic algorithm optimizations are given by Naujoks et al.'', who 
use a turbulent Navier-Stokes solver. Further applications of numerical airfoil optimization falls under the guise of 
inverse design methods: varying the airfoil shape to minimize the difference between computed pressure 
distributions and known, favorable distributions’”. Gopalarathnam and Selig'’ conduct such an inverse design for 
low speed airfoils, providing insight into the relationship between thickness and efficiency at low Reynolds 
numbers. 

Experimental optimization with hardware-in-the-loop is a relatively rare undertaking: non-aeronautical 
examples include the work of Peng et al.'*, who analyze a flat rectangular membrane via three tension forces applied 
to each edge. The deformed profile of the membrane is optically measured, and fed into a genetic algorithm to find 
the combination of forces that will minimize the wrinkling amplitude. Hemker et al.'° optimize walking speed and 
stability with a humanoid robot hardware-in-the-loop, using sequential surrogate quadratic programming with 
stochastic approximations to the objective functions. Aeronautical applications are given by Hunt et al.'®, who use a 
genetic algorithm to optimize the forward velocity and motor efficiency of a tethered ornithopter, with flapping rate 
and tail position as design variables. The authors report several unexpected optimal configurations. Finally, the 
work of Levinsky and Palko'’ is of most interest to the current research paradigm: shape optimization of a three- 
dimensional flexible model of a F-111 TACT aircraft wing in the wind tunnel at both low and transonic speeds. 
Twelve actuator locations are used over the wing for both span-wise and chord-wise morphing, for constrained 
minimization of drag using a gradient-projection method. Reported difficulties include: wing shape repeatability, 
wing degradation, drag measurement repeatability and quality, and enforcement of actuator displacement 
constraints. 

The remainder of this work is outlined as follows. A complete description of the experimental hardware and 
set-up will be given, followed by an account of the numerical evolutionary genetic algorithm, and a visual image 
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correlation system used to measure the wing shapes. A low-fidelity morphing wing with a single actuator is built to 
estimate the potentially-detrimental effects of experimental hysteresis, and to validate the optimal designs (through a 
comparison with conventional response surface optimization). Finally, a higher-fidelity morphing wing is used to 
determine shapes with optimal lift and efficiency, for an array of flight conditions. 


Il. Hardware-in-the-Loop Apparatus and Procedure 


There are four primary subsystems utilized in the experimental setup. These include the optimization software, 
the servo controller, the wind tunnel, and the data acquisition software. Through the use of simple text-format 
software, data 1s passed between each of these subsystems. This cooperative scheme of utilizing previously 
independent software programs provides the means for automating the optimization procedure. A visual image 
correlation system is used independently during testing to determine the three-dimensional wing shapes of 
noteworthy servo positions. 


A. Wind Tunnel 

The test facility used for the entirety of this work is an Engineering Laboratory Design 407B closed loop wind 
tunnel, with the flow loop arranged in a horizontal configuration. The test section has an inner dimension of 0.84 m 
on each side, and is 2.44 m long. The velocity range is between 2 and 45 m/s (maximum Re of 2.7 million), 
provided by a 250 HP motor and a 2-stage axial fan. Centerline turbulence levels have been measured on the order 
of 0.2%. Optical access is available on the sidewalls and the ceiling. A Heise model PM differential pressure 
transducer is attached to a pitot-static tube located at the center of the section’s entrance, while a four-wire RTD 
mounted to the wall of the test section measures the airflow temperature. 


B. Strain Gage Sting Balance 

An Aerolab 01-15 6-component strain gage sting balance is used to measure the aerodynamic forces and 
moments of the wind tunnel models. Each of the six channels is in a full Wheatstone-bridge configuration, with 5 
channels dedicated to force, and 1 to a moment. Data acquisition is done with a NI SCXI 1520 8 channel 
programmable strain gage module with full bridge configuration, 2.5 excitation volts, and a gain of 1000. A NI 
6052 DAQ PAD firewire provides A/D conversion, multiplexing, and the PC connection. For a given flight 
condition, the output signals from the six components are sampled at 1000 Hz for 2 seconds. The average of this 
data is sent to a LabVIEW-based module for the calculation of the relevant aerodynamic coefficients. The sting 
balance is mounted to a custom-fabricated aluminum model arm within the test section. The arm extends through a 
hole in the section wall, and is then attached to a gearbox and a brushless servomotor system for angle of attack 
control (rates on the order of 1 °/s). Tunnel speed, model inclination, and force/moment measurements are 
set/acquired using a dedicated PC and in-house LabVIEW-based software. A schematic of the wind tunnel setup 
can be seen in Figure 1. 


C. Servo Controller 

Morphing actuation is provided by a set of Futaba $3102 servo actuators. These servos supply 4.6 kg-cm of 
torque at a rate of 300 °/s, with a complete range of 120°. Using MATLAB, the position of each servo is 
commanded through a serial port in a 1200 baud rate, 8 bit, 1 stop, no-parity form. A sub-module converts the 
RS232 signal to transistor-to-transistor logic. A custom servo controller on board the morphing wing wind tunnel 
model receives the signal. The controller features 8 kilobytes of flash memory, a 16 bit high-resolution servo timer, 
and an 8 MHz ATMEL microprocessor. This servo controller is able to set and hold 250 different commanded 
servo positions while the wing undergoes aerodynamic loading. 


D. Genetic Algorithm 

Airfoil optimization is provided by a genetic algorithm, a stochastic evolutionary algorithm that mimics certain 
aspects of natural selection observed in biology'®. Distinct morphing wing shapes are grouped together in a 
population: the initial populations can be either randomly or uniformly distributed over the design space. The 
performance (fitness) of each design is then evaluated (using the wind tunnel/sting balance hardware). A stochastic 
process is then used to select the designs that will be used to breed the next generation: designs with higher fitness 
are more likely to be selected. A certain portion of the fittest designs are directly copied to the next generation 
(elitism), while the rest of the selected designs are paired off for crossover. A child design is created from two 
parents, and typically contains many of the traits of both (reproduction). Mutation is occasionally employed to 
promote diversity within the population, by randomly switching a portion of a design from one generation to the 
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next. While the relatively expensive genetic algorithm requires a large number of function evaluations (as compared 
to gradient-based methods), it is employed here for aforementioned reasons: less dependence on the initial design, 
and a smaller propensity to converge on a local optimum’. A third reason comes from purely experimental 
considerations: sensitivities would have to be computed from a finite difference equation, an entity that can be 
greatly distorted by even minor amounts of error or noise. 

MATLAB’s genetic algorithm toolbox’”’ is employed for the current work. Morphing wing designs are 
described by N integers (each integer ranges from 1 to 250, corresponding to the extremes of the servo positions), 
where N is the number of servos used to morph the wing. Several different population sizes are used, from 10 to 25. 
The elitism count is set to 2, reproduction is via a two-point crossover function with a 0.8 crossover fraction, and a 
uniform mutation function 1s used with a 0.01 mutation rate. No specific convergence criterion is utilized: when 
several generations pass with little change in the population’s designs, the process is manually terminated. Less than 
30 generations are typically adequate. 
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Figure 1. Schematic of wind tunnel setup. 


E. Optimization Procedure 

The two primary software packages used for this study are LabVIEW (for wind tunnel control and data 
acquisition) and MATLAB (for wing morphing control and optimization). A flow chart of how the information is 
passed between the programs is shown in Figure 2. To initiate the optimization procedure, an initial population is 
prescribed to the genetic algorithm. The first member of the initial population is then sent to a subroutine which 
commands the servo controller to morph the wing. This servo positions is simultaneously sent to a text file. The 
genetic algorithm then waits for the data associated with the servo positions (design fitness) to be added to the text 
file: this is done by monitoring the text file size (checking every 0.2 seconds). While the optimization software is on 
hold for data, the wind tunnel is brought up to the required velocity. After the angle of attack is set, the force and 
moment data are determined from the sting balance. The text file previously created by the optimization software is 
accessed and this data is appended to the same line. At this point, the data acquisition software waits for the next set 
of servo positions to be added to the text file. As with the optimization software, file size 1s constantly monitored. 
When the new servo positions are added to the text file, the file size will increase, thereby triggering the data 
acquisition software for the next set of data. After the force and moment data have been added to the text file, the 
optimization software identifies the increase in file size, and then accesses the data. The next servo positions are 
commanded, thereby beginning the cycle again. 
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As with any sting balance, signal drift of the strain gages is a concern, due to both environment and self-heating. 
To manage this signal drift, the wind tunnel software is set to count the number of data sets collected. After a 
prescribed number, the tunnel velocity is set to zero and the model arm is rotated to the original angle of attack 
(typically zero). At this point, the data acquisition software performs an offset nullification of the strain gage 
voltages. Then the tunnel is brought back up to the velocity and angle of attack required by the experiment. This 
simple procedure helps to ensure data quality; the prescribed number of data sets is conveniently set to coincide with 
the size of the genetic algorithm’s population. 
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Figure 2. Flow chart for hardware-in-the-loop optimization. 


F. Visual Image Correlation 

Visual image correlation (VIC) is a non-contacting full-field measurement technique’ used to measure the 
shape of the morphing wind tunnel models under aerodynamic loading. The system is only used to quantitatively 
assess the optimal (or otherwise noteworthy) wing shapes, and is not included in the optimization process. The 
underlying principle of the VIC system is to calculate the shape of a test specimen by digitally acquiring a random 
speckling pattern applied to the surface. This is done with twin synchronized pre-calibrated cameras installed over 
the wind tunnel, as seen in Figure 1. The VIC system can also measure the displacement field of a structure (useful 
in measuring aeroelastic deformation of the flexible wings) by tracking a subset of the speckling pattern from one 
image to the next: temporal matching. For the imaging parameters, wing shapes, and speckling patterns used in the 
current work, the displacement/shape resolution is estimated to be 0.05 mm. 


Hil. One-Design-Variable Morphing Wing 


A. Model Description 

In order to validate the data collection procedures, a morphing wing with a single design variable is utilized. 
The rectangular planform wing is fabricated from a single layer of plain weave carbon fiber reinforced plastic (with 
fibers aligned in the +45° directions), with horizontally aligned uni-directional carbon fiber strips that run from 
wingtip to wingtip spaced every 2.5 cm from the leading edge (as seen in Figure 3). These strips transfer loads such 
that morphing the wing shape at the root effectively extrudes this same shape up to the wingtip. The root chord of 
the rectangular wing is 19.5 cm, the wing span is 25.5 cm, the aspect ratio is 1.3, and the wing area is 497 cm’. At 
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U., = 16 m/s, the Reynolds number at the root is 200,000. The random speckling pattern applied to the suction 
surface of the wing can also be seen in Figure 3, to facilitate measuring the wing shape/deformation with the VIC 
system. 
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Figure 3. One-design-variable morphing rectangular wing. 


Actuation is provided with a single morphing servo; the corresponding control horn is attached to the root at x/c 
= (0.28, and is largely constrained to vertical cambering motion. The wing shape is constrained through two 
attachment points: one at the leading edge (which permits just rotation) and a second at x/c = 0.65 (which permits 
rotation and some chord-wise motion). The servo and the constraint linkages are attached to a mounting platform 
located below the wing at z/c = -0.2. A sting balance attachment point is also built into the mounting bracket. The 
full range of actuation will include significant changes in the wing’s angle of incidence due to the unconstrained 
trailing edge (which will also result in an aeroelastic nose-down geometric twist for adaptive washout). In order to 
facilitate a comparison of aerodynamic performance of different wing shapes at the same flight condition, the angle 
of attack 1s measured with respect to the rigid mounting platform. 

The capabilities of the full-field VIC system are demonstrated in Figure 4: the chord-normalized wing shape 
contour 1s plotted, along with airfoil sections at the wing root and the wing tip. It can be seen that the span-wise 
stiffeners (carbon fiber strips) built into the wing effectively transfer the shape from the actuation points to the wing 
tip. No change in the camber is noted, though minor aerodynamic twist can be seen at the leading and trailing edges 
of the wing root (less than 1% of the chord). Utilizing the VIC system, 9 servo positions are selected and 
photographed to determine the range of attainable wind-off wing shapes. The morphing mechanism is capable of 
altering the camber from -7% to 19%: normalized root airfoils are plotted in Figure 5. The location of this camber 1s 
variable as well: decreasing the servo position from 250 to 90 shifts the camber location from x/c = 0.32 to 0.45. A 
further decrease in servo position replaces the camber location back to 0.32. Passive aeroelastic effects are 
documented in Figure 6, for the two extremes of the wing motion (servo positions | and 250) at a moderate angle of 
attack. Minimal passive deformation 1s observed for the negatively-cambered wing, though a slight de-cambering 1s 
observed for the other extreme: the servo is unable to completely hold the wing shape under load, as the camber 
decreases from 19.1% to 18.4%. The aforementioned washout can also be seen, decreasing the incidence at the root 
by 4.3°. 
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Figure 4. Measured wind-off wing surface (z/c) at sp 250, with sections plotted at the root and the tip. 
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Figure 5. Measured wind-off root airfoils of the one-design-variable morphing wing model. 
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Figure 6. Passive aeroelastic wing deformation at U,, = 16 m/s, a = 10°. 


B. Hysteresis Assessment 

Several potential pitfalls exist in the wind tunnel testing of the wing shown in Figure 3. The first concern is the 
electrical hysteresis/resolution/drift of the strain gage sting balance*’. The forces generated by a micro air vehicle 
may not be more than a few percent of the maximum measurable load of each channel. The lowest measured drag 
force is 0.2 N (invariably measured from the wing when morphed to a flat plate at 0° angle of attack): such a force 
will correspond to microvolts across the strain gages, which introduces a challenging data processing and signal 
conditioning problem. A second concern is the repeatability of the shapes obtained from the morphing servos, a 
complaint reported in the work of Levinsky and Palko'’. The final and most important concern is a viscous effect: 
Mueller” shows that the location of the laminar flow separation, transition, and turbulent reattachment over a 
Lissaman 7769 and a Milet M06-13-128 airfoil is prone to aerodynamic hysteresis between Reynolds numbers of 
60,000 and 300,000 (a range which includes the current application). Differing lift and drag curves are measured 
depending on whether the angle of attack is decreasing or increasing, and may also be a problem with a low 
Reynolds number airfoil that semi-randomly changes its shape. 

In an effort to determine the effect, if any, of the above factors, the following test is conducted. Five servo 
positions are used to represent the extremes and intermediate positions within the range of motion. While holding 
the angle of attack and airspeed constant, ten consecutive data sets are collected for each of the five servo positions. 
For each position, this is done without nulling the sensors or changing the shape between measurements: the results 
should reflect the sting balance drift and resolution, not aerodynamic hysteresis. Data is then collected as the servo 
position is randomly altered among the five pre-selected positions. This data can then be sorted into common servo 
positions for comparison with the previous test: measured coefficients of variation are given for Cy (Table 1) and Cp 
(Table 2) in the low-drag, high-lift, and near-stall flight regimes. 
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Table 1. Measured scatter in the coefficient of lift, U,, = 16 m/s. 


a, = 0° a = 10° a = 22.5° 

COV (%) COV(%) COV (%) 
SP 1 Consecutive 0.14 0.32 1.79 
Random 1.36 2.20 2.28 
SP 125 Consecutive 0.34 0.21 0.25 
Random 1.17 0.98 1.45 
SP 250 Consecutive 0.19 0.38 0.30 
Random 0.83 0.25 1.36 


Table 2. Measured scatter in the coefficient of drag, U,, = 16 m/s. 


a, = 0° a = 10° a = 22.5° 

COV (%) COV(%) COV (%) 
SP 1 Consecutive 3.07 3.40 1.90 
Random 2.63 2.29 1.67 
SP 125 Consecutive 0.56 1.85 0.81 
Random 0.87 1.22 1.26 
SP 250 Consecutive 1.29 1.05 0.39 
Random 1.47 0.69 1.24 


The experimental scatter in both signals is acceptable: less than 2.3% COV in lift and less than 3.4% in drag. 
Measurement errors and repeatability are expected to be worse for the drag, as the relatively small forces reside 
closer to the sting balance’s resolution. Drag forces will also be more susceptible to the viscous flow hysteresis 
discussed above. Furthermore, only the lift force shows a distinct trend between the two measurement methods (an 
order of magnitude increase in scatter when the servo position is randomly altered); no such pattern emerges from 
the drag data. The largest coefficients of variation are typically measured from the negatively-cambered airfoil at 
servo position | (as seen in Figure 5). Significant leading edge flow separation can be expected even at low and 
moderate angles of attack. In addition to the turbulent reattachment issues, unsteady leading edge vortex shedding 
may distort what is essentially a static sting balance measurement. 


C. Alpha-Sweep Results 

The lift coefficient (Figure 7), drag coefficient (Figure 8), and the lift-to-drag ratio (Figure 9) are given as a 
function of angle of attack for the same nine servo positions that populate Figure 5. As expected, altering the 
camber has no significant effect on the pre-stall lift slope: the average C,,, is 0.047, characteristically shallow for the 
low aspect ratio wing. An increase in the slope is seen for the aggressively-cambered airfoils (up to 0.064), possibly 
due to the nonlinear vortex lift generated by the tip vortex swirling system. Otherwise, the lift curve is largely 
shifted up or down, with the zero-lift angle of attack ranging from 5.1° to -10.6° (extrapolated). The stalling angle 
and the maximum C, progressively increase with actuation up to 12% camber, after which stall is very benign and 
poorly defined. This latter effect may be due in part to the adaptive washout shown in Figure 6, a mechanism 
known to cause such behavior®. Distinct trends are harder to discern for the drag data of Figure 8. Airfoil cambers 
between -5% and 5% exhibit similar drag performance at small and moderate angles of attack (though a clear trend 
develops closer to stall, as exaggerated camber increases drag). The wing morphed to servo position 60 (-2% 
camber) appears to show the “drag bucket” commonly associated with transitional flows over low Reynolds number 
airfoils’. 

The trends in the lift-to-drag ratio (Figure 9) as a function of camber are similarly difficult to predict. Highly- 
cambered airfoils are preferred at negative and post-stall angles of attack, while moderate camber (5%) is optimal 
for most of the range in between. The exception is at 7.5° and 10° angles of attack, where the slightly negatively- 
cambered airfoil at servo position 60 is measured with the highest efficiency; this performance is certainly drag- 
based (rather than lift), predominately due to the aforementioned bucket. The rapid changes in drag between servo 
positions 30, 60, and 90 indicates the superiority of a evolutionary genetic algorithm over a gradient-based search 
optimization, which would struggle with the discontinuous objective functions present at low Reynolds numbers. 
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The systematic data presented in these three figures can now be used to validate the hardware-in-the-loop 
optimization procedure, as the optimal wing shape (in terms of lift, drag, or efficiency) is known for a sweep of 
angles of attack (to within 30 servo positions). 
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Figure 7. C,-a relationship as a function of servo actuation, U, = 16 m/s. 
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Figure 8. Cp-a relationship as a function of servo actuation, U,, = 16 m/s. 


D. Validation of Optimization Procedure 

The evolutionary optimization of the wing’s lift at 10° angle of attack is given in Figure 10. In addition to the 
GA specifications given above, a population size of 11 is used, initially uniformly distributed within the design 
space (between servo positions | and 250). The results from the function evaluations of each design in each 
generation are plotted through 13 generations (at which point suitable convergence is achieved). This data 1s 
overlaid with the results from the a-sweep at 10°, where servo position 215 is clearly optimal. Further cambering 
past this point to position 250 seems to induce flow separation, and a minor loss in lift. The genetic algorithm 
quickly moves to the area of the design space between positions 200 and 250 (only indicated in Figure 10 by the 
cluster of data points in this region), but ultimately settles on position 250 as optimal. We are thus left to conclude 
that the superiority of position 215 over 250 noted in the a-sweep tests has been affected by some form of hysteresis, 
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though the difference in performance between the two 1s less than 2%. The majority of the final population resides 
at servo position 250, with the exception of two designs at position 180, presumably a result of the 
reproduction/mutation rates employed by the genetic algorithm. Two more genetic algorithm runs at 10° produces 
very similar results, consistently converging to a position between 240 and 250. 
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Figure 10. Evolutionary optimization of C;, at a = 10° and U., = 16 m/s: function evaluations (left) and the 
initial/final design populations (right). 


Similar data 1s given in Figure 11 for maximizing the aerodynamic efficiency at 10° angle of attack. As before, 
a population size of 11 is used, and a suitable convergence is achieved after 13 generations. Whereas the correlation 
between the genetic algorithm function evaluations and the a-sweep is very good for the coefficient of lift, there is a 
considerable amount of scatter in both data sets’ L/D, ostensibly due to the error contribution from both the lift and 
drag measurements. The genetic algorithm converges to an intermediate servo position 125 (again indicated by the 
large data cluster), though a significant volume of data points lie in the range between 125 and 150, as does the final 
population reported in Figure 11. 

As noted above, the a-sweep tests indicate an optimal design at servo position 60, an apparent discontinuity in 
the L/D response surface. The cause of the discrepancy between the GA and a-sweep results is unclear: hysteresis 
may distort the data (as suspected in the above tests to locate the maximum C,), or perhaps the response 
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discontinuity at servo position 60 is narrow enough to remain undetected by the uniform initial distribution of 
designs. Regardless of the reason, the difference in the performance of the optimal design located by the genetic 
algorithm is again small, less than 5%. Two additional GA runs confirm a convergence at servo position 125. 
Additional tests to maximize L/D are also run at -5° and 0°. The former angle converges to servo position 230 (in 
good agreement with the data of Figure 9), while the latter converges to position 160 (the measured L/D of 4.2 at 
this angle is a significant improvement over any of the designs tested within the a-sweep). 
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Figure 11. Evolutionary optimization of L/D at a = 10° and U,, = 16 m/s: function evaluations (left) and the 
initial/final design populations (right). 


IV. High-Fidelity Morphing Wing Optimization 


A. Model Description 

As the results in the previous section have demonstrated a certain degree of confidence in the hardware-in-the- 
loop optimization procedure, the technique is now utilized on a higher-fidelity, more realistic wing platform. A 
single layer of plain weave carbon fiber is used to fabricate a Zimmerman planform wing (a common planform for 
micro air vehicle applications), with fibers running in the +45° directions. Curved uni-directional strips of carbon 
fiber are again used to effectively transfer the wing shape from the root to the tip. The randomly speckled wing (for 
VIC purposes) can be seen in Figure 12: the root chord is 17.5 cm, the wing span is 40.6 cm, the aspect ratio is 2.9, 
and the wing area is 567 cm’. The estimated chord-based Reynolds number for this vehicle is 180,000, a slightly 
lower value than that seen in the previous section. 
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Figure 12. Two-design-variable morphing Zimmerman wing. 
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As before, two actuation points are used: one at the leading edge (permitting mostly rotation), and a second at 
the trailing edge allows for both rotation and longitudinal translation. The latter effect prevents the mechanism from 
becoming over-constrained, decreasing the resistance seen by each morphing servo. A flat mounting platform is 
again used to house the servos and actuation linkages, and as a reference for angle of attack measurements. The first 
servo actuates the wing at x/c = 0.21; this is largely a vertical motion to provide wing camber. A second servo 
actuates the trailing edge, providing a rotation for potential wing reflex or negative camber. Introducing negative 
camber to a portion (typically towards the trailing edge) of the wing’s root will surely decreases the overall lift, but 
such a shape has certain advantages, particularly for micro air vehicle wings. Decreasing the lift will likewise 
temper the induced drag of the wing, whose relatively low aspect ratio (2.9) leaves a considerable portion of the 
longitudinal flow affected by the tip vortex swirling structures. Reflex may also be used to reverse the sign of the 
adverse pressure gradient over the upper surface of the wing, prompting flow reattachment or perhaps completely 
preventing separation at a certain angle of attack. Though only lift and drag are under consideration for the current 
work, a reflex airfoil can provide longitudinal static stability without the use of a horizontal stabilizer (the negative 
lift over the reflex can offset the nose-down pitching moment of the remainder of the wing), whose removal may be 
necessitated by micro air vehicle size constraints. 

Full field VIC results (z/c) of a cambered wing with trailing edge reflex 1s given in Figure 13, with airfoil 
sections plotted at the root and towards the wing tip. As expected, the curved carbon fiber stiffeners are less 
successful (than the rectangular wing) at transferring the same shape from root to wing tip. A slight loss in camber 
is observed, along with a shift towards the trailing edge. The choice of wing planform precludes the transferal of 
reflex to the wing tip, where the airfoil is only positively cambered. An array of the model’s morphing potential can 
be seen in Figure 14. Servo 1 is capable of +11% camber, while servo 2 1s capable of +4% reflex. As before, large 
servo | positions increase the camber, though the opposite (inverse) of this shape can now be obtained from the 
servo’s minimum position. Large servo 2 positions actuate a negative pitching moment to the trailing edge, 
increasing the reflex. A flat plate is obtained with both servos in their middle positions (125,125). Finally, as both 
the leading and trailing edge of the carbon fiber wing are fixed (as well as the intermediate location attached to servo 
1), wing aeroelasticity is not considered to be significant, and will not be examined. 
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Figure 13. Measured wind-off wing surface (z/c) at sp (250,250), with sections plotted at the root and the tip. 
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Figure 14. Measured wind-off root airfoils of the two-design-variable morphing wing model. 
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B. Optimal Lift Designs 

The optimization procedure is carried out to maximize the lift of the high fidelity morphing wing for a range of 
angles of attack from -5° to 20° in 5° increments. Testing above 20° invariably causes wing stall and excessive 
vibrations; function evaluation repeatability is very low and the genetic algorithm struggles to locate an optimum. 
In order to improve the convergence characteristics of the genetic algorithm operating with an increased number of 
design variables, the population size is increased to 25. The initial population is uniformly distributed over the 
design space, as before. Results at 10° angle of attack are given in Figure 15, in terms of the evolutionary history 
(both the mean lift across the population and the best are given at each generation) and the designs that populate the 
initial and final generations. For this case, convergence is very rapid, with little variation in the best design 
throughout the entire optimization. The majority of the population has converged upon this optimal design within 5 
generations, and the function evaluations appear to be very clean and repeatable (as previously noted in Figure 10). 
The optimal design lies approximately at (170,1): a moderate amount of camber from servo | and no reflex from 
servo 2, which would only serve to decrease lift as discussed above. Careful evaluation of the initial population 
reveals that design number 9 1s very close to this optimum. The elitism employed within the genetic algorithm 
ensures that this design is preserved from one generation to the next. 
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Figure 15. Evolutionary optimization of Cy at a = 10° and U., = 16 m/s: convergence history (left) and the 
initial/final design populations (right). 


The morphing wing shapes found to optimize lift at all six of the tested angles of attack are given in Figure 16; 
the measured performance is given in Figure 17, for comparison with the wing morphed into a flat plate (125,125) 
and an intermediate wing at servo positions (250,110). The moderate trailing edge actuation of his latter airfoil 
pushes the maximum camber to x/c = 0.4, a shape commonly seen on micro air vehicles, and thus a good model for 
comparison purposes. As expected, none of the optimal-lift designs utilize any reflex actuation from servo 2; the 
position is fixed at the minimum (1) throughout the a-sweep. For low angles of attack, camber from servo | is 
progressively increased to optimize lift, in accordance with linear theory. Flow separation clearly becomes an issue 
above 10° as the optimizer is forced to employ a moderate camber; this effect can also be seen in Figure 7 with the 
low-fidelity model. The optimal designs are identical at 5° and 20°, both calling for the largest camber. The 
adaptive morphing wing consistently out-performs both the flat plate and the cambered wing. Despite the 
aerodynamic twist of the morphing wing, all three wings have approximately the same pre-stall lift slope. 
Maximizing the lift has the unexpected consequence of improving the stalling behavior as well. In spite of a larger 
maximum possible lift coefficient, stall 1s delayed by roughly 3° as compared to the other two cases, and 
characterized by a very benign stall, with little loss of lift between 15° and 20° angle of attack. 


C. Optimal Efficiency Designs 

Similar tests are run to maximize the wing efficiency, again using a population size of 25 with the initial 
population uniformly distributed within the design space. The genetic algorithm is run between -5° and 5° angle of 
attack in 2.5° increments, and at 10° and 15° as well. The evolutionary history, initial population, and final 
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population found at 10° are given in Figure 18. As expected from the measurement repeatability issues discussed 
for the single design variable model above, convergence of the lift-to-drag ratio is much noisier than that found for 
the lift. Unlike the previous case, a member of the initial population does not lie very close to the optimum, and the 
best design of each generation changes several times throughout the optimum. Despite the variability in the best 
design’s performance from one iteration to the next, its actual identity only changes three times: at iterations 7, 12, 
and 15. The flat plateaus of the best design’s performance typically seen in GA runs with numerical objective 
functions (due to elitism) are destroyed here by experimental repeatability errors (though the pseudo-random 
variations in the mean performance is indicative of a healthy distribution throughout the design space). The genetic 
algorithm shows suitable convergence after 21 iterations, locating servo position (247,156), a cambered wing with a 
very small amount of reflex towards the trailing edge. At this point the best design in each generation has remained 
unchanged for 7 iterations, with almost no variability throughout the population for the last three iterations (as seen 
in the final population of Figure 18). Further iterations would rely solely on the mutation operator for greater 
improvements in the aerodynamic efficiency. 
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Figure 17. Morphing wing lift performance versus a flat plate and a cambered airfoil. 
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Figure 18. Evolutionary optimization of L/D at a = 10° and U., = 16 m/s: convergence history (left) and the 
initial/final design populations (right). 


The morphing wing shapes found to optimize L/D at all seven of the tested angles of attack are given in Figure 
19 and the measured performance is given in Figure 20 for comparison with the flat plate and cambered wing 
described above. As before, the camber actuation provided by servo | does not drastically change throughout the a- 
sweep, staying in a range between positions 150 and 250. Increasing the angle of attack from -5° to 2.5° sees an 
increase in the amount of reflex employed by the model to optimize L/D. Such a doubly-cambered geometry 
obviously decreases the lift, but it is unclear whether the drag is tempered through viscous (flow reattachment due to 
a favorable gradient) or pressure (loss of lift leading to a subsequent drop in the tip vortex strength) considerations. 
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Figure 19. Morphing wing shapes and servo positions for maximum L/D through an a-sweep: U., = 16 m/s. 


Between 0° and 5° angle of attack the differences in the optimal shape are minor, though further increase in the 
incidence begins to remove the reflex from the wing. At 15° the reflex is completely gone, with servo 2 now 
pushing the wing upward. This, coupled with a mild amount of camber from servo | provides a very counter- 
intuitive wing shape, with a relatively flat leading edge and the wing camber shifted back to x/c = 0.65. A similar 
shape is used at -5° angle of attack. One possible explanation for such a shape can be found in the design of low- 
speed laminar airfoils, where the maximum thickness/camber is placed farther aft than usual, in order to delay the 
location of the flow separation that is certain to occur against an adverse pressure gradient at low Reynolds numbers. 
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Substantial improvements in the performance of the morphing wing can be seen in Figure 20 for all of the angles of 
attack tested, with the exception of 10° (where the cambered airfoil used for comparison is very similar in shape to 
the optimal design). 
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Figure 20. Morphing wing L/D performance versus a flat plate and a cambered airfoil. 


V. Conclusion 


A procedure has been established to achieve proper shape management of morphing wing structures, utilizing a 
genetic algorithm with wind tunnel hardware-in-the-loop to optimize the lift and the efficiency. A detailed 
examination of the entire design space of a morphing wing with a single actuation point was used to verify the 
optimization process. A morphing wing structure with two actuation points provided a more practical set of optimal 
designs that effectively utilize doubly-cambered airfoils to control the lift-to-drag ratio. The largest barrier to the 
use of genetic algorithms with experimental objective functions is repeatability and error, thought to be primarily 
due to aerodynamic/electrical hysteresis encountered at the low testing Reynolds numbers. The elitism commonly 
employed in genetic algorithms will copy the best designs from one generation to the next, but experimental error 
may prevent sequential measurements of the superior performance, delaying optimization convergence. 

For the two examples considered here, the genetic algorithm based technique will not out-perform (in terms of 
fewer function evaluations) conventional experimental optimization techniques such as design of experiments 
(DOE). Dividing each servo actuation sweep into 10 increments for a full factorial design of the high-fidelity 
morphing wing structure requires 100 function evaluations to fit a response surface and locate an optimum. For the 
wing efficiency optimization, the GA requires 21 generations to converge with a population size of 25: 525 function 
evaluations. This discrepancy becomes less true with the addition of further design variables however: with five 
modes of wing actuation, for example, the hardware-in-the-loop optimization may be the only feasible method of 
properly exploring the experimental design space (the size of the factorial matrix will grow exponentially: 10° 
function evaluations for the example given). Micro and unmanned air vehicles are well suited for multiple levels of 
actuation: the flexible materials commonly utilized and the small flight loads allow for significant changes in shape 
with a relatively low amount of power. Effective shape management of such a vehicle is outlined in the techniques 
described above, and may be the best method for maximizing the performance of a high-fidelity morphing wing. 

Future work will include the use of laser-based flow visualization to document the flow patterns over the 
morphing wing during hysteresis testing and genetic algorithm testing. Flow separation and reattachment are 
obvious points of interest, hopefully elucidating the performance of the optimal wing designs. Further optimization 
runs will be conducted under various constraints: maximizing lift or efficiency with minimum bounds on the drag or 
the pitching moment (of particular interest to the static stability of the vehicle). Using these constraints at high 
angles of attack could result in the mitigation of wing stall, or yield a controlled descent flight mode useful in auto- 
land features. A genetic algorithm has no natural method for handling constraints, necessitating the use of penalty 
factors to the objective function for infeasible designs. As discussed above, higher fidelity morphing wings will be 
built and tested. The individual or simultaneous utilization of several distinct actuation modes 1s very realizable: the 
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multi-point aerodynamic twist used here, but also variable aspect ratio (through a telescoping mechanism perhaps), 
propeller thrust line, wing sweep (multiple joints can be considered for different in-board and out-board angles), and 
geometric wing twist. Finally, the optimization results will be used in an on-board control system for real-time wing 
morphing. Such a control system will receive instantaneous angle of attack and wind speed measurements, and in 
response, command the series of actuators to optimize a given objective function (which may be determined based 
upon the current flight mode). This will be done using data similar to that seen in Figure 16 and Figure 19, either in 
terms of a look-up table, or by fitting the optimal servo positions with a response surface. The former may require 
rounding the measured on-board angle of attack to the nearest optimization data point, while the latter requires 
interpolating between optimization data points. 
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SUMMARY 


A general theory of steady three-dimensional flow of @ nonviscous 
fiuid in subsonic and supersonic turbomachines having arbitrary hub and 
casing shapes and a finite number of blades is presented. The solution 
of the three-dimensional direct and inverse problem is obtained by 
investigating an appropriate combination of Plows on relative stream 
surfaces whose intersections with a z-plane either upstream of or some- 
where inside the biade row form @ circular are or a radial line. The 
equations obtained to deseribe the fluid flow on these stream surfaces 
Bhow clearly the several, approximations involved in ordinary tio- 
dimensional treatments. They also lead to a solution of the three- 
dimensional problem in a mathematically two-dimensional manner through 
iteration. The equation of continuity is combined with the equation of 
motion in either the tangential or the radial direction through the use 
of a stream function defined on the surface, and the resulting equation 
ig chosen as the principal equation for such flows. The character of 
this equation depends on the relative magnitude of the Local velocity of 
sound and 4a certain combination of velocity components of the fluid. A 
general method to solve this equation by both hand and high-speed 
digital machine computations when the equation is elliptic or hyperbolic 
ia described. The theory is applicable to both irrotational and rota- 
tional absolute flow at the inlet of the blade row and at both design 
and off-design operations. 


INTRODUCTION 


The problem of three-dimensional flow in turbomachines of axial-, 
radial-, and mixed-flow types is treated in references ]. to 1S. Because 
of the enormous mathematical difficulties involved in the problem, Lorenz 
(reference 1) first introduced the idea of an infinite number of blades 
of infinitesimal thickness in order to follow the flow on a given surface, 
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Bauersfeld {reference 2) added to the theory the condition of integra- 
bility for the blade surface that must be satisfied In the inverse, or 
design, problem. The theory is further clarified and strengthened by 
the works of Stodola (reference 3), von Mises (reference 4), and Dreyfus 
(reference 5), and is the basis of many recent investigations on axial-, 
radial-, and mixed~flow compressors and turbines. 


For incompressible Plow, Ruden (reference 6) proves that the 
through-rlow solution obtained under the assumption of an infinite num- 
ber of blades gives a circumferentiaily average value of the fluid prop- 
erties, provided the deviations of the fiuid properties from their cir- 
cumPerential averages are small. In reference 7, Traupel points out the 
oselliatory nature of radial Plow in 3 multistege turbomachine and gives 
solutions of the three-dimensional potential flow through inclined 
stationary blades and also of the rotational flow through a homogeneous 
stage of identical nontwisted blades for an incompressible fluid and an 
infinite number of blades bounded by cylindirecal walls. Meyer gives a 
detailed treatment of three-dimensional potential Plow in a stationary 
blade row, for an incompressible fluid and cylindrical bounding wall, 
in reference 8, there the solution for an infinite number of Dledes is 
extended to a finite number of blades by the vortexsand-source method of 
Ackeret, which is originally given for two-dimensional flow (reference 9). 
In reference 10, @ Jinearized solution for an incompressible fluid and an 
Lnfinite number of blades for a prescribed loading and cylindrical 
bounding wails is obtained by Marble, and is used later to investigate 
the problem of mutual interference of adjacent biadeé rows and off- 
design operations (reference 11). Siestrunck and Fabri. (reference 12) 
also obtained a linearized solution for incompressible flow, and the 
method is extended to compressible flow. For general wall shapes, 
Spannhake (reference 15) examines the flow through diffuser and impeller 
by the use of bound vortices for blades. The incompressible through 
flow in a mixed-flow impeller is treated by Gravolos (reference 14). 

In reference 15, Wislicenns examines the inflivence on the meridional 
flow of the blade force and nonuniform circulation along the blade span. 


For compressible flow, Reissner (reference 16) gives a biade-design 
Inethod in which the extension from an infinite number of blades to a 
finite number of blades is accomplished by the use of a power series in 
the circumferential direction, and the terms in the series are deter- 
mined by a comparison of the equations for an infinite number of blades 
and a finite number of blades. (In reference 5, Dreyfus gives a method 
of designing water turbines of thin blades, in which the solution for 
an infinite number of blades is extended to a finite number of blades by 
the use of a power series, the second term of which is determined from 
the equations of continuity end irrotational absolute flow and is 
explicitiy given.) In reference 17 the compressible flow problems in 
axial turbpomachines having an infinite number of blades are treated, and 
both the direct and inverse problems are considered. Methods for 
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limiting solutions for zero and infinite blade-row aspect ratios and a 
step-by-step method of solution, as well as a simpler method based on an 
approximate knowledge of the shape of the streamline, for a finite blade- 
row aspect ratio are given. Unaware of the work of Traupel at the time, 
the anthors of reference 17 also emphasized the oscillatory nature of 
radial flow in multistage machines and suggested the use of a simple 
sinusoidal form of the streamline as a first approximate solution. Their 
methods are derived for compressible flow, however, and are also extended 
to the case where both the hub and caésing wails or either is tapered. 
Reference 18 gives a general. through-fiow theory for both direct and 
inverse problems and for subsonic or supersonic flow in turbomeachines 
having arbitrary hub and casing shapes, The supersonic through flow in 
rotating impellers having 2 prescribed flow along the casing and pre- 
scribed blade shapes is treated in reference 19. 


A general theory of three-dimensional flow in subsonic and super- 
sonic turbomachines of axtal-, radial-, and mixed-flow types for a 
finite number of thick blades of finite thickness has been developed at 
the WACA Lewis laboratory and is presented herein. Both the direct and 
inverse problems are considered. The theory is applicable to either 
irrotational or rotational absolute flow at the inlet of a blade row 
and to both design and off-design operations. 


In the section BASIC AEROTHERMODYNAMIC RELATTONS, the motion and 
energy equations for the unsteady fiow of a nonviscous compressible rinid 
in a rotating blade row are expressed in terms of the velocity components 
and of two basic thermodynamic properties of the fluid, namely, entropy 
and a modified total enthalpy for flow in roteting blade rows with change 
In radial distance from the machine axis. Estimated entropy changes due 
to shock waves (in the case of supersonic flow), heat transfer (in the 
case of a cooled turbine), or viscous effect can be easily accomodated 
in the calculation, The equations obtained show clearly the condition 
under which the flow through blade rows can be treated on the basis of 
irrotational absolute flow. 


In the following section, a general potential equation is obtained 
for steady three-dimengional compressible flow through rotating or 
stationary blade rows when the absolute flow can be taken as irrota- 
tional. The methods of solution for both subsonic and supersonic flows 
are briefly discussed. : 


A simpler method of solving the three-dimensional irrotational 
(absolute) Plow, which is also applicable to rotational (absolute) flow, 
is obtained by considering fluid flows on a number of relative stream 
surfaces whose intersection with a zg-plane either upstream of or some- 
where in the blade row form a circular arc or a radial line. Equations 
governing the flow on these surfaces are obtained in the next four sec- 
tions. Through the use of a stream function defined on the stream 
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surface, the equations of continuity and motion for fluid Plow on these 
surfaces are combined into one principal equation, The character of the 
Principal equation is dependent on the relative magnitude of the local 
velocity of sound and & certain combination of velocity components. 


The process involved in soiving the direct and inverse problems by 
thia approach is described in the section STEPS FOR COMPLETE SOLUTIONS 
OF THREE-DIMENSTONAL DIRECT AND INVERSE PROBLEMS. Im the inverse prob- 
lem, besides the blade-thichkness distribution determined by biade 
strength and other considerations, either the tangential velocity, a 
relation between the tangential and axial velocity, or one other rela- 
tion is prescribed on 4 mean siream surface about midway between two 
blades. The last section gives a general method of solution of the 
principal equation when it is elliptic or hyperbolic. 


DIMBOLS 


The following symbols are used in this report: 


a velocity of sound 

B,yb integrating factor for continuity equation for So and S17 
surfaces, respectively 

Tips. differentiation coefficient -used to multiply function ya Lue 

le at point j to five the m°P derivative at point ji based 


on nei degree polynomial 


O,c nonzero term on right-hand side of continuity equation for 
So and 8S, surfaces, respectively 

Cp Cy specific heat of gas at constant pressure and volume, 
respectively 

= differentiation with respect to time following relative 
motion of fhuid particle 

D''g m°4 gerivative of q 

F,f vectors having the unit of force per unit mass of fluid 

Gog given function of W/W, on 8, 

H total enthalpy per unit mass of fluid, 4h + ev" 


bh static enthalpy per unit mass of fluid, u + p/p 
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L modified total enthalpy for flow in rotating blade row with 
change in radial distance from machine axis, 


1 
beg -= fr2 or H - o(Vyr) 


J,K,L,M,N coefficients of Pirst~ and second-order derivatives in the 
principal equation 


k thermal conductivity 

L distance along streamline 

bs) orthogonal coordinates on surface of revolution 

M mass flow between mean stream surface and one surface of 
blade 

iy number of blades 

n unit vector normal to relative stream surface 5&5 

P static pressure 

Q heat added to fluid particle along its path per unit mass 
per unit tine 

qd any quantity on relative stream surface § 

R gas constant 

Tp 1 remainder term of m2 derivative at point i obtained by 

2 using n&4 degree polynamial 

r radius vector 

a relative stream surface passing through fiuid particles 
iying on & circular arc upstream of or midway in blade 
row 

So relative stream surface passing through fluid particles 
lying on 2 radial or curved line upstream of or midway in 
blade row 

8 entropy per unit mass 

a * a/R 


T static temperature 


tesa er Sas a a ge eee. ee ee ee, EL eee ee 
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time 

velocity vector of blade element at radius r 
interval energy per unit mass | 
absolute velocity of fluid 

velocity of fluid relative to blade, v- U 
ju, 2 + We" 

independent variables 


distance along turbomachine axis 


ratio of specific heats 

average value of Y for the temperature range involved 

grid spacing 

equal to 1 and Fr for 5, and S. surfaces, respectively 


independent variable 2 or Yr for 65, surface and 2 for 
Do suriace 


independent variable $ and r for 5, and Se surfaces, 
respectively 


angular distance of fluid particle measured with respect to 
stationary radial line 


slope of characteristic curves, vo 
tan o 
& 
arc sin - 
equal to r and 1 for 6, and S» surfaces, respectively 


absolute vorticity, VX V 


fluid density 


oV 
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generalized variable used for general density table 


a angie between tangent of streamline or boundary well in the 
neridional plane and axial direction 

T radial, axial, or angular thickness of stream sheet 

D velocity potential 

p generalized variable used for general density table 

cp angular distance of fluid particle measured. wiae respect to 
radial line on rotating blade 

x angle between w and axial direction 

V,y stream functions defined on relative stream surfaces > and 6); 
respectively 

iD angular velocity of blade 

Subscripts: 

c casing 

e exit 

hn hub 

L inlet 

é meridional component 

m mean stream surrace 

oO Lower limit of integration 

r,u,Z radial, circumferential, and axial. components 

5 isentropic 

T total state. 

nC components in y- and (€-direction, respectively 

L on 8, or in front of rotor 

a on 85, ov behind rotor 


es ee 
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superscripts: 


a,b, . +. .- KE yrefer to points a,b, ... K, respectively 


BASIC AFROTHRRMODYNAMIC RELATTonS+ 


The three-dimensional flow of a nonviscous, compressible fluid 
through a turhomachine is governed by the following set of basic lave 
of aerothermodynamics. From the principie of conservation of matter, 
the equation of continuity is 


38 +V¥«(ow) = 0 (1) 
or 
VeW+PPag { 12) 


For a blade rotating at a constant anguler velocity @ about the 
Z-axis, Newton's second law of motion gives 


DW 2 if 
Te © + 2a YP (2) 


Because the boundary walls are surfacea of revolution and the relative 
Flow can be approximated as being steady in many cases, it is conventent 
to use a relative cylindrical coordinate system r, , and 2 with 
cp measured with respect to the rotating. biade (see fig. 1). By use of 


DW (W.V) W = 2 = Vw" ~ WX (XW) 


the scalar forms of the equation of motion (2) in the axial, circum- 
ferential, and radial directions can be expressed as 





OH OW, WW, ow Ww, WwW" 
ty ee te age + Wy go - Bee ake ~ aatty = - 2 (20) 
oy WwW, Wy wy OW,  Wp»W : 
set te ap tear tS a + Quit, = - SB (2b) 
wW, aw, Wy WwW, WW, 1 op 
ow 8 oe tr tp tos =" 5 dz (2c) 


lsome of the relations given in this section have been given in refer- 
ence 19. They are repeated here for completeness and easy reference 


for the following developments. 
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The first Lew of thermodynamics may be written 


= 
Du yp Dip) 9 (3) 


Where u is related to the temperature T by 
du = ec, at (4) 


and Q is given by the following equation if only conduction is 
considered: 


Q = pl vy. (KVT) (5) 


For the ranges of temperature and pressure encountered in ordinary 
turbomachines, p, p, and YT of the gas are accurately related by the 
following equation of state: 


p = R pf (6) 


Although the flow of the gas through the turbomachine is completely 
defined by the preceding equations together with the Imow variations of 
and k with temperature and the given boundary and initial condi- 
tions, it is found more convenient in references 17 and 18 to express 
the state of the gas in terms of the entropy and the total enthalpy or 
a4 quantity I of the gas, besides its velocity components. These 
quantities are defined as follows: 


T ds = du+p d(p +) (7) 
Hehtsv (8) 
i al 
Teh+5we -5 0° = 8 - af V7) (9) 
end 
=i 
h=u+pp (10) 


From equations (10), (4), and (6} is obtained 


dh = (cysR) af = op at = (11) 


where y is equal to Cp / Cy and is a function of temperature. Another 
expression for dh is obtained by using equations (10) and (7), so that 


dhe Peo as (11a) 


x — ee ee 
ee ee ee 
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By the use of equations {7), (4), and (6), 


sy, ¥ | 
ae) = sya inp - yd inp (12) 


end 


a5) =; a nT - a4 ing (12a) 


VWeo es 2c EO (15) 


Equation (13) can be expressed in @ slightly different form. From the 
definition of the local velocity of sound (reference 20), 


2.) os 


By the use of equations (12) and (6), 


af ay E = yRD (148) 


Substituting this relation into equations (12a) and (13), with the use of 
equation (11), resuits in 


© dh 5 
dinp= 2 as (12b) 
and 
1 Dh 0D 8s 
ViW+ oe - peR™ ° (13a) 
From equations (9) and (lla), 
=P + Suu ~ofr = VI -TVS 
The equation of motion (2) ean then be written 
Ow 
<= WXHIVXW) + 2wxXWe= -VI+Tve (15) 


An alternative form of equation (15), which involves the vorticity of 
the absolute motion, is obtained as follows: With the z-axis parallel 
tO Wy, 


———— 
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V=Wtruwxe (16) 
hence 
VXV=VXW +0X (wx Fr) (17) 
But | 
VX(wXr) = (rey) w -(wev) r tawl(ver) - r(yew) = Bw 
therefore 


VXV = VKXW + 2w (17a) 


This relation can also be seen from the following expressions of rela- 
tive and absolute vorticity expressed in terms of the rotating and 
stationary cylindrical coordinates r,om, z and r, @, 2, respectively: 


OW OW 
(WxXW) =F 3p 7 OE 
Sie | OW 
(VXW),, = = hb — (18) 
1 AWyr) 1 OW, 
WAW) 2 oe EOD 
OV OV 
a Z, u 
VX VES So Sr 
ov. OV 
(VXV). ea - (19) 


and the relation 


Vr) a(wyr) 
ore * 


Using equation (17a) results in the alternative form of equation (15) 


aw 


< ~WX(VX V) = -VI + IVs (15a) 
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By use of equations (2), (9), and (11a), 


DI Ds lo .1l,. DW DU 
= + (WV) D+ Ws Be - Us be 


p 
Da 1 op 2 Du Ds 1d 
Toe to oe tT Welwor-2exw) -U- Ge = Te +5 St (20) 


ll 


Tt may be emphasized that the preceding equation is a consequence 
of the equation of motion (2) and the thermodynamic relations (4), (6), 
(7), and (10). For steady relative flow, the rate of change of I 
along the streamline is seen to be proportional to the rate of chang 
of entropy along the streamline. ) 


The energy equation (3) can be used to express the rate of change 
of entropy along the streamline by the use of equation (7) as follows: 


Des 
Q= T ss (21) 


The preceding equations lead to several important general consider- 
ations: If the blade rows are not placed too close together and no 
trailing vortices are shed from preceding biade' rows (or where these 
effects can be neglected), the fluid properties at a fixed point rela- 
tive to the blade can be taken as constant with respect to time. Con- 
sequently, according to equations (20) and (21), the quantities s and 
I of the gas remain constant along the streamline for adiabatic flow. 
The invariancy of I means that the rate of change in total enthalpy 
along the streamline is equal to the angular speed of the blade multi- 
plied by the rate of change in angular momentum (about the machine axis) 
of the fluid particle along its streamline, which is the well-known 
uler turbine equation usually derived under less general conditions. 

In @ cooled turbine where the heat transfer may be large, the rate of 
change of s and I along the streamline can be corrected by equa- 
tion (21) Por an estimated value of @Q. Again, for steady relative flow, 
equation (15a) shows that either when gradient I and gradient s both 
vanish or when the difference between VI and TYs vanishes, the 
absolute vorticity either yanishes or is parallel to the relative 
Velocity. 


For the flow through eae stationary blade row m= 0, W becomes V, 
I becomes H, and equation (15a} becomes 


Ov vx (vxv) = -VE+TV5 (15b) 


Which agrees with similar relations previously obtained by Vazsonyi 
(reference 21) and Hicks, Guenther, and Wasserman (reference 22). It 
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is interesting to see that, for relative flow in a rotating blade row, 
VX(V XV} becomes Wx(YXV) and H becomes I. 


Tf it is assumed that the fluid enters the inlet guide vanes of a 
turbomachine with uniform H and 3s and zero vorticity and that the 
Plow is adiabatic, s does not vary in the iniet guide vanes and p 
is then a function of only p, according to equation (12). Gonse- 
quently, by virtue of Kelvin's circulation theorem, the absolute vor- 
tieity will remain zero in passing through the inlet guide venes and the 
Plow in the inlet guide vanes can be treated on the basis of irrota- 
tional absolute flow. 


Tf the guide vanes impart a radial variation of tangential velocity 
of the fluid in @ z-plane downstream of the vanes similar to that in a 
potential vortex, that is, inversely proportional to the radius, the 
circulation is constant along the blade span and the fluid maintains a 
uniform s and H anda zero vorticity of absolute flow entering the 
following rotor-blade row. If the rotor-blade row is situated far away 
from the inlet guide vanes, the fluid enters the rotor with a uniform 
I in the circumferential direction, as well as in the radial direction, 
and the flow through the rotor blades can again be treated on the basis 
of zero absolute vorticity and steady relative flow. If the rotor is 
close to the guide vanes, however, vortices are shed from the inlet 
guide vanes because of periodic variation in circulation caused by 
unsteady flow, and the Plow downstream of the stator and. through the 
rotor blades should theoretically be treated on the basis of rotational 
flow. 


If the guide vanes impart a radial variation of tangential velocity 
of the fluid at a z-plane downstream of the vanes not inversely propor- 
tional, to the radius, the circulation varies along the span of the guide 
vanes, vortices are shed from the trailing edge to the fluids dowmstream 
in the direction of the exit velocity, and the fluid enters the follow- 
ing rotor blades with a uniform s and H but 4 nonuniform I anda 
nonzero value of absolute vorticity. Consequentiy, the flow through the 
rotor-biade row can no longer be treated on the basis of zero absolute 
vorticity, even if it is far apart from the preceding guide Vanes. 


From the preceding discussion, the choice of s and H or I 
as the two basic thermodynamic variables of the gas besides its veloc- 
ity components is apparent. Compressor and turbine rotors are usually 
Gesigned to impart or subtract the same amount cf energy to or from the 
gas radially; hence HE is usually radially constant throughout the 
machine if the inlet flow is uniform (except in the boundary layer along 
hub and casing walle). If the circumferential velocity of the gas 
upstream of the blade row is zero or varies inversely with radius, [I 
is then constant throughout the machine. These facts will be utilized 
in the following developments, 


a ee a ee ee Tn a EY. = Se ee eee 
=e eet) sessaaeta ms 
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POTENTTAL EQUATION POR THREE-~DIMENSTONAL PLOW THROUGH 
ROTATING BLADE ROW 


Consider first the special case of steady relative flow where the 
fluid upstream of the blade row is free of vorticity and is uniform in 
EF and s. The adiabatic flow through the blade row is then relatively 
steady and absolutely irrotetional and is most conveniently treated by 
the use of a velocity potential & based on the zero absolute vorticity 
and. related to the relative velocity components through equation (16) as 
foLtowse: 


of 
= Vy = Wy 
ioe Vy = Wy + or (22) 
ad 
an Ve = W, 


For steady isentropic flow, the continuity equation (13a) becomes 


o(W 3) OW OW W 
1 rt 1 Py Zz oh u oh dh 
be de a (ey Be Bee Bao (25) 


From equations (9) and (22), 


2 a a 
ad 1} fae fa a6 36 
ne reo 30-2} (2 + ES) (2) (24) 
sh a) Wud?d Vue 3°o 
Bf, ESS * dzon = 
1 3h We 3° Wn a°@ Wy 3% 
em. (2 oe ee (25b) 
a Wo. oe Fa 
3h so Mua ato 
mH. (i, mor r wes) (25¢) 


By the use of equations (22) and (25a) to (25c), the continuity equa- 
tion (23) may be written 
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at / dr* a® J r@ dof a’ / 32° 


| 2 
2 Wey 1 34 _9 WuWz 1 3%6 _9 Wale 9°06 race Yu JP _ 0 
a2 vr drop ” 42 4r opez a2 Ozer "© ge / or 


(26) 


Equation (26) is then the three-dimensional potential equation for 
isentropic flow in @ rotating blade row. Jt is seen from this equation 
and equation (16) that the three-dimensional flow through a rotating 
blade row cannot be treated by a three-dimensional flow through a sin- 
Llar stationary blade row with the same inlet condition relative to the 
blade row, as in the case of two-dimensional flow on a cylindrical sur- 
face, because the difference between the absolute and relative vorticity 
ew does not enter into the two-dimensional flow on a cylindrical surface 
but does enter into the three-dimensional case, 


Equation (26) is very similar to the ordinary three-dimensional 
potential equation for flow past stationary objects, except thet both 
relative and absolute velocity components are involved in the coeffi- 
cients of © derivatives and that © is directly defined by the abso~ 
lute velocity. The real. difficulty in solving this equation lies in 
the fact that all the velocity components change greatly in passing’ 
through a turbomachine and, consequentiy, the equation cannot be lin- 
eparized and yet give a good approximate answer. For supersonic relative 
velocity, the method of characterlatic surfaces (references 235 and 24) 
may be used to solve equation (26), with the initial conditions not 
given on @ characteristic surface. For subsonic relative flow, the 
equation is more conveniently written in the form 


a ie vas W 
o6 196, 19396. 36 0 in ud in d Jin p _ 
wi tart yagi yet mae te ap tee tO 


(26a) 


and can be solved by Southwell's relaxation method (reference 25) or 
other numerical methods using the differentiation formulas obteined in 
reference 26 to teke care of the unequal grid spacings near the blade 
Burfaces and the curved hub and casing walls. The lest three terms in 
equation (26) sre computed from the $6 values or velocities obtained 
in the previous cycle and kept as constants during each improvement of 
@ values, and the whole process is repeated until the desired accuracy 
is obtained. Because e three-dimensional stream function cannot be 
defined, the use of velocity potential results in & boundary-value prob- 
fem of the second kind, which is more difficult to handle in the calcu- 
lation than the first Kind. The boundary condition to be satisfied is 
that the relative velocity normal to the moving blade is zero, or 
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W,. 0, + Wn, +W, pn, = 0 (27) 


Where nA is the unit normal vector at the boundary surface, and that, 
at inlet and exit stations far away from the blade, the velocity is 
parallel to the bounding hub and casing walls, which, in the case of 
the axial machine, means that 


V, = = 0 at z= te (27a) 


In both the subsonic and supersonic cases, the solution is 
extremely time-consuming. Furthermore, this direct approach to the 
three-dimensicnal problem requires that the absolute velocity at the 
inlet to the blade row be irrotetional and the fiow be adiabatic. in 
actual machines, the flow entering the blade rows is always rota- 
tional, which is caused by a nonuniform total enthalpy and entropy at 
the inlet of the machine, by entropy change caused by shock waves or 
heat transfer, or by the effect of boundary layers along the hub and 
casing wells. Some other approach to the problem, which is simpler to 
handle and is also applicable to rotational inlet flow, is therefore 
desirable. One approach is suggested in the following sections. 


FOLLOWING FLUID FLOW ON RELATIVE STREAM SURFACES 


In order to solve the steady three-dimensioneal Plow, with either an 
ixrrotational or rotational absolute fiow at the inlet, in e relatively 
simple manner, an approach is taken to obtain the three-dimensional 
solution by an appropriate combination of mathemtically two-dimensional 
Flows on essentially two different kinds of relative strean surface 
(figs. 1 to 3). Whe first kind of relative stream surface is one whose 
intersection with a z-plane either upstream of the blade row or midway 
in the blade row forms a circular are (fig. 1). The second kind of 
relative stream surface is one whose intersection with a z-piane either 
upstream of the blade row or somewhere inside the blade row forms a 
radial line (fig. 2). These two kinds of relative stream surface will 
be hereinafter designated stream surfaces 87] and Se, respectively. 


57 Stream Surfece of First Kind 


In figure 1 is shown a stream surface of the first Kind formed by 
fiuid particles lying on a circular arc ab of radius oa upstream 
of the blade row. It is usually assumed in ordinary two-dimensional 
treatments (for example, references 27 to 30) that the stream surface 
thus formed is 4 surface of revolution. In the following development, 
the surface will be allowed to take whatever shape it should have in 
order to satisfy all the equations governing the tbhree-dimensional flow. 
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In most cases, the deviation of the surface from a surface of revolution 
is not large, and it is satisfactory to consider 65, surfaces formed by 


Fluid particles originaily lying on a cireular arc upstream of the blade 
row. if the rotationality of the inlet abgoiute flow is large, if the 
blade is designed for a velocity diegram quite different from the free- 
vortex type, or if the blade length is long in the direction of the 
through flow (radial- and mixed-flow machines) , the twist of the surface 
may be quite large, resulting in very large circumferential derivatives. 
If this effect is found during calculation or mown from experience, it 
Le more satisfactory to consider 8, surfaces formed by fluid particles 
originally lying, in front of the blade row, on curves inclined to the 
circniar arc in a direction apposite to the twiat of the surface. In 
this way, the intersection of the 8; surface with a constant z-plane 


about midway in the flow path is a nearly circular arc, and the total 
twist of the surface will be about equally distributed toward the 
upstream and downstream directions (fig. 3). If this distribution of 
the twist of the stream surface is still not enough, it may be necessary 
to divide the complete flow path into a few shorter paths and consider 
an 87 surface for each of them. Under these conditions, 8; surfaces 
formed by fluid particles originally lying on the hub or casing walls 
upstream of the blade row should not be chosen in order that the compli- 
cation arising from the possibility of fluid particles leaving the well 
and flowing along the blade surface may be avoided. In such cases it 

is better to consider the 8] surface a short distance from the hub’ and 
casing; otherwise, for an approximate solution the fluid can be con- 
sidered to follow the hub and casing walls, which are surfaces of rev- 
olution, and the calculation is thus much simpler than that for a 
general surface. 


Bo Stream Surface of Second Kind 


A stream surface of the second kind is shown in figure 2. The 
most important surface of this family is the one about midway between 
two blades dividing the mass flow in the channel into two approximately 
equal parts. This surface is designated the mean stream surface 
(82 vm) For blades with radial elements, such as the one shown in fig- 


ure 4, it ts convenient to consider a mean stream surface formed by 
Pluid particles originally lying on a radial line ab upstream of the 
biade row if the twist of the surface is not expected to be large. 
Otherwise, the radial line is chosen sbout midway in the passage with 
the Pluid particles originally starting out from a curved line upstream 
of the blade row such as shown in figure 3. 

The mean stream surfaces for axial-flow ges turbines designed on a 
free-vortex velocity diagrem are shown in figures 3 and 4. The redial 
element of the mean stream surface (fig. 4) is. chosen accordingly as the 
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stator is designed to aline the blade sections radially at the leading 
edge, tralling edge, or somewhere between. Inasmuch as the rotor-blade 
sections are_usually alined radially at or near the center of pravity 

of the blade sections, the radial position of the mean relative stream 
surface is chosen at the same position (figs. 3 and 4). The continua- 
tion of the stream surface outside the blade row is not shown. The mean 
stream surfaces Tor the inlet stage of a multistage axial compressor 
designed on the principle of a symmetrical velocity diagram at all radii 
are shown in figure 5, 


Both of these two kinds of stream surface are employed, in general, 
in the solution of the three-dimernsional problem. The correct solution 
of one surface often requires some data obtainable from the other, and, 
consequently, successive solutions between these two are involved. Yet, 
the solution of each surface is manageable with the present mathematical 
technique and computational facilities. In many practical cases, and 
especially in the inverse problem, however, this iteration may not be 
required if only an approximate solution is required or if the prescribed 
Valnes lead to a satisfactory blade shape. These points will be dis- 
cussed in the section next to the last (pp. 53 to 57). 


Relations among Relative Velocity of Fluid, Coordinates of 
otream Surface, and Normal to Stream Surface 


in general, the coordinates of the stream surfaces and their differ- 
entials are reiated, respectively, by the following equations: 


S(r,t,z) = O (28) 
Sar +S ap + SF az = 0 (29) 


Rather than use the three partial derivatives of S with respect to 
the coordinates, it is conventent to consider the unit vector n norml 
to the surface, which is related to § by — 


Dy Dy Re 1 
ey BS > TD (30) 
or rop dz OS 198 os 
or} T\F Sp) * Sz 
The vector n is, of course, perpendicular to the relative velocity wW, 
so that 


n=eW= 0 


ee Oe 
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or 
nw, + o WW, + uW, = 0 (31) 
By using equation (30), equation (29) can be written 


n, drtn,r d)+n, dz=0 (29a) 


The vectors n and W are show on 5, and 8. surfaces in fig- 
ures i and 2. 


EQUATIONS GOVERNING FLUID FLOW ON By SURFACE FOR AXTAL-FLOW AND 
AXTAL-DISCHARGE MIXED-FLOW TURBOMACEHINES 


If the fluid motion on 8, is followed, equations (28) and (31) 

can be used to eliminate one of the three coordinates. For axial-flow 
= 6{a) to 6(c)}) and axial-discharge mixed-flow turbomachines 

fig. 6(d}), it is convenient to express r interms of and gz. 
For radial-flow and radial-discharge mixed-flow turbomachines (figs. 6(e) 
and 6(f)), this system will encounter difficulty at the exit where the 
rate of change of fluid state with respect to 2 becomes infinite. It 
is therefore necessary to eliminate 2 and to consider r and © as 
the two independent variables. 


Flow Along General 6, Surface 


For axial- and mixed-flow turbomachines, any quantity gq on the 
5; surface is considered a function of and 2; that is, 


q = al @, z,r(%, z)| 


The change in q along Sy due to a small change in m@ while z is 
held constant is (see fig. 1) 


aq = Sap + $8 SF ae 


From equation (29a), 


ie 


or 
op 


Sate rg er re eee ll 
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herice 
dg A 2a) 
dq = (34 a, oe) °? 


Similarly, for dp= 0, 


With a bold partial derivative sign used to denote the rate of 
change of any quantity q on 6) with respect to p or 2, with the 
other kept constant, the preceding relations give 


ap Poker or 

(32) 
aq aq 29 
oz ° OZ ny = 


"With the relations (31) and (32), the rate of change of q along a 
streamline on™ 5, is 


A 

oz tt Soe, He 28 (33) 
r 

Equations of continuity and motion. - When the fluid motion is 


followed along the stream surface and equations (31) and (32) are used, 
the continuity equation for steady relative motion becomes 


5 8(pWy) = pi.) 


— —_ ap — “ez = p (tp, 2) (34) 
where 
1 ( o(W.. r) WW, OW 
e(p~,z) = - — Soe tte (35) 


nh. 


For rotational steady relative motion, the equations of motion (14) 
in the radial, circumferential, and axial directions are 
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Z oI os 


Wy" WwW, Wy Wp (is i) 
oz e/a - StS 


yk 
ce ae ty op tv 


1 0L . T ds 
Lee S) + Oly - Fs +E ap 





WW Wy Wp Wp, 1 WW, Wy 
r Tero - rap ~ 2 


OW OW OW OW 
(36) 


Relations (9}, (16), (31), and (32) along the relative stream sur- 
face S) can be used to reduce equations (36) to the following: 


rat 
Wy Wu oWy oWr 
“ete ap t M2 a> iy = q= -lgy-° 





Wi, Wy» OW OW, OW 
oo ee ee ee ee, _ lal ,foas "ufdh _ os 2 
rr ap 2 (2 sae ast oe -- 23 -2 (2-2 wr? * 


ow aw OW n 
r 1 oe a az as z {ob as 
“Wr gg t Wa Fog - gp) - er B- 2 (So B. ) 
(37) 
The last term in each of the preceding three equations is propor- 
tional to the components of the normal vector and therefore can be 


expressed as a component of a vector that is parallel to n ané@ has 
the dimension of force per unit mass. If this term is defined as 


: 1 fan _ 3s 2) afl 2 
fe-3 Sr 7 Pye nae = E(B ate n (38) 
the preceding equations can be written 


r af 4 3zZ - 





Wri, Wp, ai, 1, WW, OW, 1 T 
rr op M2 \> op ~ oz) * r= - Fat 5 apt fu (390) 


ae 198, Su) al |, a8 
UAT aM az / 
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Similarly, the equations of motion in the form of equations (2) can be 
written 





a 
Mu Or oy, OE ne 
ap Z 9z r r 
Wa Wu Wu | We¥u i ap : 
> 30 Z 3a + eoMlr = - oF ag t tu (40) 
Wao We Lae yp: 
r ap 232 paz Z 
where 
fi=- LLP, 
iy 2: ee 


Because this vector f is parallel to on, it is perpendicular to 
the relative velocity of fluid, or 


fy + fy, + f Hae 8 (41) 


By the use of equation (41) and equations (39), it can be show that for 
steady flow on an S, surface, 


Di Ds 

at T ( 418} 

. which: agrees with equation (20). Therefore, for the present problem of 
steady relative flow on a stream surface, the relation (41a) can be 
taken either as one of the equations of motion or to represent the rela- 
tion given by equation (41). In other words, there are only four inde- 
pendent relations among equations (39a), (39b), (39c), (41), and (41a). 


dust as in the case of the continuity equation, either set of the 
preceding equations of motion is expressed in terms .of. the special par- 
tial derivatives with respect to the two independent variables 9 and 
z» The effect of radial pressure gradient is taken into account in all 
these equations by the f term, which is neglected in the ordinary two- 
dimensional treatment on a surface of revolution. Equations (28), (31), 
(34), and (39) or (40), however, lead to a possibility of correctly 
solving the three-dimensionai flow of fluid particles on an 5, surface 


in & mathematically two-dimensional manner. 


Principal equation. - The equations of continuity and the equation 
of motion in the circumferential direction can be combined into a prin- 
cipal equation through the use of a stream function y as follows: 
First, if a variable b is introduced such that 


4V 
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i. n, W 
n. © 
or 
' L 
D ny. 2) bt oa 
Inocs= - Ce ———)] dx = - = 1G —} dx 42a 
by ‘ ( n. W n, © ( ) 
4 Ly 


in which the integration is performed along a streamline on the &1 
surface, then the continulty equation (34) can be written 
a(bpW.) = a(bpW_r) 
ul Z, 
ee ef eee 6) eyiac 
oo ee (34a) 


The preceding equation is the necessary and sufficient condition that 
there exist a function y with 


oy 
So = TbeW,, (43a) 
oy 
a = ~bew,, {43b) 


The difference in y_ at two points j and Ek: on the 5y surface is 


1 
&K k 
yee ye | ays | dplwyr ap - Wy, az) 
j j 


In particular, the difference in Y at two points j and k on the 
constant-z Plane at the inlet where the fluid state is uniform is 


(pd 


These two equations show that, physicelly, the integrating factor b 
can be interpreted as proportional to the local radial thickness of 4 
thin stream sheet whose mean surface is the stream surface considered 
here, The continuity equation (34a) can also be obtained by consider- 
ing the masa flow going into an element of such a stream sheet as shown 
in figure 7. By equating to zero the mass Plow going into the element, 
which is defined by two axial planes dp apart and two normal planes 
dz apart (see fig. 7{a)), and letting dp and dz approach zero, 
there is obtained. 
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a(roW,)  a(teW_r) (54) 
—sp 1 32 * 
where 7 is the radial thickness of the stream sheet. From equa- 
tions (34a) and (34b), it is apparent that b is proportional to T, 
and the differences in wy at two points j and k as given by the 
two equations preceding equation (34b) are proportional to the mass flow 
across any line joining the two points. In actual computation, only the 
ratio b to b; or T to Ts is important (a different initial value 
amounts to a different constant muitiplier of the relation between 1 
and mass flow). In the following, b will be retained in the equation, 
but in actual calculation it is simpler to eveluate the ratio T to 

T,; ‘than to evaluate the ratio b to b;, both from the data obtained 


on the So surface to be discussed later. Although the evaluation of 


this ratio requires, in general, calculations on the S5 surfaces, a 
means is nevertheless provided to determine correctly the flow on a 
general 6, surface through iteration. 


¥rom equation (43), 


bp 2 Mz _ 1 ay 1 Bind ay, Mu 2 ay (448) 
rap 125 ye = 8 Pp DD, r@ op 
bp t= AY _ 3 Im bp ay (440) 
P 32 age az Oz 


The third terms in the preceding equations can be expressed in terms 
of h ‘through the use of equation (12b): 


dinbp =dinb +5 dh - ds * (45) 


a. 


where 5* = s/R. But from equations (9) and (43), 





Then from equations (45) and (48), 


C.,2—17 & 
[a - (v2 +'4,7)] 2 2an be 2 8 |S a” (2 Ind ast) _ 
Wy" 2 tr\ ap 3G 
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Substituting the preceding two equations into equations (44a) and (44b) 
and adding yield 


Fa 
ow, 2 ws, WW Ca 
epla® - (n? + We \1(3 se - ) <5 ANS? 


x” ak az° 


2 Bane a7 2 a2 2 2 
(a2 2) 2 22 (or) -8. (2 ind ast) & Ny’ Pujl ab _ 
4° g¢¢ [2 ap a r ap ap ro: n,|/F ag 





Substituting the preceding equation into equation (39b) and dividing by 
ae give the principal equation for the determination of Fluid motion 
along & general 5; surface; 





2 
Wo) 2a a®y WW 9%y We \a*y | Nav ay 
te) ee 8 a oe t \ - eet Se tM SEH 0 
8, r @a ar a / 92 
48 
where (48) 
2.2 2 
” @ind,~a* 1 { at, Wr Or th, ty 
ee: OZ 2 \ az Y az r ny. 
¥ a Re 
2b Bind, leas*,1( Lat Wr die af-(Wu"Hiz*) We 
Pec rep *r tp tae\ 3 3e'tr we r w, 
2.2 R: 
a (1,4 SLY an, Wa Bi ~ (i, Hi, *) | -2 2 Eee wi, (2 20 
a 2 rap r ut r an 


The equation of the characteristics of the differential equation (48) is 
(reference 31) 
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a ya a 
( -=>- : x3 ) +2 2 rat oe oe 0 (49) 





from which 


G 2 ra a 
r OM, an (Wut + Wet - a) (50) 
az aon,” a“ -W," 


Equation (S50) shows that the characteristics are real when 
NWy? +W,° >a, in which case the method of characteristics for two inde- 


pendent variables (references 20, 30, 31, and 32) can be applied. When 


AW? + W2° < &, the characteristics are imaginary, and it is more con- 
venient to solve the equation by relaxation (references 25, 33, 26, and 
29) and matrix methods (references 26 and 29) in the following forn, 
which is obtained by substituting equations (44) into equation (39b): 


tay ay [2 2 in bo | Mu) ag) ain be oF . 
r= ape Q7% re r= Ive) n,./ OD OZ oz 
Z W.. oaW.. W 
le) 1393 Tr 
a Retr tee 4 6) (= + 20) » 0 (48a) 
r an 


Procedure of solution. ~ It may be noted that equation (39b), 
instead of (39c), is chosen to form the principal equation (48) or (48a), 
because f,, 18, in general, much smeller than f.. The various quan- 
tities appearing in equation (48) or (48a) are to be computed from 
other equations given earlier, With the introduction of the stream 
function, there are altogether seven basic independent relations - one 
energy equation (21); three equations of motion, (48) or (48a), (39a), 
and (39¢); two equations between wy derivatives and fluid properties, 
(43a) and (43b}; and the orthogonal relation between W and f, equa- 
tion (41) or (418). On the other hand, there are ten basic dependent 
variables in y, b, W., Wy Wo, ff, fy fz, 8, and I (or h or op) 
to define the flow and the shape of the surface. In general, the vari- 
able b is to be evaluated according to equation (42a) or from the var- 
iation in the radial thickness of stream sheet using the data obtained 
in the solution of 85 surfaces and is therefore considered as given 
here, J? during the comlete solution of the three-dimensional flow the 
shape of an §) surface is taken as the one obtained by joining corre- 
sponding streamlines obtained on 85 surfaces of the preceding cycle, 
two relations between the n- (or f-} components are given by equa- 
tions (29), and there are now altogether nine equations to be solved to 


NACA TN 2604 at 


find the nine unknowns, Alternatively, the variation of W, may be 
considered as known from the 5S, solutions of the previous cycle, and 


the remaining elght variables, which determine the flow on and the shape 
of the §, surfaces, can be determined from the seven preceding rela- 


tions given and the following additional relation: Because f,, f,, 
oS 18 


and f,, respectively, are proportional to e Foe and 30 of the 
integral surface §, they satisfy the following equation (reference 34): 

fFeVXfH O (51) 
Which may be written 


t |S ag ar] t tals - ae) + tel$—e—- Feo 


(51a) 


By using equations (31) and (32), equation (51a) becomes simply 


£ tc 
J 7 > 29, os (51b) 
ap \f.. az \f, 


This equation can be used to give e by integrating along a constant 
(p line: 


A ta £ 

_ = (=) + xe &) ax (Sic) 
x r/ m2, Zo ? Vy 
eat 2= 2, f,= 0, then 
Z 
f r 
r 3 Z 

_,3s=-— — |_— |} dx bld 
Li ry acp =) ( ) 


In this case, then, the shape of the 5S, surface is determined after 
the f-components (or n-components) are obtained in the solution, In 
either case, equations (21) and (41a) are invariably to be used first to 
Getermine the change of s and ft. If the flow is isentropic, s 

and I remain constant along its streamlines on the surface. (For 
such a case and for a uniform ace condition, p in the continuity 


equation may be replaced by n-+ and, consequently, the ® and z 
derivatives of 38, as Well as those of I, will not be involved in 
the equations (46) to (48)}. In case of heat transfer or shock, the 
changes in s and I can be estimated by whatever method is avyeilable 


= — eee ee 
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and used in the calculation. For supersonic flow, all the equetions are 
psed to compute the fluid state at each point and the solution is carried 
downstream step by step. For subsonic flow, iteration over the whole 
domain is necessary. The details of these computations will be given in 
the last section. In general, the solution of the flow on the general. 
5) surface is very laborious, and is to be used in the final stages of 
calculation of the complete three-dimensional problem or when a high- 
speed computing machine is available. 


If the flow is such that it may be assumed to take place on 4 sur- 
face of revolution (at the hub and casing welis or other radii), the 
equations are considerably simplified as follows: 

Fiow along Surface of Revolution 


When the 5, surface is a surface of revolution, 


n,=fy= 0 (52) 


fe 
Ss -p-sg-= tang=A (53) 
cy 


where A is a given function of 2. (For a conical flow surface, A 
is simply a constant.) Equation (35) now gives 


W OW OW 
Rt 7s <3 tA x= (54) 


Whether c can be taken as zero will be determined by the relative 
Meenitude of the three terms on the right side of the eguation. In 
general, for nonneglipible c, equations (43) now become 


If 


C 


= = rbpW, (55a) 


oY 
ae = - baw, (55b) 
Because Wy, is now related to Wy, by equation (53), the three velocity 


components .can be solved simultaneously as follows: By use of the rela- 
tions (52) and (53), equation (39b) can be changed to 


OW aw Ww 
LT ON, Ww u 1 /10~ fT 328 
rop oz (2+ ») -& (ERE) 0 (56) 


it 


— re a es os - - a . 
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Instead of equation (46), 


2.9 7 a Pa 
h=eiI+ — Z (bp) . | (a2) (; 3) + (2u\" (57) 
should be written, Then 
bo Lor af {3i1nb dat = 2, Wz o%y cee 
(a aay ay Se ee oS 3 eo) ~ (bp) jas) & av 
(a2.y2) 94m be OE Py 4 a2 (2 in b 2") + (Wy2HH1,2) Awe? = _ 


az “OZ 


(58) 


Combining the preceding equations with equations (55) and substituting 
the resulting equation into equation (56) give the following principal 
equation for the flow on a surface of revolution: 


a . aly 2 
W a Whos 3¢ Wy. +W 
2 u } 1 3% 2, Wiz ay vt We | aty NW Oy ay 
ane) ) 3 By. (lah) aa BA ) r op “gz 7° 


(59) 


Owe yr ft 2 ee 
ok ; a=W? WOW obo r 
_ 3ind , @s L (<3 ro" n+ weer B) 





o,f itdinb 1lads* 11 ie Qn -1 {130 73s 
w= - (ant) | FOP 2 Ss +25 woes (F SS" F ) 


For this equation, the characteristics ere real or imaginary when the 
resultant relative velocity W is supersonic or subsonic, respectively. 
Yor the subsonic case, it is again better to use the following form 
Obtained by differentiating equations (55) and substituting the result- 
ing equations into equation (56): 


. ——— = ee — - | 
ee eee ee ee ee se ee eee Sl 
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Pa o 
2, lovey ray 1 dinbo oy S@ilnbp dy 
(144°) 5 SE PHASE | (a4) Op Op Bz Bz - ¢nbbp ~ 
(bp)*{1 dn @ ds\ _ 
r ofp 


With A given and b determined from data chtained on the Se 
surface, there are now the six independent relations equations (21), 
(59) or (59a), (55a), (55b), (535), and (418) for the determination of 
the six main variables in ¥, W,, W,, Wp, 8, and I. The f-componenis 
are not involved in the calculation. If the flow is adiabatic with 
uniform I and s, the equations are further simplified. 


Flow along Cylindrical Surface 


If the flow near the walls of an axtel-flow turbomachine can be 
considered to take place on a cylindrical surface, then 


m= u,=f,2f, = W,= 0 (60) 


Fquation (35) now gives 


ca > (61) 


which is relatively small. (If ¢ is negligible, b can be taken as 
1 everywhere.) For flow without change in radial distance, the quantity 
We 


Tt canbe replaced by 4H, ( = b+ =): The equations governing the 


cylindrical flow are then {compare reference 29) 


< = rbpw, (62a) 
oy 
oz = = DoW) (62b) 
3 
Top = @ (63) 
D 
Ay _t De (64) 
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and 
a 
, War) a oy 4 Meaty (We ley we, Lg 
af / r@ dor aX &POz a® /dz x oP TF sb 
where 





2 
ye -tSind Los 1 a Hy a hy a’ yw? P35 
roe rop ° 22 Wee rap Wee r op 
or 
1 dy oy {1 Sin de BF, Bin ve By\ (dp)27 /2 Hy Bs) | 
a a i ae rop rode? 
x On 


In general, the circumferential derivatives of H, and 8 are to 
be determined by the inlet flow and equations (63) and (64). For adia- 
batic flow with uniform E, and s upstream of the blade row, these 
derivatives are equal to zero everywhere, making the problem much sin- 
pler. The main difference between this simplified case and the ordinary 
two-dimensional fiow on a eylindrical surface is the inclusion of the 
factor b in equations (62) and (65) (in general, b is a function of 
® and 2). If the velocity diagram is such that there is considerable 
radial gradient in the radial velocity or considerable variation of the 
distance between the adjacent streamlines, the factor b is not 
negligible. 


EQUATIONS GOVERNING FLUID FLOW ON 5S; SURFACE FOR RADIAL-~FLOW 
AND RADTAL-DISCHARGE MIZED-FLOW TURBOMACHTNES 


Flow along General 61, Surface 


For turbomachines with radial discharge, r and @ are considered 
as the two indépendent variables; that is, 


GqG@=eq Lz, ? > z(x ,¢p) | 
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Then 
#-3- = 3 
Lea. idq _ Su og (66) 
ra r oD n, oz 
Dq aq, Vu ag 


Equations of continuity and motion. - By the use of these relations, 
the equations of continuity and motion become 


1 S(otyr) 2, atoWy) 


ea aie ete ae ! 
stg or r <a 4 = 
where 
; OW. OW, OW, 
eta - = (a, ae +n get Sz (68) 
and. 
.G 
: Wy 7 Wy Nu oy. lik: go, = - Shp O83 eo (69a) 
Wyly nm OO: eae ee 
r rar r ap x oP = ne ee : 
(69b)} 
ay W,, oW 
Zz 2 Us J 
DE = Wi ar + r ac = tS. (69¢) 
with 


pe -2(B 28 28) 4 = --2iZo (70) 


oo Principal equation. - If a veriable b' is introduced such that 


a cn 
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Or 


(71a) 


oye 
tt 
e 
It 
| 


in which the integration is performed along a streamline on the surface, 
then the continuity equation (67) can be written 


a(bipWwr) afb'pwW.) 
i ee 


= 50 (72) 
and a stream function: fy can be defined on the surface with 
o 
SE = -b' ply (73a) 
2 cb ' pi, (73) 


Here b! can be interpreted as the thickness (in the z-direction) of 
the stream sheet whose mean surface is the Sj surface considered. The 
continuity equation (72) can again be obtained by equating the mass flow 
into and out of an element of the stream sheet as defined by two axial 
planes ad angie apart and two cylindrical surfaces dr distance apart 
as shown in figure 7(b). As before, the difference of yw at any two 
points on the 87 surface is equal to the mass flow across eny Line 
connecting these two points. By the use of the preceding two equations 
and the relation 


oye W & _? Fas ia 
ma rE eB ERY] cn 


the principal equation for the flow of this surface is obtained from 
equation (39b): 


vas a 
Wr" \ ay Weil 1 oey Wa | 1 aty ay oN 
a -S)\ s- = set \l - 5 tue ie 








eB, aré a= F ara? a® / r@ age r 
(75) 
where 
sinbt as* 1 ; 51 aw, a —(W."49,,") | 
w= - er tar toe or 82a te 
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a ee 
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This equation is seen to be hyperbolic or elliptic when Ain Fy Ls 
greater or less than the speed of sound, respectively. For the elliptic 
case, it is preferable to use the following form: 


ra t 1 
ee ee ee a ae 
ar az pe am re 


The integrability condition (51) is now written 


a fur 
3 r o 
. be a) on 


hence 


r 
e £ £ 
— 7 “) 41 2 (=) ax (76a) 
a va em =. & 


The procedure of solving the principal equation with the various terms 
in it determined by other flow equations is the same as that in the 
previous systen. 


Flow along a Surface of Revolution 


For the special case of flow on @ surface of revolution, equa- 
tions (92) and (53) hold (with A considered as a function of r) and 
the expression of c' reduces to 


OW OW 
re a z 
“kK Oz 7 Oz. (77) 
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Furthermore, equations (73) become 


SY = -b'pll, (76a) 
oe = rb "pW. ) (78b) 


and equation (69b) becomes 





gives the corresponding principal equation as 


2 2 no 
L-—— —z 74 1 4 = —— +t = Yan dks HS 
a, ar Ao) aor r or © 
(81) 
where 
eee 2 omer 

Mee -oimbt, ask a (oar, a er Wr" an 

— oy ar 2\ or r “> dr 


we - (24LYL Sim bt 135%, 2 or) Fw? fam, 1 /SE_,, ds 
nar’: ro SG 3M “12 Sg)" Ww. . 2. \Sp 77 Oe 
: r 


2 2 
1 1 3 1n b? 
y+ H+ 3/54 [24 ee ee 2) 2° bip oy | _ 


ar = 4) 2? BF * 32 dep ow 
—  (b'o)* fL dr Tas 
r 


26 NACA TY 2604 


Flow on Radial Plane 


For the special case of flow on & radial plane, 





n,=m,=f,=f, = Wz, = 9 (82) 
and equations (77) to (81) reduce to 
oe 
c's = = { 83) 
Ov _b 'pWy (848) 
oF rb pw, (84) 
he t+t oer? 1 wy , (av (85) 
= dal =- r op + or 
2 a a 
a2 / dre ate FOP at | p22 or or OM 
(86) 


where 
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Alternative Form of Equations for Flow along 
surface of Revolution 


The equations given in the preceding secticns are obtained for 
turbomachines to avoid an infinite value of the partial derivative with 
respect to 2. Difficulty still exists in using either of the systems 
in the case of a mixed-fPlow type machine with an axial inflow and a 
radial outflow. For solutions of general H 7 surfaces, this difficulty 
can be avoided by dividing the machine at the middie of the flow path 
and using the first system at the inlet portion and the second system 
at the exit portion. If the $8, surface can be approximated by a sur- 
face of revolution, it is convenient to use a set of orthogonal coordi- 
nates 2 and , where 2% is the arc length of the generating line of 
the surface of revolution in the meridional plane and @ is the usual 
cylindrical angle (fig. 8). Because 


Wy = ar —- sin o (87) 
and 
We dz 
then, for use with the first systen, 
aq L og 
Zz = Bec oO 2 (89) 
and, for use with the second systen, 
a4 Og 
Sr CHC Oo x (90) 


By use of the preceding relations, the equation of motion in the cireum- 
ferential direction as given by either equation (56) or (79) for the 
two systems, respectively, becomes in both cases 


OW OW W 
1 ONG ul u 1 /LoLl Tas 


which agrees with the results obtained in references 23 and 30 in a 
different manner. The subsequent equations given in these two refer- 
ences can be modified and used for such surfaces. (The Last term on the 
left side of equation (79a) represents the rotationality of the absolute 
flow and is not included in reference 29, which is derived for itrrota- 
tional absolute flavw.} 


" 
ed age . ae — me i i ke pr | 8 
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By comparing the integrating factor b used herein and the thick- 
ness of the stream filament of revolution + used in these two refer- 
ences, it is seen thet the two play exactly the same role in the con- 
tinuity relation. Although b is obtained mathematically as an inte- 
erating factor, physically it may then be visualized as the thickness 
of the stream filament in the r or z-direction for the two systems, 
respectively. The use of bD herein is, of course, more general in that 
1% varies two-dimensionaily over the surface in the general case, where, 
as in references 29 and 30, 7 is considered a function of 7 only. 


EQUATIONS GOVERNING FLUID PLOW ON So SURFACES 


In the preceding section, it was shown that the determination of 
the flow on 5; surfaces requires a knowledge of the radial variation 
of the velocity components. This knowledge can be obtained by following 
the Pluid motion along relative stream surfaces of the second kind, So. 
On So, the relations (28) to (31) also hold. These relations, how- 
ever, will now be used to eliminate the independent variable ; that 
is, any quantity q on Ss is now considered as 


q=q4 | x, 2 (Tr, z) | 


Accordingly, on 8». 


aq _ oa _ St 1 og 
ro ny,r op 
(91) 
aq og _ Bz 1 oq 
Qz OZ yr OM 
and along a streamline on 5a 
Ug wy. 22 94 
bi = Yr op + Wz GS, | (92) 


Equations of Continuity and Motion 


Equations (30) and (91) are used to change the continuity equa- 
tion (35) to 


u a( pW r) : a( pw) <= tial (93) 


er az 


6V 
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where 
OW OW OW 
C(r,z) = - eS (oo so + ny so + Dg so (94) 


For general rotational motion, the equations of motion (14a) in the 
three perpendicular directions are 


Wy | O(¥,r) Wy fe ail) ST aa 
“ep Le 7 op | the Ss ey zr aE 


We | O(Vur) Hy 2 Wz My 102, Tas (95) 
rl er op] "2@\r op " / * rapt ap 


WW, Wy, 1, Wy — OL Os 
~ tp (See ~ ee) + wy (2 Se. ~~ O27 OF 


In following the motion on S 9, equations (95) are reduced to the fol- 
iowing form by using equations (9), (16), (31), and (91)- 











W,, a(Vx) aw ow 
epee + (2 - x) - ae ae. (96a) 
a or az or ar ar 
W,. 3(V,,7) . W, a(v,r) ON ae D(V,,r) (960) 
Yr or ro gz. uU uw Dt 
J kd » ef Yur) = ~8f 7 BaF (96c) 
— NEN 82 ar r EY oz OZ Z 
where F is a vector having the unit of force per unit masa of gas 
defined by: 
1 {dh Os 1 lop 
Fe. — ~ 7 Ne - 8 n 37 
mr \op = nx p op (97) 


A similar result is obtained for the equation of motion in the forn of 
(2): 

a 
OW. OW, Vy 


__i® 
tN. 2 ~ &e = "partF 





Wy AV r) W, a(V,r) 
eer te Ter Fa oe) 
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Because the vector F is normal to the 5 surface, 
FLW, + FLWy + FW, = 0 (99) 


By the use of equations (99) and (96), it can be show for steady flow 
on an Se surface that 


DI = = (99a) 


This result is the same as that obtained for the 5, surface. Again, 
for the present problem of steady relative flow on the 6S» aurface, the 
relation (99a) can be taken either as one of the equations of motion or 
to represent the relation (99). In other words, there are only four 
independent relations among equations (96a), (96b), (96c), (99), and 
(99a). In the following development, it is found convenient to use 
equations of motion in the form of equation (96), not only because 
ol/dr is zero in many design problems (whereas op/dr 4 0), but also 
because equation (96) leads to a form capable of a rigorous solution 
for both subsonic and supersonic flow and shows clearly how the various 
design factors affect the three-dimensional motion in general. (See 
equations (106) to (114) that follow.) 


In & manner analogous to the 8] surface, the continuity equa- 
tion (93) is put into the form 
a(xBpW,.)  3a(xBpWw,) 
a = CO 


or oz (100) 


by the use of an integrating factor B, which is related to C by the 
following equation: 


D in B ainB ain B 
pe ey HG +, SSE = (101) 
or 
+t cf 
B C 
ln = = C dx = sx (1038) 


Hgquation (100) is the necessary and sufficient condition that a 
strean function fy exist and 


ay 
ar = TBeW, (102a) 
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212 
N [<= 


= - rBoW,, (102d ) 


The difference in VW at two points j k on the 5S. surface is 


_ W . | aay a rBo(W, ar - Wr dz) 


Similar to’ the flow on the 8, surface, the preceding equation 
indicates that B is proportional to the anguiar thickness of a thin 
stream sheet whose mean surface is the stream surface Sp considered 
herein and whose variable circumferential thickness is equat to YB. 
Indeed, if the mass flow into and out of the element of such a stream 
sheet (cut between two planes normal to the z-axis, and a distance dz 
apart and between two cylindrical surfaces dr apart (fig. 7{c))} is 
equated to zero and the distances dr and dz approach zero as a linit, 
the following equation is obtained: 


a(T pW.) r a(Tpw,) 


— so ge 0 { 1008) 


Comparing this equation with equation {100} and considering the mass 
flow relations show T to be proportional to rB. This proportionality 
IKNeans that B can be physically intepreted as a quantity which is pro- 
portional to the angular thickness of 4 stream sheet whose mean surface 
is the 5, surface considered herein. With this interpretation, B is 
immediately seen to be closely related to the angular distance between 
two neighboring biades. In actual calculation, only the ratio rB to 
(xB), or tT to Tz is important, and it is also easier to obtain the 
yariation in rB from the distance betveen adjacent stresmlines 
obtained on S surfaces than to evaluate B/B,; by equations (101s) 
and (94) using data cbtained on S$] surfaces. 


Principal Equation for Case with Vr Given 


In the soiution of flow on an So surface, the continuity equations 
and the equation of motion in the radial direction are cambined to form 
the principal equation. The principal equation will now be obtained for 
two main groups of present designs in which a certain desirable varia- 
tion of the angular momentum of the fluid Vir and of the ratio of 
relative tangential and axial velocity are prescribed on the 52 .m sur + 
face, respectively. These equations can also be used for the solution 
of a direct problem, in which the same information obtained on 657 
solutions of a previous cycle is used as known values in the 65 
solution, 


— i me ee ee ee ee re ee 0 ee a ee eee 
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For the first group, the following equation is considered known: 
Vur = Glr,z) (103) 
Among this group of designs are the free-vortex design (in which G is 
simply a function of 2), the more general "solid-body rotation" design, 
the "symmetrical velocity diagram at all radii" design, and others (for 
example, see references 17 and 18). 


From equations (102) and (45}, 


ae acy 1 1 3h 39s* 38 inB\ 6V 
wo 229 ¥, ft 4 as (3048) 
ar ar” ( r 2” ar — or ar 
WwW, 869% 1 @h Os* 93 in BY ay 
~ 8p ge 2 (SHtn ya eee) 


But from equations (9) and (101), 


2 2 | 
2r2 OW 
_ oer uw L -2 a ay 
he l+“3--—3-§ (xBo) (au) + (2%) (105) 


Differentiating with respect to r and az gives 
a@.(W A417) ofr?_4, 2 « 
em are ot) (woe) (E+ 222 28% 


Be or or 
(rBo)-2 (3, = a — We av 


groz 


2_ (4 2ay_2 otre_y 2 
ao — | 4 _O+ Wi, 
oP tiPeia) an 2 (zr u )s (W247) (222 _ 2). 


a 32 oO 2 az az 


(rBp}"> (4, ayy 2¥) 


ardz r az" 














; Substituting the preceding equations into equations (104} and adding 


give 
aw. aw a2 
2. (Hy 21,2) | aa ees (oP ,2) SY acy 
E (Wy WH, ) xBp (= =) - W 2 WW, Soop t 
2p QrP 
(ain?) 28 | 28 2 fe - a wwe) ec (f ee Ses 
pews Ae r or P gr ar ar 


a ae 
a Wi 2{3 in B os*\; ov 
[2 (cme) Poe = = (106) 
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| Substituting equation (106) into equation (96a) and dividing by 
2% yield the following principal equation for the fluid flow on sur- 


face 59: 


Wee) «2 WW, a2 W7\ a2 
f ae rz o'y (a oF Fe Meo 








7 as are . ni OCS 7 ao Pao ar a 
(107) 
where . 

mae. 2 dmB, as* 1 fal oWu 
~ Sz az 2\dz2 44932 

t a@inB, as* I fal OWy 2) 

Ver; 7 “pr get (RE ow, Se + ote 7 
0 (Wy) hy 3( 

a Wy aI np Ly , he Yur) 

a2 ar * or "8 * Yr 


From the coefficients of the second derivatives, the principal equation 
is seen to be hyperbolic or elliptic when the meridional velocity 


WO Afit,? + W,* is greater or less than the speed of sound, respec- 


tively. For the elliptic case, it is again convenient to write the 
principal equation in a slightly different form. From equation (101), 


wp OZ. OY _ LOY _ 9 Im Bo BV 

or = r or ox or 
(108) 
«Bg az 


Substituting into equation (96a) results in 


c an 
3 
y lav av (ain Bo av, 3 in Bp av), 
ane Y ar are ar ar az a2 


2 k a(v,r) | 
rB LW u OL as 
(Bo) Be BE 8 = 0 (107a) 


or 
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With the variation of V,, or W, prescribed by the designer in an 
inverse problem or taken from the previous 5, calculation in a direct 
probiem, the meridional velocity components are determined by equa- 
tions (107) and (107a). (Other equations are used to determine various 
terms Involved in the coefficients M and N.) 


Principal Equation for Case with W/W, Given 


In the second group of designs, the following relation is pre- 
scribed on an 82,» surface (for example, see references 17 and 18): 


W 
—— = g(x,Z) (1.09) 
We 


In order to result in blades with the mean blade surface composed of all 
radial elements (for high-speed rotation), it may be Gesirable to spec- 
ify amean 55 surface consisting of ail radiel elements. Then 


Wa 
= r g3(2) (130) 
Wy 


Similarly, in order to obtain a cooled turbine rotor blade with minimum 
twist, the following function may be specified on So ym? 


Wy 
a, = Bo (2) (11.1) 


In application to direct problems, one of the preceding relations is 
obtained from the 5, soiution in the previous cycle and is considered 
as given in the 55 solution. Im both inverse and direct problems, 
with the relation between W,, and W, given by these equations, all 


three velocity components are to be combined into the main terms of the 
principal equation as follows: Substituting relation (109) into equa- 
tion (96a) gives 


a, | OW, 


a 
cite") ae- - ge 


&, 36 tf, 3t as = 
+6 (+28) iz + ang + pt (- Bae Baw ‘@ 
(112) 


Instead of equation (105), 
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should now be written. Differentiating with respect to r and a, 
combining with equation (104), and substituting into equation (112) 
give the following form of the principal equation: 


Wye \ 92 WrWs 32 WCW, 2) ar 
(1422) (255) Ory _2(14+g2) 2 or¥ (oe Orv en ei Seo 
di 








a’ / aré a2 orez ae 472 ar azn 
(114) 
where 
yw - . 2nB , ast -4(2 W24 22) 
oz Oz g2 \9Z ga az 
_ 2e,/ 1, 8inB_ as* 1 (2 2 2, 38 
N (1+e¢°) ie = 3 art wer - Wy Ba || + 


This equation is hyperbolic when the relative velocity is supersonic, 
elliptic when the relative velocity is subsonic. For the subsonic cage, 
a form of this equation more convenient for computation is obtained by 
substituting equation (108) into (112): 


A a 
2, au /1 _ag\ ay afy r 2, 3 1n Bp av 21m Bo ou 
(1+8*) are (2 & ee \ Usk ~ | (l+8*) or ar” Dz az_|* 
(+Bp)* al as 
egorBp + “Sy -3r tis, t Frj=0 (1148) 
ar 


It may be noted that for both groups, equation (96a) -rather 
than (96c) is chosen to cbtain the principal equation of the present 
problem, because F, is always much smalier than F, in axial 
machines and Fy is zero or nearly zero on Se surfaces for high-speed 
centrifugal and mixed-flow impellers whose mean blade surfaces are 
usually composed of all radial elements. (For low-speed centrifugal 


impellers, equation (96c} can be used to form the principal equation 
in a similar manner.) 


a ig a mm a ee er ee ee ee ey  — - e e 
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Procedure of Solution 


Although the equation of motion (96a) is chosen to form the prin- 
cipal equation, other equations are to be used to obtain the various 
terms involved in the principal equation. As in the case of feneral 
S) surfaces, there are ten basic variahles to define the flow and the 
shape of the Sp surface. They are: Y, B, W,, Wy, Wo» Fh, Fy Feo 8; 
and I {or p}. B is considered given. (In the direct problem, B 
is evaluated directly from the distances between adjacent streamlines 
or according to equation (100a) using the value of C obtained on By 
surfaces; in the inverse or design problem, B is estimated (refer- 
ences 29 and 35) from the blade thickness as desired from blade stress 
and other considerations.) On the other hand, there are seven inde- 
pendent relations in one energy equation (21); three equations of 
motion, one of equations (107), (107a), (114), or (14a), and equations 
{96b} and (96c); the orthogonality relation between W and F, equa- 
tion (99a); and the two equations relating WY and velocity, (102a) 
and (102b). 


Direct problem. - In the direct probiem, two alternative procedures 
may be used. If the shape of the Se surface (determined from the 
data obtained on 8; surfaces) is considered as given in the present 
Se solution, two additional relations between the fn- or F-components 
completely define the problem. The procedure of calculation is as 
follows: 


(1) Use equations (20) and (21) to determine the variation of 
s and I. 


(2) Compute W, from the orthogoality relation as follows: 


ir Og, 
Ww, = ~_ a We + 5H) 


(3) Compute F, from equation (96b). 

(4) Solve the principal equation. 

(5) Obtain W, and W, from equations (102). 

If only the tangential velocity or the relation (109) is taken 
fron the 8S, solutions of the previous cycle and is considered as 
given in the present 6 solution, one more reiation is available 


between the F-components such as that which exists between the 
f-components on the 6, surface: 


F-VxXxF=0 (115) 


=-— = 
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Writing equation (115} in scalar form and using the relations (31) 
and (91) give 


¥ F 
O22). os 
ar (3) ~ 82 (5) (15a) 


By integrating along a constant z-line, equation (1158) provides the 
Polliowing relation to determine the value of F, to be used in the 
principal equation from the values of Fy and Fy: 





FP F = F 
ee (Fz) ’ * Ee ad (2115p) 
Fyr uX/o : r Br 
QO 
If F, = 0, at 2 
F 
| vA 
F... = Br or (=) dz (115¢) 
oO 


The procedure of calculation is as foltows: 


(1) Use equations (21} and (99a) to determine the variation of s5 
and £, 


(2) Compute F, and F, from equations (96b) and (96c}. 

(3) Compute F, from equation (115b) or (115c). 

(4) Solve W from the principal equation. 

(5) Compute Wy, and W, from equations (102a) and (10zb). 

Inverse problem. - In the inverse or design problem of a finite 
number of thick blades, in addition to the blade-thickmness distribution 


or its equivalent B, either equation (103) or (109) ig prescribed on 
&8 Mean stream surface So ym: It may appear that still another rela- 


tion can be prescribed on the mean surface, The differentials of the 
coordinates of the surface are now governed by 


F, dr + Fir dp + F, dz = 0 (116) 


A mit -_ ——— ee 
z ————————— ae ——— —. re —=— 
a nnn a ee ee el - _ =-—o 
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however, and, in order that this differential equation will lead to an 
integral surface of the form represented by equation (28), F mst 
satisfy the condition of integrability as given by equation (115) 
{reference 34). An expression similar to equation (115a) for the case 
of an infinite number of bledes was Tirst pointed out by Baversfeld 
(reference 2} in a discussion of the Lorenz paper (reference 1}. In 
effect, it restricts the freedom that the designer has in prescribing 
the velocity components of the fluid on the surface. Hence, in the 
- inverse problem of a finite number of thick blades, in addition to the 
blade thickness distribution or its equivalent B, the designer can 
specify only one relation on the mean stream surface, which relation may 
be either the tengential velocity as given by equation (103), the flow 
angle between the tangential and axial velocity as given by equa- 
tion (109), the axial velocity, or any other reasonable relation that 
will lead to a solution of the set of equations. 


In the preceding consideration, the hub and casing shapes are also 
prescribed by the Gesigner in the inverse problem. Alternatively, the 
prescription of the hub shape can be replaced by @ prescription of 
another relation at the casing, thereby fixing the shape of and the flow 
along 52 ,m at the casing entirely. The flow is then extended to the 


hub and the last streamline gives the hub contour (reference 19). 


Approximations Involved in Through-Fiow Theory 


When the equations previously derived in reference 18 for a large 
mumber of thin bisdes are compared with the corresponding equations 
derived herein aiong ea stream surface, the two are obviously exactly 
the same if the ordinary derivatives used in reference 18 are replaced 
by the present partial derivatives following the stream surface, and 
if B is equal to 1 or if the variation of B along the flow path 
is zero. In the interpretation of the through-flow solutions as the 
flow along a mean stream surfece (which divides mass flow into two equal 
parts circumferentially) or as the Plow along the’ mean channel surface 
{seometrical mean}, the first difference can easily be removed by simply 
interpreting the values obtained in the solution as those along the sur- 
face rather than in the meridional plane. The second. condition, however, 
is satisfied only when the circumferential variation of all the velocity 
ccmponents approaches zero, or When the ciromferential derivative of 
the tangential velocity and the ratios of mn, and ny, to Ny approach 


zero (see equation (94)}. 


Besides the use as a limiting solution in general and to give cer- 
tain trends where the contribution due to the Pinite number of biades 
is small or constant, the through-flow calculation should be properly 
modified by the factor B in its application to actual turbomachines 
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of a finite number of thick blades. As B can be physically inter- 
preted as the ratio of the local angular thickness of the stream sheet 
to its inlet value, a good approximate value can be obtained by solving 
the two-dimensional flows on a number of stream surfaces of revolution 
starting at different radii at the Inlet. For.the subsonic flow in the 
turbine cascade reported in reference 29 and for the supersonic flow in 
two impulse bladings investigated in reference 30, the reductions in 
anguiar thickness from the inlet value along the mean streamline are 
seen to be a chordwise average of 4 and 9 percent more than the reduc- 
tion in the channel width, respectively. Aliso, in the subsonic cage, 
the inflvence is extended a certain distance outside the blade row. The 
inclusion of this factor B, even if it is epproximate, should give a 
much closer answer than that obtained with B taken as 1. 


In this interpretation of the infinite number of blades solution 
as the solution of through flow along a particular stream surface 
petween two adjacent blades, the distributed “body force" F has a 
definite meaning, as given by equation (97). (For an infinite number 
of blades, F becomes the blade force.) For blades with large turning 
and large redial twist, as in a free-vortex turbine, the influence of 
the radial component of F on the flow is not negligible. 


CIRCUMFERENTIAL VARIATION OF FLUID PROPERTIES BY USE OF POWER SERIES 


In general, the blade-to-blade variations of flyid properties are 
to be obtained from calculations on 67] surfaces. When the twist of 
the 8S; surface is iarge, some other method of obtaining the biade-to- 
biade information is desirable, For subsonic irrotational absolute 
flow, this information can be obtained by extending the solution 
obtained on the mean stream surface in the circumferential Grection by 
the use of power series (without the consideration of the shape of the 
8 flow surfaces}. The various derivatives involved in the series are 
obtained from the flow condition on the mean stream surface. The higher 
the solidity and the thinner the blade sections, the fewer are the terms 
required for a given accuracy. Results obtained in references 29 and 36 
indicate that only three terms in the series wiil be required to give 
surficlent accuracy for high-solidity turbines and centrifugal com- 
pressors. 


The series method will also be used in one of the two methods of 
the inverse solution in which the flow obtained on the mean stream sure 
face is extended out circumrerentially. 


Denoting the absolute vorticity VxXV by E and. using the rela- 
tions (16), (91), and (97) in equations (19) give 


— ——— ee 
fetes at Sa a a a erat ee re pe ee a a eee : 


pues — 
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p i OW, = L a 
r=rlap - dz =roap “r 82 “Fr 3p (1172) 


buSe oe a ae tr dp "Re dp yy) 


1 | oun) ay,| 7a(¥r) F, oh, 4 oH, 
f2=rloer opt rer trap rap (M7) 


Fro + Polo t Bebe Fy atv r) wir a(V,z) Furi 


This equation means that the apparent vorticity, which 1s obtained by 
differentiating the velocity on the mean streem surface with respect to 
the coordinates, is not zero even if the absolute vorticity is zero or 
tangent to the mean stream surface. Substituting equations (1178) 

and {117c) into equation (94) results in 


13%, «FF Fy our) Fe Mar), ree it ¢ (119) 
rop pf [yr or Fyr 3 a Sas 


Substituting equation (119) into equations (117a) and (1l17c) gives 





1 OW, 3 ACY) it FF, | Fy O(Vyr) | F, o(¥,r) ae Fy, 7 Fr 
rop +r 92 Tv pe | Fy ar Fpr 32 Fy ~ By 7% 
(120) 
1 OW, 1 O(Vyr) FF, | Fr, o(¥yr) F, a¥yr) : _ _ Fy 
Op *r or ee lee ar te ee Ot eg 
(121) 


The first derivatives of h or p, and p can be obtained as follows: 
From equation (97), 


(122) 
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Or from equation (9), 


10h lor {*, OW, Wy Ww, 4, — “ 
2 2H (2 oD rop * r MD, (122) 
With 5 known, * can be obtained by using equation ({12b): 
13inp 1 dh 10ds* 
a ao - ee” Ee (128) 


The second derivatives of the fluid properties with respect to 
can be obtained in a similar manner. Differentiating the continuity 
equation (1) with respect to © and dividing by r give 


1 9%(pWr) =, -9*(pw,) 3@( pW.) 


z 
2s tPF + = 3 = 0 (124) 


Equations (91) are used to change equation (124) to 


; 
2 Sep" 2,2) ar op in a" r|az ap Fur 3 of 


(1248) 


Differentiating equations (117a) and (117c} with respect to @m and 
dividing by r result in 











r= oor ry op re ozop rap re 32 ren oe ee 
(125) 
13%, 1 a, 2 o°(Vyr) 2b ag Our) og Fy Oty 
r® ayy" 7 Pop re or op ” ” r op r23r oo r2 Fu jy? 


—— 
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Substituting equations (125) and (126) into (1248) ‘and noting that F 
is perpendicular to -W give 


1 OW, Fe Fer >» O(V7) ,*2 5 AV, ) 1 5 ool, TO 
Fr 


re Sop" = 7-72 2er ap + Br be Op <2 or cp 
1 9 %W,) eeding , Fz Sr Fy Oy 
cn” a> oe Se we ore (127) 


Equation (103) is to be used in equations (101) and (102) to obtain the 
second derivatives of Wr, and W,. The second derivatives of h and 


p are obtained from equations (9) and (123) as 


2 2 2 
1 Om 1 OT 1 Of (128) 





5, $82 ee -S 


{129} 


Dpimilar Pormules can be obtained for higher-order derivatives. At 
a fixed value of r and 2, the velocity components, h, and 9 ata 
hort angular distance away from the mean stream surface 5S. can then 
be obtained by a Taylor series: 


~-~P,)4 p-p.)5 
a(p) = g(p,) + (-p,) at(@,) + Sa)" a"@,,) + on g"@p) +... 


(130) 


fn alternative way to obtain deneity is to use equation (145) (to be 
given subsequently} after the other fluid properties are determined. 
Obviously, the preceding equations are most useful when the flow is 
isentropic with vorticity equal to zero. Otherwise, the variation of 
vorticity along the mean stream surface has to be determined first. 

At present no such method is available. It appears, however, that the 
method of Squire and Winter (reference 37) and Hawthorne (reference 38) 
may be generalized to compressible flow for the variation of vorticity 
along a mean stream surface in turbomachines. 
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DEEPB FOR COMPLETE SOLUTIONS OF THREK-DIMENSIONAL 
DIRECT AND LNVERSE PROBLEMS 


In general, the solution of the three-dimensional direct and inverse 
problem involves the use of both 6], and Se surfaces. In the direct 


problem, starting with assumed flow surface, the solution is obtained 
through the successive (alternate) use of the two kinds of flow sur- 

Face, although a satisfactory approximate solution may be obtained in 

one or two complete cycles. The use of an approximate method of solu- 
tion to get a good starting value on each surface will shorten the length 
of camputetion. For inverse are, the process is usually shorter. 

The calculation will start on the m Surface on which elther 4 con- 
dition on the fluid velocity or the ae itself is prescribed and an 
estimated value of B Yor a desirable blade thickness distribution is 
used. After the solution on the S2,m surface and its shape are obtained, 
the blade coordinates are obtained by extending the solution circumferen- 
tially either by the series method or by the method given in reference 35 
ubing the variation of the distances between the streamline obtained in the 
S2,m Surface. Because it is Important only to obtain the right order of 
magnitude and the right kind of variation (three-dimensionally) of the 
blade thickness, the first solution may give satisfactory resulis. The 
velocity distribution on the blade surface is controlled directly by the 
one relation specified on the 52. surface and the variation of 8. 


Suitable procedure is subsequently suggested for the solutions of 
direct and inverse problems with either Lrrotational or rotational inlet 
absolute motion, at design or off-design flow condtiions, for turbo- 
machines haying various wall configurations (fig. 6). 


Direct Problem 


Axial. turbomachines with nontapered straight walls. - In this type 
of machine, it is desirable to start the computation on 6] surfaces, 
because with short axial blade length, the total deviation of the 6] 
surface from the cylindrical surface is relatively amall, especially 
along the bub and casing walls. 


The following steps are therefore suggested: 


(a) In the initial calculation, the flow surfaces are assumed to be 
eylindric&él and the set of equations (60) to (65) derived for cylindrical 
flow or the approximate method given in reference 39 can also be used to 
obtain the streamlines and circumferential variation of finid state on 
OD] surraces at three or more radii. - 


ay 
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(b>) From the data cbtained in step (a), an Se stream surface about 
midway between two blades is constructed by comnecting the streamlines 
Which divide mass flow on the 5S] surfaces in the same percentages. ‘The 
direction numbers of the surface and the Wy, and W, at the surface 
ootained in step (a) give the starting value of W;, by use of equa- 
tion (31). The factor B is evaluated either directly from the engular 
distances between streamlines obtained in (a) or according to equa- 
tion (100a) with C evaluated from the information cbtained in (a). Its 
value at other radii is obtained by interpolation or by proportioning 
according to the channel-width ratio. Calculation of the flow on this 
surface is then made by the use of equations (91) to (115). For subsonic 
flow with irrotational inlet flow, the solution obtained on the 52, 
surface is easily extended circumferentially by series expansion using 
equations (117) to (130). The values obtained can be further adjusted 
to fit the given blade (reference 39) and can be used in a more accurate 
second calcuiation on 5] surfaces in the next step. For subsonic flow 
with large rotationality at the inlet and supersonic flow with signifi- 
cant check caused by the blade entrance angle, it is more desirable to 
obtain the information on circumferential variations by the use of two 
or more 52 surfaces at or near the two blade surfaces. 


(c) The radial variation of fluid state computed from the solution 
obtained in step (b) or the variation of the radial distance between 
streamlines is used to determine the factor b and used in the principal 
equation (48) for a more accurate determination of $7) surfaces and the 
flows thereon. The general equations (32) to (51) should now be used 
for the 8, surfaces located between hub and casing, if not at or near 
these walis. 


(a) The calculation of Ss surfaces can again be repeated and so 
forth. 


if the inlet flow is quite rotational, so that the S81 surfaces 
along the walls and the 6 surface near the blades may turn around the 
corners, these surfaces should be chosen at a short distance from these 
boundary walls as shown in figure 3. By the use of these two kinds of 
surface, the secondary flow caused by a rotational inlet profile or by 
the turning of the blades is inciuded in the complete solution. 


Axial turbomachines with tapered or curved walls. - The steps 
involved here are quite similar to those of the preceding case, except 
that for the initial calculation of §, surfaces along or near the 
tapered or curved wall, either equations (52) to (59) are to be used, or 
equations (13!) to (235') given in reference (35) can be used in the 
manner given in reference (39). 


SV 
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Radial- and mixed-flow type turbomachines with curved walls. - In 
this type of machine it is not desirable to start the computation on the 
S| surfaces because the flow surfaces near the wails may deviate con- 
siderably from surfaces of revolution because of the long flow path. On 
the other hend, the solidity of the blade is very high and the blade 
section is uniformly thin. As a result, the shapes of the Soa surfaces 
are closely related to the blade shape and the factor B can be esti- 
mated relatively accurately from the blade thickness distribution. The 
following steps are therefore suggested: 


(a) The computation is begun on the S2,m surface. For subsonic 
irrotational inlet flow, computation need be made only on a mean £&8e 
surface and the solution can be extended out circumferentialily by equa- 
tions {117) to (130). The approximate method given in reference 40 can 
also be used in the initial calculation. For subsonic flow with rota- 
tionai inlet profile and for supersonic flow it may again be more 
desirable to compute two or more Se surfaces between the blades. 


(>) The data obtained in step (a) may be used to make calculations 
for three or more §)] surfaces between hub and casing walls. 


(c} The processes (a) and (b) can be repeated until the desired 
accuracy is reached. 


Jnverse Problem 


Conditions prescribed on mean stream surface. - Im the inverse or 
design problem it is most convenient to consider a mean satream surface 
of the Se kind about midway between two neighboring bladea to be 
designed (figs. 3 to 5). From the results developed previously for 
such surfaces, it is seen that in addition to the factor 8B, the 
designer can specify only one relation among the fluid properties on 
that surface, which can be either a velocity component, a relation 
between two velocity components, or one other reasonable condition. 

The factor B essentially controls the blade thickness distribution, 
whereas the relation specified on the surface essentially controis the 
mean camber surface of the blade. From a consideration of strength and 
Mach number in general, and the requirement of coolant passage in the 
case of cooled turbine blades, the designer always has a very good idea 
of what kind of blede-thickness distribution he wants. With this thick- 
ness distribution, the ratio of pitch minus circumferential thickness 
of blade to pitch can be obtained. After correcting this ratio with 
some known relations between this ratio and B (such as those given in 
references 29, 30, and 35), especially near the leading and trailing 
edges, it can be taken as the factor B in equation (101). Then from 
the type of velocity diagram or a certain feature of blade shape 


1 
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desired, a relation along the mean stream surface §o,, can be pre- 
seribed and coordinates of the mean surface and the fiow on that surface 
can be solved at the same time by equations (101) to (215). It may be 
noted that in this process, the designer still has, in general, a little 
freedom in choosing the value of 2, in equation (115c}). For a rota- 
ting blade, zg is8 usually taken somewhere near the center of gravity 
of the biade section, whereas for the stationary blade, the position of 
Zqg can be utilized to control the magnitude and distribution of Fy, 

in the most favorable manner. 


Boundary conditions for mean stream surface. - In the solution of 
this S2,m surface, the boundary conditions are a little different for 
subsonic and supersonic flow. For subsonic flow, not only the varia- 
tions of the stream function at stations far upstream and downstream 
are given, the meridional contours of the hub and casing walls are also 
given (these contours can be determined by approximate calculetions 
from biade row to blade row such as given in references 17 and 41). For 
supersonic flow, the variation of the stream function and its normal 
derivative is prescribed on an jnitial curve, which is not e character- 
istic curve. Then either the hub and casing contours are prescribed, 
or only the casing contour but with one more velocity component along 
the casing is prescribed. In the second case, the flow is extended 
toward the axis of the machine and the hub contour is determined by the 
shape of the last streamline for the required mass fiow. 


Determination of blade shape. - For subsonic irrotational flow, 
the solution obtained on the mean stream surface can be extended ont 
circumferentially by using equations (117) to (130). ‘The blade sur- 
face can be then determined as follows: 


(a) The position of the mean stream surface is first determined 
by solving the circumferential coordinate as a function of the axial 
coordinate at several radii. With the circumferential coordinate 
measured from the radial element of the surface chosen at Zp, equa- 


tion (116) gives, at a constant r: 


* UF 
rm, - (re) 2=Z. -| (= i dx (131) 
oO 


(b} The blade coordinates (r,9) will first be chosen at one 
station 2, a8 follows {see fig. 9): The mass flow passing through 
the Z, plane between the mean stream surface and the tentative suction 
or pressure surface is computed as follows: 
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Ps Ye 
Me = 7 pO Wert dr dtp (132) 


Pp f le : 
Mn = W dr da? V5 
P ., th p Wz ( } 


Because of the inaccuracy in 8 for the blade-thickness effect, the 
mass flow obtained will be a little different from that required. The 
blade coordinates and @p as functions of 2, and r are modi- 
fied until the mass flow checks. It is not important that Mg and Mp 
are & little different from one-half the required mass flow as long as 
their sum is equal to the total mass flow, but once the division ts 
chosen, it should be maintained at other z-stations. 


(c)} The blade coordinates obtained at z= Zy are extended 


upstream and downstream according to the velocity components evaluated 
at the surface. For example, for a short distance 2-2, away, the 


changes in the blade surface coordinates r and P are 


rar (Ft) (2-20) (154) 
op = %, + (23) (z = Zo) (135) 


After r and @ are thus obtained, the total mass flow may be checked 
again by equations (132) and (133). 


When the blade coordinates are obtained close to the leading and 
trailing edges, they can be faired in according to some standard shapes. 
A blade shape is therefore obtained in which the three-dimensional flow 
of the fluid is considered. The right kind of three-dimensional blade- 
thickness distribution is obtained and a good knowledge about the Lrlow 
on the blade surface is also available at the same time. The’ data 
obtained in the solution can also be used directly for © more accurate 
and detailed determination of the velocity variation around the nose of 
the blade, for a relatively quick check of the series approximation, or 
for improvement, if necessary, of the inverse solution throughout by 
the method given earlier for solving the direct problem. This process 
seems to be the quickest way of establishing some standard three- 
dimensional flow variations for typical designs of blades from which a 
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good approximate method for routine design calculations can be estab- 
lished and of providing a basis on which the viscous flow along the 
blade surfaces and hub and casing walls can be analyzed. The results 
given in references 29 and 35 indicate that for blades of high 
solidity, three terms in the series give sufficiently accurate 
results. 


For subsonic flow with vorticity, the circumferential extension 
cannot be accurately made at the present because of lack of adequate 
methods for the determination of vorticity variation along the mean 
stream surface (S24). An estimate of this variation can be made, 
however, and the solution can be cheeked later. An alternative method 
is to use the shapes of the streamlines and the distances between them 
obtained in the "2m solution and to design the blades with the 
assumption that the flow surfaces are surfaces of revolution by the 
method given in reference 35. Inasmuch as the rotationality of inlet 
Plow is usually serious only in later stages of a multistage compressor 
where the hub-tip radius ratio is high, this essumption is reasonably 
good. 


For supersonic flow, the flow in the mean stream surface 82,m is 
aiso determined first. If the shock due to the entrance wedge angle 
is small, an approximate solution of the blade shape can also be 
obtained by the series method neglecting the finite jump acrose the 
shock or using an estimated value. The improvement of the flow varia- 
tion for the resultant Diade is then more important than that in the 
subsonic case. Local modification of the biade shape can also be made 
if the velocity distribution on the blade obtained is unsatisfactory. 
A better method is to use the shape of the streamlines and the dis- 
tances between them obtained in the S2,m solution and to design the 
blades assuming flow surfaces of revolution according to the method 
given in reference 350. 


The processes described here for the three-dimensional solution 
have been and are now being used to analyze the compressible flow 
through @ number of compressors and turbines. Some of the results 
obtained are given in reference 42. 


GENERAL METHODS OF SOLVING PRINCIPAL EQUATION 


In the solution of the 8) surface for the direct problem and of 
the 8S surface for both the direct and inverse problems, the main 
calculation is the solution of the principal equation, which is a 
second-order, nonlinear partial differential equation in two independent 
variables. The case when the principal equation is elliptic will be 
considered first. 
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Liliptic Case 


A common form of the principal equation is written as follows: 


lay.a 3 3 ay | do Ind 0 inb 
ot. rian han 3 -| 2p te Ot 4 (, 2 a be 2 BY) + ube] + 


Mo(bp)* = 0 | (136) 


In equation (136), y and b are used for both SS, and Se surfaces; 
HR denotes $ Yor the 6] surface and r for the Se surface; 
C denotes 2 or vr for the S) surface, and 2 for the Se sur- 


face. The values of 9, €, J; and K for each individual case are given 
in the following table: 


Case Surface d Ls Equation 








1 {Si (general) (48a) 

2 |8, (surface of ee (59a) 
revolution) 

3 |8. (cylindrical (65a) 
surface } 

4 |8, (general) (75a) 

S |S, (surface of = |—_—_>-— (81a) 
revolution) 

6 (86a) 

7 (i07a) 

8 (1148) 


ls i eel 
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The equation is nonlinear even in the case of incompressible fiow. In 
the numerical computation, it is convenient to rewrite the equation in 
the following form: 


cee a Sax Shere 8 (137) 
até at 
where 
2 
_x Sin bp of , a in bp ay _ Ma(bp) 
N= Ban ant at PP oy (138) 
an 


and is evalueted from any approximate solution at the start of the cal- 
culation and from the values af ww and p obtained in the previous 
eyele during the calculation. For simple boundary shapes for an 5e 
surface and simple functions of J, K, and L, it is possible to find 
a Green's function G(n,-€, %, yy) with its proper characteristics so 
that the solution of the problem can be written in the following form 
(for example, see reference 10): 


¥ (n,$) = ff G(n, 6, x, y) N (x, y) ax ay (139) 


If the boundary wall is arbitrarily curved, it is necessary in this 
method to use the technique of conformal transformation to render the 
given boundary into a simpler one, such as cylindrical. Because this 
process will involve a numerical solution of the Laplace equation with 
the given boundary shape, it may be better to solve the given equs- 
tion (137) directly with the given shape by the numerical method. - 
Furthermore, this method will be the only choice in the feneral case 
where J and K are complicated functions, which makes the task of 
obtaining the proper Green's function very difficult if not impossible. 


FPinite-difference form of principal equation. - In order to solve 
the given equation (137) directly, the general numerical differentiation 
formuia for first and second derivatives with the function value given at 
unequally spaced grid points using second- and higher-degree polynonmtal 
representation as given by reference 26 is used to give the finite- 
difference expressions conveniently and accurately at the grid point 
near the curved boundary. If the value of any quentity q on the 
stream surface under investigation corresponding to a number of values 
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of one of the independent variables x not equally spaced, denoted by 


xO, xt, . .., xt, x 4 oy is given, the mth derivative of q (on 
the surface) with respect to x when x = x+ may be written 


pal 
(Da) a = > 3, q) + mB (140) 
j=0 : 


The differentiation coefficients B and the coefficients of the deriva- 
tives in the first or second remainder term have been explicitly 
expressed in reference 26 in terms of the spacings between the successive 
grid points by using polynomials of the second, third, and fourth degree 
for general nonuniform spacing throughout and for the special case near 
a tapered or curved boundary where only the first or last spacing is 
different from the otbers. For the special case, these coefficients 
have also been computed for different ratios of the distance between the 
boundary and the nearest point and the other spacing, from 0.1 to 1.29 
in intervals of 0.01, and are given in reference 46. For spacing ratios 
lying between these tabulated intervals, B can be obtained from the 
values tabulated by applying interpolation formulas given in refer- 

ence 43, or by the direct use of the formulas. Differentiation: cceffi- 
cients B for equal intervals using various degrees of polynomials are 
given earlier by Bickley in reference 44. 


In the present fluid-fiow problems, a large region must be covered. 
in order to get to the boundary conditions which are always given at 
stations far upstream and downstream of the blade row. In order to 
reduce the labor of computation, it is desirable to attempt to reduce 
the number of grid points required for a given accuracy by using a 
degree of polynomial higher than the customary second. A study of the 
expression of the remainder terms (see reference 26) and actnal experi- 
ence in the present problem show that, in most cases, the use of the 
fourth-order polynomial will reduce the necessary number of grid points 
to less than one-fourth that required by the second-order polynomial. 
Near the leading and trailing edges of the flow on surfaces of the §] 
kind, the variation of ww is such that accnracy is obtained most 
effectively by using small spacing there. In such case, the grid 
pattern should be chosen at these regions first, and either be kept 
constant or be continuously increased toward the inlet and exit stations. 


With the grid pattern and the order of polynomial representation 
selected, the coefficients B at each point can be obtained from refer- 
ences 44 and 26 for equally and unequally spaced points. Then the dif- 
ferential equation (157) at any grid point whose w value is fF is 
replaced by the following algebraic equation: 


ed ears . - me rm mg ce er er a ee nO OS 
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fal ; jal . : 
2 (o* Zat + wh dad) yd + D (204 + lat) ys -wi=o0 (142) 
J= 


where wt and 4 denote the values of vy on the surface considered 
corresponding to the grid points along constant ¢ and constant 7 
lines, respectively. (See Pigs. 10 and 11.) It should be noted that, 
in accordance with the definition of the speciai partial derivatives, 
tw values are those on the surface 5; whereas the grid spacings 
involved ere just the distances along the q- and {-coordinetes. 


Boundary conditions. - In flow on surfaces of the first kind, the 
flow picture is as shown in figure 9. Arbitrarily assigning a value 
Yr; on the suction surface, the value Wy 7 on the pressure surface of 


the next blade is determined from the mass Plow passing between them. 
These two values are used as the end values in equation (1412) for grid 
points next to the boundary. Outside the blade region, however, the 
position of the dividing streamiine is not known. Instead, there is 
the condition that the flow repeats itself or the -value increases 
by ¥rr-¥r when @ increases by an amount equal to the pitch angie 


(2x divided by number of blades). It is then convenient to draw any 
two parallel lines up to the leading and trailing edges of the biade and 
consider only the grid points lying between the two reference lines. 

For the -derivative at a point y°, for example, the requirea yb 
value is obtained from wi, which is a pitch angle away from yb 

(fig. 10), as 


we=yt - (yp, - ¥y) (142) 


This relation is used between the inlet station 1-1 and the leading edge 
of the blade and between the exit station 2-2 and the trailing edge of 
the blade when the 5) surfaces are assumed to be surfaces of revolu- 
tion. For the general 5S; surface where its deviation from the surface 


of revolution is considered, modification has to be made in places such 
as shown in the exit portion of figure 9. Because of the twist of the 
flow surface, the dividing line from station 1-1 to the leading edge of 
the blade becomes two separate lines from the trailing edge of the blade 
to the exit station 4-2, accompanied by trailing vortices. Although the 
flow still repeats itself cireumferentially every pitch angie, the use 
of equation (142) for the derivative at a point close to these lines 
Will give inaccurate results. in these cases, it is better to use the 
end-point differentiation formulas to evaluate the derivatives. 


IV 
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At the station 1-1 sufficiently far upstream of the blades, the 
Plow condition can be taken as uniform and the flow angle, equal to the 
given inlet angie. For the point Hh, the yy value at point i 
upstream can be obtained Prom the given flow angle as foilows: 


GS, tan a4 


y Pete ee 


H 


( - gh tan a) v8 + (38 tan ay) # (143) 


Thus, the required  velue upstream of station 1-1 can be replaced by 
the values on that station, and only the wW values downstream of 
station 1-1 will be involved in the finite-difference expression (141). 


An alternative method to take account of the inlet condition is 
as follows: If the first station 1-1 is chosen sufficiently far from 
the blades, the variation of the stream function upstream of the 
station 1-1 is linear in the circumferential direction. The value of 
the stream function, however, depends on the inlet angle. If solutions 
for a range of inlet angle are desired, they can be obtained by speci- 
fying a number of sets of linearly varying stream Punctions upstream of 
station 1-1 as fixed boundary vaiues. The slope of the streamlines 
obtained in the solution at the iniet then gives the value of the inlet 
angle. If, however, the solution for a certain specific iniet angle 
is desired, the streamline obtained in the solution must be adjusted 
according to that inlet angle, for example, as jk in figure iO is 
adjusted to position gk, thereby cbtaining an improved set of boundary 
Values of the stream functions to be used in the next calewlation. 
This method is, of course, not Se accurate and convenient as the previ- 
ous method for obtaining a solution for a given inlet angle, but is 
desirable in the matrix solution because the inlet angle is then not 
involved in the matrix factorization, thereby making the same matrix 
factors usable for a range of inlet angie and Mach number. 


At the exit station far downstream of the blade, the same methods 
can be applied. For a blade having a sharp trailing edge, the Kutta- 
Joukowski condition can be used and the correct exit angle far down- 
stream is the one that gives the flow at the trailing edge satisfying 
that condition. For round trailing edges, either the position of the 
stagnation point 18 assumed or some available empirical rule for the 
exit angle is used. If the calculation is made to compare with certain 
experimental results, the measured exit angle may be used. 
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In flow along surfaces of the second kind, the boundary walls 
extend all the way to the inlet and exit stations with the + values 
given on the walls (fig. Li). Across the inlet and exit. stations, the 
flow is considered to be uniform and parallel to the walls so that the 
required w value outside the station can be obtained by an equation 
similar to equation (143). For the inlet station where the axial 
velocity is redially uniform and there is no radial or tangential 
velocity, f varies as the square of the radius. For the exit station 
with a certein radial gradient in filwid state, the radial variation of 
fy can be determined from the corresponding radial variation in axial 
velocity and density. 


Solution of finite-difference equations. - With the grid system 
and the degree of polynamial representation chosen and the boundary 


conditions taken into account, the problem remaining is the solution 
of the set of linear algebraic equations (141) written for all interior 
grid points. For a small mmber of solutions with a given blade, the 
best method is the relaxation method (references 25, 353, 45, and 36). 
A modification of this method involving the use of higher-order difYver- 
ences is suggested by Fox (reference 46). Formulas and tables of 
coefficients obtained in reference 26 enable the direct use of higher- 
degree polynomials for problems with curved boundaries (reference 29}. 
For the present flow problems, It is necessary to include a large 
damain to get to the boundary conditions that are given at places far 
from the biades, and the use of higher-dezgree polynomials whenever it 
is applicable greatly reduces the numerical, work. 


I? a number of cases are to be solved for a given geometry (same 
plades for 5S) surface and same hub and casing shapes for Ss. sur- 
face), it is advantageous to solve the problem on a large-scale digital 
computing machine. If a high-speed digital machine is available, the 
| gimualtanecus equations may be solved by Liebmann's iterative process, 

which is the most simple to set up. For quicker results or when only a 
relatively slow-speed machine is available, the matrix process discussed 
in reference 26 is most suitable. Ina calculation of the 62,m sur- 
face for a gas turbine and in a calculation of the §S1 surface of 
revolution for a centrifugal compressor, the coefficient matrices 
(about 400 and 200 interior grid points for the two problems, respec- 
tively, and the fourth-degree differentiation forma are used} were 
factorized into the lower and upper triangular matrices on an IBM CPEC 
and an JBM 604, respectively, in about 60 hours. The determination of 
¢ for a given set of values of WN tock only 2 hours on the CPEC for 
the gas-turbine probiem. The gas-turbine problem was also worked out 
on an Univeac; the factorization took ouly lt minutes and the determina- 
tion of YY, 2.5 minutes. The increasing availability of these high- 
speed large-scale digital calculating machines will render the suggested. 
method of solving the three-dimensional-flow problem a practical one. 
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General table for evaluation of density from wy-derivatives,. - 
After the wW values are obtained at the end of each cycle of calcula- 
tion, the velocity components are evaluated from the derivatives of ff 
with respect to the coordinates, after the density is obtained as 
follows: From equations (46), (57), (74}, (80), (95), (105), and (113), 
the relation between h or p and w-deriveatives can be put into se 


common form as 
2] x «(= a) «(3 21)" (144) 


The quantities represented by X, as well as by €, yn, and € for dif- 
ferent cases, are given in the following table: 








a 





a, 1 (general) (46) 

2 (surface of (1 + 22) 1 (57) 
revolution) 

5 (cylindrical 1 
surface) 

4 (general) 7p r 1 (74) 

5 (surface of op r 1} = (80) 
revolution) 

6 (radial @ r rl (85) 
plane) 





7 (Vyr given) r a r (105) 
W 

8 > given) r L+ g4 QO r (113) 
ra 


With the w-derivatives evaluated, if an exact determination of h 
or p from the preceding equation considering the variation of specific 
heat with temperature is desired, the Keenan and Kay gas tables (refer- 
ence 47) can be used. With two or three readings of h and p (or its 
reciprocal, specific volume), the correct value of h or p satisfying 
equation (144) is found. For most cases where the temperature range 
involved is not too large the use of an appropriate average value of jy; 
> may give accurate enough results. With the use of an average TY; 
the density at any point in the flow field can be related to the inlet 
total value by equation (12a) as 


a, 
re a oe ee 1 ee a _ r 
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1 | 
a 2.2 (2 2|\7r-L 
—— -* + ofr 1 ov es sk * 
Ppa By Ay. a(bp)* Hy 
(145) 


In order to make out a general table for the calculation of density from 
the y-derivetives, the preceding equation is rewritten as 





2 
2 ae 
Ye Q : oye (146) 
xr 
where ol 
y-1 
T+rtatr2 -x\" * se 
Cr Hy 
t+. 
1 22 Y-1 
-_ ~2 = _ xl x o* 
» -|x (23¥ lta) Coen Pepe he oh 
r On e z oY) ar Or a By 


The functional relations between % and ® are given in table I for 7 
equal to 1.4 and 4/3, respectively. From the given iniet condition and 
the given XX velues, the veriation of 





_ ye 
~-1 
+ were _x : 
=i. -2 2 
(2 H; ) (bpp, 4) * 
i 


is first ecypyted and plotted as an auxiliary graph or table as a func- 
tion of = - i ). A similar auxiliary graph or table is prepared 





for the variation of 


= 
a 


Ay 





Bont 
as a function et Gz: ~ x). Anytime during the calculation, from the 


Value of Fe x ~ X } at each point (in general, X changes during 


successive improvements between 5, and 8, surfaces), 
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Y-1 
L + $ oer? _x\° 


(28) "(bop 4) a ae 


ig read from the first graph or table and is combined with 


he(2 a8) «(2 ax) 


and the entropy factor to obtain ®. The value of ~Y is then read from 
tables I or Ii. After the value of 


od 
L + = ofr® - x y 


EH. 


a 


is read from the second curve ar table, the density ratio is obtained. 


Hyperbolic Case 
In the hyperbolic case, the main problem is the solution of the 


following principal equation, written in a common form for the two kinds 
of flow surface: 


2 
ov, ASS ae eee (147) 
o> an2 


Py Lb 
atan * 2 


with the initial condition that y and its normal derivative are given 
on a curve which is not a characteristic curve. From equation (147), 
the equation of the characteristic curve ia - 


2 
qa dy; 
1 (9) ~ 2K (v 32) + n= 0 (148) 
The slopes of the characteristic curves are 


i= ost) = §-5 Ke. IL, (1498) 
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Ag = a = = + = NKe-JL (249b) 


The coefficients J, K, L, and yp and the independent variables 
7 and ¢ for the eight cases considered are given in the table on the 
following page. Using these values of J, K, and L, Ay and Ag are 


also expressed in terms of the velocity components. Except for cases 2, 
5, and 8, they can also be expressed in the usual trigonometric form, 
tan (Xi). The vaiues of X and uw are also given. 


Changes of vy-derivatives along characteristic curve. - When the 
reference point on the jib-plane moves along the image of the charac- 
teristic curve in the 7C-plane corresponding to a small change in C, 


at, the change in y is dy = Fa. Because of these two small 
changes, the change of any quantity q on the surface is (fig. 12) 


dq = Gat = Sat + SF at (150) 
or 
ag _ 34, Ada 
at ~ at * Dan (151) 
Hence along AL 
2 Aa 2 
day aay 4 aay _ay Loy 
at 3 at t+ an Of oat toy atan roe 
a ay_ a ay, 4t a ay aty , Mt aty (153) 
at an at aq v a ay alan an? 
From equations (152) and (153), 
ay day a a%y 
ato7n at an” v anZ eis) 
: 2 
ay _ a oy An a ay (An) o%y 
mre ae By AC BT Ay PR name 








Case| Burface |[Coordi-| » J E L A = 
nated : = ‘2 
tan (h)) 

: S| 
ty 
REE AE © 
1 | Fig@eneral) | py) z ry i- Ya = as us leak BR yStHst 8? wu AW, +H hid > 

=a a, ———— _ — poe 

a a a” EE -¥ Le xz r=] 


Zz 


(surface Case , : 
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Substituting equations (154} and (155) into equation (147) gives 


Paes ; 1 d dy 2 | 13 oy Noy. 
J ar ar + aK - thy) Sag ay t Se YE) ee age Be 7 = 0 
(156) 
By virtue of equations (149a) and (149b), equation (156) becomes 
,f2 a ay ee. 2 
+= = 0 ° 
a St +S ae Sn t 3 St ty ete) 
Dbimilarly, along the second characteristic curve fo; 
Ald ob May, Nay_,- 
at at + a an * T at * Tv ay O ete 


Starting from two points a and b a short distance apart on the 
initial curve, equations (1498) and (149b) give the tangent to the 
characteristic curves at these two points and equations (157a) and 
(157b) give the new value of o4/d€ and od/dy at the point of inter- 
section C of the two tangent Fines (fig. 11). ‘Fhe auxiliary equa- 
tions corresponding to the particular problem are then used to deter- 
mine other pertinent quantities at the point ¢. ‘This process is to 
be carried step-by-step downstream. 


Changes of fluid velocity and direction along characteristic 
curve. - When the characteristic curve hits the boundary wall, it is 
more convenient to express equations (157a) and (157b) in terms of the 
magnitude of the fluid velocity and the flow direction. In order to 
do this, the definitions of wW-derivatives are first put in @ common 
form for all cases as 





= = bp We x (158) 
st =~ bp W, € (159) 


where € equals land r for the 5, and So surfaces, respectively. 
By the use of equation (45), 


OV 
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d dind 1 dh as* ¢INW dine 
at 8 ~-vome (SRP SR oe, aq * ae 
(160) 
d oy _ dinb. 1 dh ds* GInWe gine 
at ay ~ PP We at * seat 7 att a at 
(161) 
Substituting equations (160) and (161) into equation (156) yields 
dinb i adh ds* awe aw 
(Ae Wg - my) (S38 +EB-£)+ leg fi ) 4 
‘ d In r ln ¢€ N M 
Ag We ae - %y ae te F - Wy F =O (162) 
Let 
Wy) = Ww sin X 
(163) 
We =woecos X 
and 
wr 2 2 r 
h =I + ~ > (Wet + Wo + We") = T+ -~ = (wo + Wp4) 
2 s d C 
(164) 


where We is equal to W,, W. r» 9; Wo, W,, 0, Wy, and W, for cases 1. 


to 8, respectively. By the use of equations (163), (164) » end (144), 
equation (162) can be written 





if we a. cos X¥ + de sin X ax 14 were me 
={ *_ Ne FC ee Oe - 7 
a2 dg * Ko cos X% - sin x dt 2 at ro 


d jin b a + 


ag * wrx Tak Gx § —— Ay coax se) 4 





7 1M sinX ~ N cos X_ 


J As cos™%- sin X (165) 


= Sm. tl oy. 
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A similar expression can be obtained Por the change in w and xX 
along A, by replacing Ap by A) in the preceding equation. For 
cases 1, 3, 4, 6, and 7, A can be written as tan (xin), where u is 


equal to sin” = through purely trigonometric transformations as 
folicws: 


AE +N Ke-JL Oy “t +a sa 
— —— aT: 0”CtC~<‘ 


2. Wee 


sin X% cos X¥ + Sin w cosy sin 2% + sin au 


7 eee XY - sin@ " ~ cos 2X + cos 2u 


sin (X+u} cos (X-p)} eos (¥tun) sin (x-u) 
= cos a cos can CF cos (X+u) cos (x-u 


tan (tu) or tan (x-p) 


For these cases, equation (165} can then be written (compare refer- 
ence 30): 


aaa ds* dine a in r 
ag at * Deore > emmy (2X —~at - cos x Sa) 


1M sinX - N cos xX] _ 
J AcosX% - sin X |- : (166) 


where the minus and pius signs on the second term and subscripts «a 
and 1 for A in the last two terms are used along characteristics AL 


and As, respectively. Equations (165) and (166) are most useful when 


the characteristic hits the boundary wall. For a direct problem, the 
siope there is known from the givén blade shape and for an inverse or 
design problem, either the desired turning at the boundary or the 
velocity on the boundary is prescribed. With either d* or dw known, 
Gw or aX is evaluated from equation (162) or (165} (only one charac- 
teristic equation is used at the wall). For convenience of setup in 
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calculation, tais system can also be used for interior points. Lbxcept 
that more terms are involved in the present problem and that w takes 
different meanings in different cases, the procedure of calculation is 
very much the same as ordinary two-dimensional flow described in refer- 
ences 34 and 30. 


CONCLUDING REMARKS 


A general theory of steady three-dimensional flow of @ nonviscous 
Pluid in subsonic and supersonic turbomachines having arbitrary hub and 
casing shapes and a finite number of thick blades is presented. The 
solution of the three-dimensional direct and inverse problem is 
obtained by investigating a combination of flows on relative stream 
surfaces whose intersection with a z-plane either upstream of or scme- 
where inside the blade row form a circular arc or a radial line. ‘The 
equations obtained to describe the fluid flow on these stream surfaces 
show clearly the several approximations involved in ordinary two- 
dimensional treatments. They also lead to a solution of the three- 
dimensional problem in‘a mathematically two-dimensional manner through 
an iterative process. The equation of continuity is combined with the 
equation of motion in either the tangential or the radial direction 
through the use of a stream function defined on the surface, and the 
resulting equation is chosen as the principal equation for such flows. 
The character of this equation depends on the relative magnitude of 
the local velocity of sound and ea certain combination of velocity com- 
ponents of the fluid. <A general method to solve this equation by both 
hand and machine computations when the equation is elliptic or hyper- 
bolic is described. The theory is applicable to both irrotational and 
rotational absolute flow at the inlet of the blade row and to both 
design and off-design operations. 

Lewis Flight Propulsion Laboratory 
Wational Advisory Committee for Aeronautics 
Cleveland, Ohic, July 13, 1951 
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Abstract 


For nearly a century Ludwig Prandtl’s lifting-line theory remains a standard tool for understanding and 
analyzing aircraft wings. The tool, said Prandtl, initially points to the elliptical spanload as the most efficient 
wing choice, and it, too, has become the standard in aviation. 

Having no other model, avian researchers have used the elliptical spanload virtually since its 
introduction. Yet over the last half-century, research in bird flight has generated increasing data 
incongruous with the elliptical spanload. 

In 1933 Prandtl published a little-known paper presenting a superior spanload: any other solution 
produces greater drag. We argue that this second spanload is the correct model for bird flight data. Based 
on research we present a unifying theory for superior efficiency and coordinated control in a single solution. 
Specifically, Prandtl’s second spanload offers the only solution to three aspects of bird flight: how birds are 
able to turn and maneuver without a vertical tail; why birds fly in formation with their wingtips overlapped; 
and why narrow wingtips do not result in wingtip stall. 

We performed research using two experimental aircraft designed in accordance with the fundamentals 
of Prandtl’s second paper, but applying recent developments, to validate the various potentials of the new 
spanload, to wit: as an alternative for avian researchers, to demonstrate the concept of proverse yaw, and to 
offer a new method of aircraft control and efficiency. 


Introduction 


In 1922 Ludwig Prandtl published his “lifting line” theory in English; the tool enabled the calculation 
of lift and drag for a given wing. Using this tool results in the optimum spanload for minimum induced drag 
(the greatest efficiency) for a given span, which, Prandtl said, was elliptical (ref. 1). Since then, the lifting 
line theory and elliptical spanload have become the standard design tool and wing spanloading in aviation. 
So ubiquitous is it that avian researchers have relied on it to explain bird flight data almost since its 
introduction. But in 1933 Prandtl published a second paper on the subject in which he conceded that his 
first conclusion was incomplete: there was a superior spanload solution to maximum efficiency for a given 
structural weight. “That the wingspan has to be specified,” he wrote, “leads to the invalid assertion that the 
elliptical distribution is best” (ref. 2). His new bell-shaped spanload creates a wing that is 11 percent more 
efficient and has 22 percent greater span than its elliptically-loaded cousin, all while using exactly the same 
amount of structure. It results in the minimum drag solution in every case of physical wings: any other 
solution will produce greater drag. Oddly, Prandtl’s second spanload remains virtually unknown. 

Sometime around 1935 Reimar Horten independently derived an approximate equivalent to Prandtl’s 
1933 solution. Horten dubbed it “bell shaped” for its wing loading. The extant evidence shows sufficient 
differences between the two men’s methods, objectives, and conclusions to exclude any mingling of 
information on this subject despite being contemporaries. While Prandtl calculated the total induced drag 
for a wing with this new spanload, he did not examine the distribution of the induced drag across the span, 
and so he missed its implications. Horten, on the other hand, did calculate the induced drag across the span 
of the wing, and in 1950 concluded that something singularly possible existed with such a spanload, 
although he never conclusively proved it (refs. 3, 4). What Prandtl missed and Horten believed existed with 
respect to the alternate spanloading (the bell) is proverse yaw. Figure 1 shows the elliptical and bell 
spanloads of Ludwig Prandtl. 
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Figure 1. The elliptical and bell spanloads of Ludwig Prandtl. 


Figure 1(a) shows Prandtl’s elliptical spanload from 1920 and the bell spanload from 1933. The symbol 
gamma (1°) signifies the airflow circulation about the wing. Figure 1(b) shows the matching downwash 
(dw) of the elliptical spanload (1920) and the downwash of the bell spanload (1933). In figure 1(c) the 
upwash outboard of the wingtip is shown. Figure 1(d) shows the 1920 Prandtl elliptical spanload downwash 
and upwash (note the sharp discontinuity at the wingtip, which is the wingtip vortex). Figure 1(e) shows 
the 1933 Prandtl spanload downwash and upwash (in contrast to the 1920 solution, note the smooth, 
continuous upwash across the wing and beyond; the wing vortex is now inboard of the tips). A comparison 
of the flow fields resulting from the elliptical and bell spanloads is shown in figures 1(d) and 1(e). The 
elliptical spanload wing, figure 1(d), has a sharp discontinuous slope at the wingtip span location in the 
upwash (this is the location of the wingtip vortex), in contrast to the smooth curve of the new upwash, 
figure 1(e) with no discontinuity (a weak vortex forms at the point where the downwash crosses the zero 
line and becomes upwash). 
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Prandtl’s 1933 solution is stated as 
L= ( 1-— x’) 3/2 


where L is the nondimensional local load (this is also expressed as gamma or I’); and x is the span location 
between 0 and 1. Subsequently, 


DW = 3/2 (x*-%) 


where DW is the nondimensional downwash (angle) of the flow. 
The lift approaches zero at the wingtip, as shown in equation (1): 


in TORO 
x: 0 > b/2 (1) 


The slope of the lift (as a function of span) approaches zero at the wingtip, as shown in equation (2): 


lim dL(x) =0 (2) 
x: 0 —> b/2 dx 


The slope of the upwash (as a function of span) at the wingtip 1s equal on both sides of the wingtip, as 
shown in equation (3): 


lim dDW(x) = lim d DW(x) 3) 
x:0—b/2 dx x:00—>b/2 dx 


Induced Drag, and Adverse and Proverse Yaw 


It 1s critical to understand the airflow and forces exerted on a wing during flight, including lift and 
induced drag, to appreciate the differences between the elliptical and bell spanloads, and the implications 
for birds and aircraft. 

Ludwig Prandtl described both the elliptical (1920) and bell (1933) spanload distributions as shown in 
figure 2. The 1920 elliptical spanload, figure 2(a), describes a wing with a uniform downwash along the 
wing’s trailing edge, and a sharp discontinuity of downwash and upwash at the wingtip, which results in a 
strong, tightly-rolled vortex formed at the wingtip. In contrast, the bell spanload describes a wing having a 
downwash that varies from strong downwash near the wing root, which tapers outboard 
(past b/2 = 0.704), to upwash near the wingtip. The bell spanload is also much more heavily loaded 
(more net force) in the root area of which the large root downwash is a consequence. The significance of 
these disparate characteristics is both subtle and dramatic. 






Freestresn 
Resultant |  airtigy, 
| force; Lift 














Resultant 
force 


Note: Resultant 
force twisted 
forward 


Induced ; ¥: 


— — thrust —_ 2 > 
Wingtip “Ne — Pal 
Downwash 


(angle) 


Downwash 


Freestream flow unesh (angle) 


(angle) 
160069 
Figure 2. Prandtl’s elliptical and bell spanloads explained. 


Figure 2(a) shows Prandtl’s elliptical spanload from 1920; figure 2(b) the bell spanload from 1933. The 
symbol gamma (I) signifies the airflow circulation about the wing. The matching downwash (dw) of the 
elliptical spanload and of the bell spanload for each are also shown. The upwash on the 1920 Prandtl 
elliptical spanload is outboard of the wingtip. Of importance in the elliptical spanload shown in figure 2(a) 
is that the net force vector field is tilted backwards by the constant downwash along the entire span of the 
wing. The resulting horizontal component of the resultant force (I’) manifests itself as induced drag across 
the entire wingspan. By contrast, in figure 2(b) it can be seen that the 1933 Prandtl bell spanload and 
downwash show the twisted downwash crossing the zero line and becoming upwash near the wingtip. The 
resultant force is tilted forward of the vertical and the horizontal component is manifested as induced thrust 
at the wingtip, due to the resulting upwash. 

Airflow over a wing generates a net force, which is approximately normal to the wing chord. As shown 
in figure 2(a) for an elliptic spanload, this resultant force vector is not exactly perpendicular to the airflow. 
The larger component perpendicular to the relative wind is known as lift. For finite wings there exists a 
component parallel to the relative wind (for elliptical spanload, always in the direction with the wind, that 
is, toward the trailing edge) that is referred to as induced drag. Induced drag 1s the “cost” of producing lift 
with a finite wing. As lift increases, induced drag also increases. Thus, any control surface deflected to 
locally produce more lift will also locally produce more drag. Ailerons deflected anti-symmetrically to 
generate a rolling moment will also produce a yawing moment to the outside, or against the turn being 
generated by the roll. This phenomenon is referred to as adverse yaw and is the reason all aircraft with an 
elliptical spanload require an auxiliary yaw device (typically a rudder, courtesy of the Wright brothers in 
1902 [refs. 5, 6]) to counter the adverse yaw in order to coordinate the turn (yaw with the turn). 

For the bell spanload, shown in figure 2(b), the net force vector is such that it varies along the span. 
Inboard, the force vector 1s tilted away from the relative wind, like that of the elliptical spanload case, and 
the parallel component produces induced drag. Progressing outboard, this parallel component reduces in 
magnitude until it eventually (past b/2 = 0.704) is tilted into the relative wind. This phenomenon is referred 
to as induced thrust (that is, negative induced drag). It should be noted that the sum total force of this parallel 
component is still producing a net drag (and this sum total is more than that of an elliptical spanload for the 
Same span - in our case we are able to increase the span and achieve less total induced drag), but locally for 
the outer 0.296 span, it produces thrust. A control surface placed in this local thrust region will generate 
increasing thrust with increasing lift. Thus an aileron located in this region will produce a yawing moment 
into the turn, which moment is referred to as a proverse yawing moment. A properly designed aileron, on 
a bell spanload wing, could produce just the right amount of proverse yaw such that an auxiliary yaw device 
would be entirely unnecessary for coordinated turning flight. A design without an auxiliary yaw device 
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means there would be neither added drag nor complexity from such a device. It should be noted that this 
does not mean for all aircraft designs employing a bell spanload that an auxiliary yaw device would not 
needed. There are instances on modern aircraft designs (for example, engine-out or crosswind landing) in 
which such a device would still be needed, but there could be designs for which such a device would not 
be required. Various bird species appear to maneuver gracefully with very minimal auxiliary yaw devices, 
and some seemingly have no discernible auxiliary yaw device. Further, as Prandtl pointed out in 1933, it 1s 
possible to extend the wingspan of a bell spanload, achieve the same lift and the same integrated wing 
bending moment, and achieve less induced drag than the equivalent elliptical spanload. This final solution 
is examined here. 

The downwash/upwash curve of the bell spanload is one smooth and continuous function from beyond 
one wingtip, across the wing to beyond the opposite wingtip. Note that the slope of the downwash/upwash 
function will also be continuous across the wing. The upwash curve rises from the equilibrium level of the 
air far beyond the wing tip to a gentle peak at the maximum upwash of the wing, located outboard of the 
wingtip, which we show as an extension of Prandtl’s downwash/upwash. Inboard of the peak, the upwash 
decreases and meets the upwash of the wing at the wingtip, and the two upwash curves, inboard and 
outboard, must be of equal slope. 

As with any other aircraft, to turn we deflect the control surfaces near the wingtips, increasing the lift 
near one wingtip, resulting in the desired bank angle. But when we increase the lift on one wingtip the 
resulting induced thrust also increases (there is always thrust at the wingtips). The raised wing will create 
more thrust than the lowered wing, resulting in both bank and yaw in the direction of the turn: proverse 
yaw. As a result of this proverse yaw, coordinated flight is achieved without the need for a tail, rudder, or 
other drag devices. 

Figure 3(a) shows the downwash field behind a wing using the bell spanload through use of twist 
(Marko Stamenovic). Figure 3(b) shows the vortex roll-up behind the wing, analytical and in flight 
(Marko Stamenovic and Tom Tschida, NASA photo). 
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Figure 3. Downwash field, wing vortex roll-up, and resulting wingtip overlap in bird formations. 


The Experiment 


To validate the theory and the most critical principles of the bell spanload, we conducted an experiment 
using two subscale flying wing aircraft that used wing twist to achieve the selected bell shaped spanload. 
The model planform with a bell shaped spanload based on Prandtl’s theory was a 25-percent Horten H Xc 
aircraft (12.3 ft span) with a design lift coefficient of 0.6. The objective of the experiment was to 
demonstrate coordinated flight with proverse yaw for an aircraft with a bell shaped spanload and no vertical 
surfaces of any kind on the aircraft. The elevons have equal and opposite throws while functioning as 
ailerons. There is no differential bias; this is a direct, stick to surface control system. 


The Bell Spanload Aircraft Experiment 


The radio-controlled aircraft were bungee-launched and flown by a pilot on the ground. Bungee tension 
was roughly 50 Ib at release and typical altitude at separation from the bungee/cord was 200 ft above ground 
level. The pilot flew the aircraft on a single racetrack pattern during its descent and landing on the dry 
lakebed from whence it launched, completing various flight dynamics maneuvers en route to collect data. 
Flight times increased as experience grew, reaching a maximum flight time of 1 min 55 s and averaging 
nearly 1 min 22 s per flight on aircraft no. 2. Nearly 3 hr of flight time has accumulated. 

The first aircraft carried an on-board data collection system: a smartphone with a triad linear 
accelerometer and triad angular rate recording application. This aircraft also later flew with a 
microcomputer-based flight data recorder providing basic inertial measurement unit functionality (pitch 
rate, roll rate, yaw rate, airspeed, and heading). The data sensors included global positioning system, 
pitot/static system, alpha/beta probes, and control position transducers. Configuration 3 had an open-source 
data recorder and autopilot, inertial measurement unit, global positioning system, pitot/static system, 
alpha/beta probes, and control position transducers. All data-gathering and -generating systems were 
calibrated before flight. Data were downloaded after each flight for later analysis. 


Mass Properties 


The aircrafts’ mass properties are: roll inertia 5.425 slug-ft*; pitch inertia 0.2717 slug-ft? (estimated); 
yaw inertia 5.818 slug-ft?; and x-z plane cross product of inertia 0.5054 slug-ft*. The inertias were measured 
using a bifilar method, except for pitch inertia, which was estimated from the computer-aided design 
geometry and the point mass locations of the onboard systems. The center of gravity was placed at 0.128 
of the mean aerodynamic chord. The aircraft mass was 14.5 lb. The lateral-directional mass properties 
proved to be critical to the experiment. Maine and Iliff (ref. 7) show a very high sensitivity to x-z plane 
cross product of inertia in the estimation of Cnda (yawing moment due to aileron deflection coefficient). 


Data Parameters 


We gathered flight mechanics data for the aircraft with instrumentation for the following parameters: 
angle of attack (-20 to 70 deg); angle of sideslip (-45 to 45 deg); total pressure (0 to 2.16 lb/ft’); static 
pressure (0 to 2.16 lb/ft”); normal acceleration (+/-6 g); axial acceleration (+/- 4 g); lateral acceleration (+/- 
4 g); roll rate (+/- 200 deg/sec); pitch rate (+/- 200 deg/sec); yaw rate (+/-100 deg/sec); left elevon deflection 
(+/- 90 deg); and right elevon deflection (+/- 90 deg). The sampling rate was 20 samples per second for all 
parameters. Open-source microprocessor systems were used for all data collection. 


Preliminary Design 


We performed preliminary design analyses using two methods: a vortex-lattice model paneling the 
aircraft as 320 discrete surfaces (ref. 8), each of the discrete surfaces with its own angle; and a build-up of 
two-dimensional airfoil panel methods (7 span locations, with 5 control surface deflections, 5 chord 
Reynolds numbers varying from 200,000 to 2,000,000, and at 9 angles of attack from -2 deg to 10 deg). 
The build-up of the two-dimensional airfoils was integrated in MATLAB (The MathWorks, Inc., Natick, 
Massachusetts). The airfoils and twist are detailed in Tables 1, 2, and 3. A converged solution was declared 
between the two result sets when we achieved a four-significant-digit match. The airfoils were 
custom-designed using the Eppler code (refs. 9, 10). Estimates of the control surface effectiveness were 
made from the vortex-lattice results and adjusted on the basis of boundary layer thickness. The control 
surface effectiveness was also adjusted on the basis of the control surface configuration change (plain 
surface to plain surface with balance added) for the scale of the aircraft. The model scale was set at 25 
percent, however, the mass of the vehicle increased due to the addition of the instrumentation package. The 
resulting wing loading placed the subscale aircraft in the range of the full-scale wing loading; consequently 
the subscale aircraft flew at velocities closely matching the full-scale aircraft predictions. The wingspan is 
12.3 ft with a leading-edge sweep of 24 deg at the nose. 

With this in mind for the analysis, the common use of Oswald’s efficiency factor (“‘e’’) is also not 
appropriate for bell spanloads. Perhaps the invention of a Prandtl efficiency factor (“p’’) or a bell efficiency 
factor (“b’”) should be used for these Prandtl 1933 bell spanloads. A comparison of the elliptical and bell 
spanload efficiency parameters is given in table 4. 

The coordinate frame for the flight mechanics data on the aircraft is: origin at the center of gravity 
(12.875 inches aft of the nose); x-axis is positive forward out the nose; y-axis 1s positive out the right wing; 
and Z-axis is positive down out the bottom of the aircraft. Using a right-hand convention; roll is rotation 
about the x-axis and is positive for roll right; pitch is rotation about the y-axis and is positive for pitch up; 
and yaw is rotation about the z-axis and is positive yaw right. 

The coordinate frame is: for the wing definition, the x-axis origin is at the wing centerline and extends 
to b/2 (half-span). The y-axis is defined as vertical upward (though in specific cases it 1s defined otherwise, 
and the convention should be apparent by the context). 

The aircraft design was generated to produce a bell spanload. The airfoils vary continuously and linearly 
from the centerline to the tip. The airfoils are specified in nondimensional coordinates. The airfoils used 
are shown in tables 1 and 2. 


Table 1. Airfoil section, centerline. 


0.37059 | 0.10000 | 0.00000 _| 0.00000] 0.37059 | -0.01904 | | 





Table 2. Airfoil section, wingtip. 


0.4062 


0.68325 


0 

0 | ji 
0 | 
0 


0.43132 | 0.04453 | 0.00000 _| 0.00000 | 0.43132] -0.04453 | | 
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The wing twist is nonlinear; it is specified at 20 intervals from the centerline to the wingtip, in degrees, 
as shown in table 3. Using the above airfoil coordinates, this twist does not require any compensation for 
aerodynamic twist relative to the geometric twist. 


Table 3. Wing twist distribution. 


/O | 8.3274 | 11 | 7.2592 _ 


/6 | 8.8257 | 17 | 1.9394 _ 
/8 | 8.4565 | 19 | -0.6417_ 
/9 | 8.1492 | 20 | -1.6726_ 
(104 7.7522{ | 


The control surfaces are located in the outboard 14 percent of each wing, in the trailing 25 percent of 
the chord; the round tips are included as part of the control surfaces. The wingspan is 12.3 ft, the wing area 
is 10.125 ft’, the centerline chord is 15.75 in., and the wingtip chord is 3.94 in. The wing had 2.5 deg of 
dihedral. 





Table 4. Elliptical spanload and bell spanload comparison of spanload parameters and efficiency factors. 


This comparison is made using the traditional elliptical spanload as the baseline from which the bell 
spanload is compared. 

In figure 4, three spanloads (blue = -5 deg; red = O deg; and green = +5 deg) are plotted showing the 
effect of sideslip on the area of induced thrust near the wingtips and the resulting effect on yawing moment. 
The light-green line shows a large area of induced thrust on the left and a small area of induced thrust on 
the right, which would result in a large right-yawing moment. 
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Figure 4. Effect of sideslip on bell spanload with twist (O and +/- 5 deg). 
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Aerodynamic Coefficient Estimation 


We used Maine and Iliff’s output-error approach to estimate the aerodynamic coefficients, which was 
the same source used for the estimates of Cnda. We assume the aircraft is a continuous-time dynamic 
system. The process of estimating the aerodynamic coefficients is an exercise in system identification. 
Assumptions were made in this process, many based on previous experience, such as which parameters 
were important (these are retained) and which were not (these are ignored). This approach uses a 
formulation of the solid-body aircraft flight mechanics as a linear simulation of the vehicle. An initial 
estimate is made of the aerodynamic coefficients; the simulation then makes an estimate of the vehicle 
motion based on the aerodynamic coefficients, the mass properties, and the equations of motion, after which 
the linear estimates are compared to the measured flight data from the vehicle. Errors from all of these 
measurements subject the final estimates of the aerodynamic coefficients to uncertainty. The errors between 
the simulation output and the measured data are subjected to a measurement based on a weighted 
error-based cost function defined by the researchers. 

The aerodynamic coefficient estimates are then varied, and slopes or gradients are determined 
numerically from the errors. The estimation program then marches toward minimizing the cost function 
from the “fit” between the output of the linear simulation and the measured flight data. Maneuvers were 
simple doublet maneuvers, which are simple square-wave pulses, both positive, followed immediately by 
a similar pulse of the opposite sign. 

The results of the flight research on the small flying wing glider were successful, as can be seen in 
figure 5(a). We measured proverse yaw in flight for the first time on June 27, 2013. A sample output from 
the flights shows proverse yaw, as shown in figure 5(b). 
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Figure 5. A subscale aircraft in flight, and resulting proverse yaw data trace. 


Figure 5(b) shows a data trace of the angular rates from onboard instrumentation. Red 1s pitch rate, blue 
is roll rate, and green is yaw rate. The high-frequency motion in pitch rate is due to air turbulence. The yaw 
motion following the roll motion is the same sign; the yaw gain is 0.0643 and correlation is 0.77 for this 
maneuver. All rates are to the same scale. 
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We calculated the coefficient Cnda from the flight research maneuvers. In figure 6, the scattered dots 
represent the flight research maneuvers. The value of Cnda is positive and the trend of the slope 1s also 
positive. The degree of scatter in the data is a result of all experimental error. From this we see that Cnda 
is providing the yawing moment in the same direction as the rolling moment. 
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Figure 6. Yawing moment due to aileron deflection coefficient versus lift coefficient. 


The yawing moment due to aileron deflection coefficient (Cnda) is shown plotted against lift coefficient 
in figure 6. The blue line and red circles were predicted for the full-scale aircraft. The black line is an 
estimate of the 0.25-scale experiment aircraft. The black dots are the estimates from flight data. Error bars 
are 5x Cramer-Rao bounds. The straight line with the circles represents the analytical data from the vortex 
lattice. An estimate of the effect of scale 1s made on the vortex-lattice (reducing the scale reduces the 
effectiveness of the control deflection and reduces the resulting yawing moment). The good comparison 
between the predicted and the measured flight Cnda confirmed our expectations regarding Prandtl’s bell 
spanload. 


Birds and the Bell Spanload 


There are at least two larger implications of this work. The first 1s for the avian research community; 
the second is for the aeronautical world. 

That birds have no vertical tail yet effect effortless turns remains a puzzle, inasmuch as all avian flight 
research is analyzed using the elliptical spanload. The matter of formation flight also defies satisfactory 
explanation despite a century’s worth of research, analysis and effort, again in large part because the 
analysis relies entirely on the elliptical spanload. (“The wake [of the kestrel] was found to be similar to that 
measured behind an elliptically loaded airfoil of the same span,” wrote Geoff Spedding when analyzing his 
data. “As a result, classical airfoil theory for an elliptically loaded wing was used to calculate parameters 
such as lift coefficients and efficiency factors” (ref. 11). Less apparent but equally puzzling to close 
observers is the shape of birds’ wings when compared to aircraft wings: the former taper, often to a sharp 
point, while the latter rarely do, and this, too, defies the elliptical spanload solution. The load distribution 
over a bird’s wing is far more gradual than an elliptical spanload provides: consider a birds’ wing 
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structure—both skeletal and on the surface—which tapers to almost nothing near the tips, where the 
outermost feathers carry virtually no load at all, as compared to an aircraft’s wing. An elliptically loaded 
aircraft’s wings carry loads right to the wingtip. 

First, based on our research results we assert that the growing data on bird flight is irreconcilable so 
long as it relies on the elliptical spanload as the analytical tool. Second, based on the analytical results of 
the bell spanload and the flight data, we assert the only viable solution for interpreting bird flight, formation 
flight, and bird wing structure is the bell spanload. 

We know that it is a biological imperative that birds carry no excess structure in their wings or chest 
muscles, only as much as muscle, tendon, and bone as necessary. Birds embody minimum structure while 
achieving maximum aerodynamic efficiency while accomplishing coordinated flight: birds are a solution 
to a multivariate optimization. Recall that Prandtl’s second paper provided a spanload solution to maximum 
efficiency for a given structural weight when the wingspan need not be constrained. The bell spanload 1s 
the only explanation for how birds achieve this multivariate solution (refs. 12-15). 


Birds-Bell Spanload; Airplanes-Elliptical Spanload 


1. Birds’ primary feathers are soft and flexible at their wingtips and the wings have a narrow chord; 
these wingtip feathers are incapable of supporting any substantial load. Additionally, the outboard wing 
structures of birds are long and slender. The ligaments, tendons, supporting muscles, and bones are long 
and thin, improving aerodynamic performance, but the load-carrying ability of these structures is very 
modest (the same was true for pterosaurs). In contrast, aircraft wingtip structures are large, heavy, and 
expected to carry real loads in flight. 

2. Birds flying in formation position themselves to capture upwash from a leading bird’s wing vortex 
roll-up for added efficiency. Data shows they do this with wings overlapped. Aircraft flying in formation 
with similar objectives do not match this profile, however: they fly with wingtips in line. 

3. Birds do not experience wingtip stall even with their narrow-chord, sharp-tipped, wings. But when 
sharp-tipped swept wings are used on aircraft, wingtip stall is common and requires other solutions to 
overcome. 

We are accustomed to seeing birds turn and maneuver without a vertical tail, and only seeing aircraft 
do so using such drag-inducing devices. The ability to turn and maneuver without resorting to drag-inducing 
devices to counter adverse yawing forces is the first evidence for why the bell spanload—which generates 
proverse yaw—explains the flight of birds. 

Figure 7 shows a wandering albatross (diomedea exulans) in flight. The wandering albatross has no 
vertical tail, yet these birds are able to expertly fly so that they precisely touch their wingtips to the water. 
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Figure 7. Wandering albatross 1n flight. 
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Researchers such as Wieselsberger (ref. 16), Lissaman and Schollenberger (ref. 17), and Portugal have 
argued that flying in formation allows birds to capture upwash in the air from the wing vortex roll-up. There 
is no dispute that birds maximize the energy from the upwash, something only possible in formation flight. 
What we dispute is where that vortex occurs on birds. Figure 8(a) shows a formation of pelicans flying with 
wingtips overlapped, which is an optimal arrangement with the bell spanload but suboptimal for the 
elliptical spanload because in this case the vortex roll-up is not at the wingtip but inboard of the wingtip 
(at .704 of the semi-span) and is in fact a wing vortex roll-up, not a wingtip vortex roll-up. Spedding’s data 
support this, as can be seen in figure 8(b); the vortices seen behind his kestrel show a vortex roll-up inboard 
of the wingtips. (“This wing loading distribution [elliptical] is reflected in the geometry of the wake,” he 
wrote). Birds position themselves in formation flight based on the location of the actual vortex roll-up, and 
only the bell spanload generates a vortex roll-up in that location. 
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Figure 8. Significant bird flight characteristics; a: Formation flight of brown pelicans (pelecanus 
occidentalis) demonstrating the resulting wingtip overlap; and b: Spedding’s kestrel (falco tinnunculus) 
data showing an inboard vortex core location. 


Figure 9(a) shows spanwise location data of following bird relative to the lead bird in the northern ibis 
(geronticus eremita) from Portugal, with Hainsworth (ref. 18), Cutts & Speakman (ref. 19), and Speakman 
& Banks (ref. 20). Figure 9(b) shows an overlay of the data sets with our addition of the downwash curve 
of the Prandtl 1933 spanload and our extension of Prandtl’s 1933 theorum. 
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Figure 9. Bird position in formation flight. 


Portugal recently published research based on global positioning system data showing northern ibis 
flying in formation with the tips of their wings overlapping. He concluded that the mean spacing was 
0.904 m on a mean wingspan of 1.2 m, for a vortex core separation of 0.753. Spedding gave the vortex core 
separation of his kestrel research as 0.76 of span. The vortex separation on our research flying wing aircraft 
occurred at 0.704 of the semispan. Portugal, like Spedding and others before him, analyzed his results using 
the elliptical spanload, forcing the analysis of the birds to fly formation with their wingtips in line with each 
other rather than with wings overlapped. Birds in formation flight seek out the greatest upwash, and there 
is a clear, strong correlation between the location data of birds in formation flight and the vortex formation 
and upwash data of the bell spanload. 

How are birds able to fly with pointed wingtips? Note how the lift tapers gradually to zero at the wingtip 
with the bell spanload. The result is that even wings with very strongly tapered tips show no tendency to 
wingtip stall. Rather than occurring at the wingtip (as it will with an elliptical spanload) the stall begins 
about 20 percent out from the wing root, something observable in the flight of birds [figure 10(b)]. Because 
the bell spanload creates proverse yaw in the outer third of the wing, the thrust yields controllability even 
with a sharply tapered wing. 

The upwash at the tips of the bell spanload makes it possible to capture the wingtip-induced thrust that 
can then generate coordinated roll and yaw without resorting to the use of a vertical tail and without 
generating drag at the wingtips. If we accept Prandtl’s 1933 lift distribution as useful for birds, it follows 
that birds are manipulating thrust at their wingtips to control yaw. 
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Figure 10. The local lift coefficient (Cl) and the beginning of stall on the wing of a wandering albatross. 


Figure 10(a) shows the local lift coefficient as a function of span for a bell spanload from centerline to 
the wingtip. Note that the highest point on the curve is the area in which the wing would first stall. Figure 
10(b) shows an image of a wandering albatross soaring at low speeds (image by Jeff Jennings). Ruffled 
feathers indicate the beginnings of the stall, at approximately 20 percent of span, not near the tip, matching 
the bell spanload predictions. 

Combining observational evidence and data developed by avian researchers with our own research 
results, we assert that only the bell spanload provides a coherent paradigm for bird flight. Our research 
offers for the first time a theory and a tool derived from flight test that satisfactorily explains bird flight to 
match the data. It also serves as a solution to far more efficient aircraft flight. 


Conclusion 


The bell spanload maximizes aerodynamic efficiency with a given structure, coordinates the roll-yaw 
motion so that birds are able to turn and maneuver without a vertical tail, and explains why birds fly in 
formations with their wingtips overlapped, as well as how birds use narrow wingtips without experiencing 
tip stall. 

The bell spanload also allows for improved aircraft designs, particularly all flying-wing aircraft and 
blended-wing body aircraft. Even conventional tailed aircraft can benefit from the improved aerodynamics 
and minimum structure approach. There are circumstances in which span constraints exist (such as 
extremely large transport category aircraft), in which cases current approaches provide better solutions. 

Neither Prandtl nor Horten followed through to the logical and complete conclusion of their work. 
Prandtl did not extend the upwash outboard of the wingtip, which would have answered the question of 
formation flight in birds, and he did not find the induced thrust at the outboard ends of the wings, which 
leads to proverse yaw. In turn, with his approximation and objectives Horten did not understand the origin 
of the induced thrust at the outboard ends of the wings for proverse yaw, and he did not prove that proverse 
yaw exists. 

It remained for the current authors to prove conclusively that proverse yaw is achievable through an 
efficient bell-shaped spanload, that an optimal solution integrating minimum structure and minimum drag 
can solve the problem of yaw control and stability of a flying wing, and that the bell spanload solution 
answers some of the great enduring mysteries of the flight of birds. 

In the case of the flight of birds, the bell spanload is the only viable solution. 
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The Mechanical Analog Computers of 
Hannibal Ford and William Newell 


A. BEN CLYMER 


The history of mechanical analog computers is described from ear/y devel- 
opments to their peak in World War II and fa their obsolescence in the 1950s. 
The chief importance of most of these computers was their contribution to the 
superb gunnery of the US Navy. The work of Hannibal Ford, William Newell, 
and the Ford Instrument Co. is the framework around which this account Is 


based. 


F or over 40 years mechanical analog computers provided 
the US Navy with the world’s most advanced and capa- 
ble fire-control systems for aiming large naval guns and 
setting fuze times on the shells for destroying either surface 
or air targets. A large part of this preeminence can be 
attributed to the work of Hannibal Ford and William New- 
ell. However, the credit has usually been withheld. first 
because of security classifications and later by the resulting 
widespread ignorance of even the main facts of their stories. 

The history of the evolution of fire-control equipment 
can be divided into three crudely defined periods of prog- 
ress: early. middle. and late, being respectively the eigh- 
teenth, nineteenth, and twentieth centuries. In the early 
period, the eighteenth century. there was no perception of 
fire control as a hierarchical system. so there were no inven- 
tions on the #¥4fe#? Jevel. Lack of concern for improvement 
caused continuation of the status quo. In the middle period, 
the nineteenth century. there began a trend toward automa- 
tion in many practical pursuits (e.g., the cotton gin. railroads. 
steamboats. and glass-forming machines) which extended to 
naval gunnery. Handwheels provided a mechanical advan- 
tage in training and elevating guns. The man-machine sys- 
tem was being made easier and better for the men by 
delegating more to machines. 

In the late period, the twentieth century. people have 
seen the system as a whole, and they have been conscious 
of missing subsystems. Inventions then took place on the top 
echelon, and system engineering began to deal with the 
entire hierarchical system. In the late period there was 
concern for errors of system performance. In the case of a 
fire-control system, the contributions of all causes to the 
ultimate miss data were studied to identify ih most critical 
remaining sources of error. 


Early analog computing mechanisms 

To understand the types of mechanisms invented by 
Ford and Newell, it is necessary to briefly examine a few of 
the simple components from which they arose. The history 
of mechanical analog devices goes back at least to Vitruvius 
(SO BC), who described the use of a wheel for measuring arc 


length along a curve. the most simple integral in space. Many 
other elementary analog devices were described before the 
modern period: Differential gears (Figure 1). used for add- 
ing or subtracting two variables. arc usually ascribed to 
Leonardo da Vinci: and Leibniz is credited for the idea late 
in the seventeenth century of a similar-triangles device for 
equation solving or root solving.’ 

The first device to form the integral under a curve, or the 
area within a closed curve, was the integrator of B.H. Hetr- 
mann in 1814. Hermann’s integrator was essentially a wheel 
pressed against a disk. as shown in Figure 2. There was a 
second disk over the first. which squeezed the wheel be- 
tween them. The rate of rotation of the wheel is proportional 
to the product of the disk rotation rate and the radial 
location of the point of contact of the wheel on the disk. That 
is. the rate of change of angular position of the wheel z is 
given by 


d*- dy 
“=kKy 
dn ay 


where z is the time integral of ¥ times a constant. x is the 
angular position of the disk, and K is a scale constant. Note 
that the variables in this device are angular and linear 
positions. 

An early application of such integrators was the integra- 
tion of force over distance to measure work. Another appli- 
cation was a planimeter to measure the area within a closed 
curve. In fact. the chief impetus behind the early integrator 
inventions of the nineteenth century was to get an improved 
planimeter. 

James Clerk Maxwell’ described a ball type of integrat- 
ing device while he was an undergraduate: it was incorpo- 
rated in a planimeter design. In about 1863. James Thom- 
son’ conceived an equivalent integrator in which a ball 
rotates between the disk and a cylinder (see Figure 3). The 
angular position of the cylinder is the output variable z. and 
the ball replaces the wheel of the Hermann integrator. The 
ball is held in a housing that is translated along the radius of 
the disk with displacement y. This integrator became the 
heart of numerous harmonic analyzers and time analyzers. 
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Figure 2. The Hermann integrator. 


In 1881 a different type of integrator was developed in 
Madrid by V. Ventosa.’ It consisted of a tilluble drive roller. 
a ball, and four output rollers. If wind velocity is put into the 
drive roller (marked “A” in Figure 4) as angular velocity. 
and if wind direction is put in as tilt angle. then the four 
output rollers turn with speeds proportional to the compass 
components of wind velocity. As a computing device this 
ball constitutes a “component integrator’ -it produces the 
time integral of the sine and cosine components of a given 
varying magnitude. Later forms of trigonometric integrators 
were developed by Hele-Shaw, Smith. Newell (see Appen- 
dix), and others. 

Harmonic analyzers were developed to determine the 
coefficients of a Fourier series to fit a given record. such as 
tide data. Lord Kelvin built two. the second in |4/9. A 
refined version by Michelson and Stratton built in 1897 
could sum 80 Fourier terms. According to Vannevar Bush” 
a three-dimensional cam for multiplying was developed by 


Bollée. 





A two-dimensional cam (Figure 
5) was used to generate a virtually 
arbitrary function of one variable: 
The input is the rotation angle of 
the cam, and the output is the radius 
of the cam at the point of contact of 
a roller. A three-dimensional cam 
(Figure 6) was similarly used to 
generate a function of two vari- 
ables. such as time of flight as a 
function of range angle and eleva- 
tion angle to the target. 

William Thomson. Lord Kelvin, 
had the powerful idea of using ana- 
log computing mechanisms tied to- 


Figure 1. The Ford 3/8-inch spur differential gears. (Photograph by Laurie Minor, &cther to solve a differential equa- 


tim." Ten years later, Abdank- 
Abakanowicz built an “integraph,” 
which had the purpose of solving 
one particular differential equa- 
tion. Thomson’s idea was the con- 
ception of differential analyzers. 
which. however. did not become a 
practical reality until the 1930s with 
the work of V. Bush.’ Lord Kelvin 
also invented a pulley device for 
solving simultaneous cquations.” 
Larger versions were built by MIT 
professor Bohn Wilbur in 1934 and 
1935, An “isograph” was devel- 
oped at Bell Telephone Labora- 
tories. following a concept due to 
Thornton Fry in 1937. It could find 
the roots of polynomials of up to 
lGth degree, even if the roots were 
complex numbers. It was based on 
a Scotch yoke mechanism to trans- 
form from polar to rectilinear coor- 
dinates.’ The state of the art of 
these and other computing mecha- 
nisms has been summarized as of the end of World War II 
by Macon Fry” and C'lymer."' 

These analog mechanisms. together with a “multiplier” 
(using slides and based on the mathematics of similar trian- 
gles) and a “resolver” (which produced R sin 9 and R cos 6 
from R and ® by means of a Scotch yoke mechanism). were 
among the building blocks for the practical computing sys- 
tems to be described. 


Naval surface fire-control computers of 
1910 to 1930 


It is necessary to describe a little of the technology of 
naval gunnery and fire control to present a snapshot of the 
state of affairs just before the entry of Hannibal Ford into 
the picture. What he accomplished was in direct response to 
the needs of the US Navy. He was responsible for the 
development of mechanical analog computers of unprece- 
dented size. complexity, dependability. ruggedness, and ac- 
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curacy. The mechanical analog 
computers of 1915 were, however. 
quite simple, small, and uncompli- 
cated compared with their descen- 
dants in the next three decades. 


The fire-control problem. In the 
nineteenth century the fire-control 
problem greatly increased in diffi- 
culty. Ranges had been 20 to 4) 
yards in 1800.’ ' Most of the engage- 
ment between the Monitor and the 
Mctrimac had been fought at 100 
yards. which was virtually point- 
blank range, and the ships were 
slow in maneuvers, affording gun- 
ners plenty of time to take aim.” By 
the end of the century, naval guns 
could fire at ranges far in excess of 
10,000 yards. Ships could move 
much faster, and still rolled and 
pitched to large angles in heavy 
seas, causing both sights and guns 
to move off target. 

With the increased ranges avail- from the disk center.) 
able to guns the problem of “spot- 
ting’ the errors in the locations of 
splashes of shells became more dif- 
ficult even in the clearest weather. 
Likewise. the task of determining 
target range became more chal- 
lenging. With the increased target 
range went a more than linear in- 
crease in the time of flight of a shell. 
so the target had more time in 
which to maneuver. Moreover. the 
greater time spent by a shell in flight 
enabled wind to have very impor- 
tant effects upon the impact point. 
Another complication was that ri- 
fling the gun barrels. while reducing 
random scatter. caused a systematic 
lateral “drift” of the projectile. 
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Figure 3. The Thomson integrator. (The displacement is perpendicular to the paper away 





which had to be compensated for in Figure 4. The Ventosa integrator. 


aiming the guns. 

The greater need for angular ac- 
curacy at greater ranges increased 
the importance of some relatively 
minor effects, such as variationsin atmospheric temperature 
and pressure. barrel erosion resulting from previous firing 
(which reduced the initial velocity and hence the range of 
the shell). propellant weight and temperature variations. 
projectile weight. and so on.'- The largest disturbances to 
accurate naval gunnery were the rates of change of range 
and target bearing due to relative motions of “own ship” 
(the firing ship) and the target. 

Clearly the crisis in naval gunnery created pressure to 
improve naval fire-control equipment. 


Fire-control equipment of 1910 to 1915. During World 
War | fire-control equipment included three classes of de- 
vices. |* 


Hevices alti, Spotters’ scopes were used for viewing 
splasitcs in order to phone gun angle corrections (“spots’’) 
relative to the line of sight. Optical range finders of succes- 
sively improved types determined range to the target. 
(American models had a base of 14 to 20 feet. but the British 
had only 9 feet. giving double the error. German range 
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Figure 5. A two-dimensional cam. (Photograph by Laurie 
Minor, Smithsonian Institution.) 


finders were the best because they had the best optics and 
thus the best view.) 

Directors. after about 1912.” consisted of sights kept 
aimed at the target in train and elevation in order to correct 
gun train and elevation angles for own ship roll and pitch. 
The English company Vickers had the lead in director de- 
velopment.” The US Navy purchased some of these direc- 
tors from Vickers for 4 inch guns. 


Devices befawships (in the “plotting room” or “control 
information center’). Gyrocompasses determined own_ ship 
course (purchased from the Sperry Corporation by the US 
Navy after 1910). Plotting boards were used for plotting the 
paths of own ship and target to determine range at the future 
time when the projectile would arrive (“advance range’), 
using range-finder data. The invention of the plotting board 
is ascribed to a junior gunnery officer in about 1906. 

Range clocks let operators set in the present rate of 
change of range to obtain a crude running estimate of range. 
“Time of flight clocks” told the time when a shell fired 
“now” would reach the target. The Argo clock was a me- 
chanical analog computer for solving the relative motion 
equations for range. As of 1912. the US Navy had a “lite- 
control table” (a mechanical analog computer) having input 
from the range finder and director. 

The pitometer log measured own ship speed. 








Figure 6. A three-dimensional cam. (Photograph by Laurie 
Minor, Smithsonian Institution.) 


Devices at the #iif%. Mechanical drives for guns appeared 
between 1907 and 1910. Manual tracking of command an- 
gles on dials positioned guns in train and clevation.’’ Grad- 
uated sights on the guns had been used at the time of the 
American Civil War but were obsolete by 1910 or 1915. 


Differences between Britain and the US. The connectiv- 
ity of the primitive fire-control “system” composed of the 
foregoing fragments foreshadowed some aspects of modern 
fire control. However. there were differences among the 
systems used by different countries. For example, between 
Britain and the LS. there were differences in who controlled 
gunfire, from where. and with what use of the plotting 
room.” In the US Navy, the plotting room personnel con- 
trolled the fire, using data from spotters and their own data 
to compute gun angles. On the other hand, the British 
preferred optical system angular outputs. Director person- 
nel controlled the fire. using the plotting room information 
mainly to correct range. 

Thus the stage was set for the contributions of Hannibal 
Ford. 


The fire-control computers of Hannibal 
C. Ford 


Hannibal Choate Ford was born in Dryden, N.Y... an May 
8.1887. His parents were Abram Millard Ford (born Febru- 
ary 22. 1831) and Susan Agusta Giles Ford (born June 3, 
1834). 

As a young boy. Ford showed mechanical talent with 
clocks and watches. Between high school and college he 
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Figure 7, Hannibal C. Ford and his engineering staff about 1922. Ford is front and center; the others are unknown. (Photograph 
from the Sperry Gyroscope collection.) 


worked at the Crandall Typewriter Company, Groton, N.Y. 
(I 894). at the Daugherty Typewriter Company, Kittanning, 
Pa. (1896-1898), and at the Westinghouse Electric and Man- 
ufacturing Company (1898). 

He studied mechanical engineering at Cornell Univer- 
sity, graduating in 1903 as a “mechanical engineer in elec- 
trical engineering.” Evidently his classmates at Cornell re 
specled his mechanical inventive ability, because his motto 
in their senior yearbook was, “I would construct a machine 
to do any old thing in any old way.” He was elected to 
membership in Sigma Xi, the honorary society for research. 

After graduation Ford worked for the J.G. White Com- 
pany. New York (1903-19051, where he developed and held 
two basic patents issued in 1906 on the speed-control system 
long used in the New York subways. At the Smith-Premier 
Typewriter Company, Syracuse, N.Y. (1905-1909). he de- 
veloped over 60 mechanisms of commercial importance and 
received a number of patents over the period 1908 to 1915.” 

In 1909, Ford worked for Elmer A. Sperry. whom he had 
known as a young man in his home town, Sperry having been 
somewhat older. Ford assisted Sperry in the development 


of the gyrocompass, a mechanical device for determining 
own ship’s heading. The following year, Ford was promoted 
to be chief engineer of the newly formed Sperry Gyroscope 
Company, a position which he held until 1914," 

In 1915. Ford resigned from Sperry to organize his own 
company, the Ford Marine Appliance Corporation, which 
became the Ford Instrument Company in 1916 (see Figure 
7). The company’s mission was to develop and sell fire-con- 
trol systems to the US Navy. Its first product, Range Keeper 
Mark 1, was introduced into the US Navy in 1917 on the 
USS Texas. 

Ford’s Range Keeper Mark | (abbreviated Mk. 1) per- 
formed a remarkable number of continuous functions in real 
time for a computing system in those days: 


1. It generated range rate. 

2. By integration of range rate it determined present 
range. 

3. It generated the relative speed at right angles to the 
line of sight” but not the present target bearing 
angle.*” 


IEEE Annals Of the History Of Computing, Vol.15.No.2, 1993 « 23 


Mechanical Analog Computers 


The rates were obtained by resolving own ship’s and target’s 
speed vectors along, and perpendicular to, the present line 
of sight. These operations required mechanical resolvers, 
differential gears, and an integrator. 

Ford’s integrator (Figure 8) was of superior design for 
achieving high accuracy and long life. It used two stacked 
balls, held by stiff springs, between a disk and cylinder, each 
made of hard steel. The balls were held in place by pairs of 
small rollers in a carriage. This design permitted the carriage 
to move even when the disk was not moving, a feature that 
was necessary when integrating with respect to a variable 
other than time. The author does not know if Ford was 
aware of the prior art, such as James Thomson’s integrator 
and William Thomson’s (Lord Kelvin’s) computer concept . 
before applying for his patent.°” 

Own ship speed (measured from a pitometer log) and 
estimated target speed and course, own ship course (from a 
gyrocompass), as well as target bearing, were entered man- 
ually with the aid of dials, hand cranks. and knobs. The 
assembly of mechanisms was driven by an electric motor 
whose rotations represented the elapse of time. Present 
range, from the range finder, was telephoned to the plotting 
room, where the range keeper was kept. 

Meanwhile, Arthur H. Pollen, a British inventor, had 
devised a mechanism Of the differential analyzer type (called 
an “Argo clock’) to solve, on a continuous real-time basis, 
the relative motion equations for own ship and a target ship: 
“It accounted in large part for the extraordinarily good 
shooting of several Russian battleships during World War 
I.“ It was used also in the British Navy. Pollen’s invention 
must have preceded, by a short time, Ford’s range keeper. 

During World War I, the US Navy obtained the patent 
for the British Pollen fire-control computer system (Argo 
clock), and the Range Keeper Mark 1 was modified to 
incorporate one of Pollen’s concepts (dividing by the range 
and integrating with respect to time to get the bearing 
angle). By dividing relative motion across the line of sight 
by present range, the Ford range keeper (called apprecia- 
tively the “Baby Ford’) was able to generate the rate of 
change of target bearing and integrate it to get the target 
bearing angle, which in turn defined the line of sight. Thus 
the range and direction to the target could be generated and 
known, even if the target was lost from sight for a while. 
These modifications introduced another integrator and a 
divider into the evolving range keeper.!? 

Another of the early additions to the Baby Ford was a 
ballisticcapability.” It was to determine the time of flight of 
the shell to the predicted point of impact, the bearing of that 
point, and the range of that point. Then the gun angles could 
be calculated to implement that prediction. The guns were 
steered by hand (following pointers), but they were powered 
by Waterbury Speed Gears (hydraulic drives). 

Another capability was “rate control.” This function en- 
abled determining corrections to target speed and course as a 
result of data obtained from spotters aloft regarding the splash 
locations relative to the target. The Baby Ford had a rudimen- 
tary scheme for doing this, but it required the prediction calcu- 
lations to be stopped while rate control was being done. Han- 
nibal Ford earned a patent for his rate control scheme. 


By the end of World War I, the Ford range keepers 
provided a serviceable nucleus for a partially mechanized 
fire-control system. It was roughly comparable with the 
British system. The British gun directors were deemed bet- 
ter than those of the US Navy, but British range finders, 
having a smaller baseline, were inferior in accuracy. The 
Pollen Argo clock and Baby Ford were about a standoff.” 
Acceptance of the Baby Ford was not universal and imme- 
diate. Some senior fleet officers tended to resist it, preferring 
the plotting boards, where they could “see” the situation at 
a glance. 

In addition to developing range keepers, Hannibal Ford 
almost single-handedly developed an entire gun director. It 
included an optical turret, a stable element to establish the 
vertical on a rolling and pitching ship, an angle gyro pointing 
at the target, and the associated Baby Ford range keeper, 
which included a ballistic computer. 


Naval fire control from 1930 to 1950 


In the 1920s the international clamor for disarmament 
forced the US Naval budget to a very low point. Although 
the situation improved in the 1930. when the US Navy 
began again to grow, money was still tight. The Bureau of 
Ordnance was forced to drastically limit what it could pro- 
cure. A striking example is offered by the deck tilt corrector 
that was, in the 193fs, ordered by the bureau to be devel- 
oped by Ford Instrument Co. Unfortunately, there was only 
enough money to order half of the desired corrector. During 
part of that period Ford Instrument Co. was down to a 
three-day week for its employees. 

In the late 1920s, Hannibal Ford began developing the 
first antiaircraft (AA) fire-control system, including both a 
director (Mark 19) and a range keeper. Because of the 
target’s ability to maneuver at high speeds and angular rates 
as seen from own ship, the AA fire-control problem was 
intrinsically much more challenging than was fire control for 
a surface target. Despite the work on AA fire control, 
systems for surface fire control continued to pour from the 
Ford Instrument Co. under Ford’s technical direction. For 
example, the company developed the Range Keeper Mark 
8, which was used in the Marks 24, 31, 34, and 38 Gun 
Directors. Equations and a schematic diagram of informa- 
tion flow in the Range Keeper Mark 8 have been published 
in the open literature, although values of constants in the 
equations were not given.“.” 

The period starting in 1930 saw the introduction of many 
improvements in fire-control systems. One was automation of 
data input into the computer. Friedman’”’ provides the follow- 
ing list of data entered manually in 1933 range keepers: 

Variable Source 
Range Phoned from range finder 
Own ship course Gyrocompass repeater 
Own ship speed Pitometer log 
Target course Initial estimates for rate control 
Target speed Initial estimates for rate control 
Target bearing Automatically from director 
Spotting data Spotter, by telephone 
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By the late 1930s the input of these variables was much more 
highly automated. 

The Gun Director Mark 33 was initiated in 1932 for 
dual-purpose 5-inch/38 guns on ships of all sizes. It resem- 
bled an apple on a stick when it was mounted aloft, and it 
had vibration problems. It was used with the Ford Range 
Keeper Mark 10 for antiaircraft fire, and it had a stable 
element and a computer below deck. A total of nearly 850 
Mark 33s was eventually installed. 

A typical World War II range keeper or computer con- 
sisted of three sections: 


|. Tracking section (the original range keeper functions 
dealing with relative and absolute motions of own 
ship and target). 

2. Prediction section (predicting range and time of flight, 
each from two moving time origins: the time of gun 
firing and the time of fuze time setting; and the re- 
quired gun angles found by considering the ballistic 
functions and wind). 

3. Correction section (calculating and applying correc- 
tions due to own ship angular motions. namely. roll 
and pitch. requiring trunnion tilt and deck tilt correc- 
tions to the gun angles). 


By the time of World War If most main battery fire 
control was done by Range Keepers Mark 8 in Directors 





’s integrator. (Photograph by Laurie Minor, Smithsonian Institution.) 
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Mark 34, mainly for cruisers, and Directors Mark 38, for 
cruisers and battleships.!* The Ford range keepers were 
superseded by the Ford Computer Mark | in the Gun 
Director Mark 37. This director was first tested in 1939 and 
it quickly became the standard dual-purpose director in 
World War II, although many Range Keepers Mark 10 in 
Directors Mark 33 also were built and used. The Bureau of 
Ordnance considered the Computer Mark 1 to be “enor- 
mously — successful.“”’ The system included transmission of 
data to and from the computer below decks by means of 
synchros. Designed originally for the 5-inch/3& guns, it was 
soon modified by Ford Instrument Co. for a number of other 
guns and ammunition types as well. 

Choice of the term “computer” in preference to “range 
keeper” recognized the growing inadequacy of the term 
“range keeper” to describe the system. Keeping range was 
a small part of its function. 

Fine as this fire-control equipment was for 4-invh guns 
and up, it was not suited to the smaller guns and decentral- 
ized control that proved necessary in World War II for 
defense against incoming aircraft in large numbers. More- 
over, the large fire-control systems were not economically 
feasible for use on small naval vessels and merchant ships 
having guns even as large as 3 inches. Fire control for 
close-in attack by a number of aircraft was “sadly neglected 
in the years between the two wars” due to an “ill-founded 
complacency” concerning the ability of fire-control systems 
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of the day to destroy all tar- 
gets at greater ranges.** The 
Japanese exploited this 
weakness with several dis- 
tinct modes of attack. 

Ford Instrument Co. was 
caught up in the rush by the 
Bureau of Ordnance to de- 
velop fire-control systems to 
meet these new needs. Ford 
developed, to various ex- 
: fl tents, the Gun Directors 

Fe ie, Mark 45, 48, and 49 — all 

ee intended for close-in AA fire 

Figure 9. William H. Newell with small guns. The Mark 49 

in 1988. used a gyro to determine lead 

angles based on the preces- 

sion rates measured in track- 

ing the target. It was ready by late 1942, and nearly 350 were 
eventually delivered. 

Ford’s answer to the merchant ship problem was the 
Computer Mark 6 -used with Gun Directors Mark 52 and 
53. Although only about the size of a large wheel of cheese, 
it ingeniously contained a simplified capability for solving 
the surface fire-control problem. 

In spite of all these developments with gyros, reticules, 
and lead computers, they only partly replaced the old open 
sight in World War II. Gunnery and fire-control system 
designers had prepared for a different enemy — one more 
like a towed target remaining at a distance of miles.’ 

Optical range finders gave way to radar in the late 1930s 
and early 1940s. This resulted in a substantial increase of 
capability of searching for targets (with “broad-beam search 
radars”) and tracking targets (with “narrow-beam fire-con- 
trol radar’). No longer was it necessary to illuminate a target 
with star shells at night or lose a target in mist. Moreover, 
the range, target bearing, and elevation signals were cleaner, 
smoother. and more accurate. The measurement of range 
and target direction angles had been freed from the limita- 
tions of the human operator of an optical range finder. The 
advancement of synchros for transmitting and receiving 
data in fire-control systems was a step away from manual 
follow-the-pointer systems. These synchro systems are de- 
scribed in Department of Ordnance and Gunnery publica- 
tions.” 

A few problems existed because the Bureau of Ordnance 
had to deal with other bureaus in getting its equipment 
installed. For many years — until 1943, in fact — the gun 
mount foundations provided by the Bureau of Ships did not 
meet specifications of the Bureau of Ordnance.‘* Presum- 
ably the accuracy of gunnery then improved somewhat. 

One of the most valuable advances was the development 
(about 1940) of powerful control systems for automatic 
training and elevating of guns of all sizes. After the installa- 
tion of automatic control, the guns could fire with precise 
aiming at any time, freeing gunnery from the centuries-long 
dependence on synchronizing firing with rolling of the ship. 
Although the earliest systems were susceptible to oscilla- 
tions and lags.’ improvements in the mathematical design of 





control systems, and (according to William Hampton, then 
a Ford employee) the use of steel piping for greater hydrau- 
lic stiffness, resulted in satisfactory performance. 

Another advance, the “proximity fuze,” made it possible 
to avoid having to set fuze time and incurring the associated 
errors of burst time. Projectiles could be loaded directly and 
fired immediately, and this allowed gunnery accuracy to 
improve even further. 

The entire functional environment of fire-control com- 
puters had to evolve to keep pace with the increased sophis- 
tication of the other components. 


Evolution on the system engineering 
level 


A respectably mature discipline of system engineering 
had developed in naval fire control by the late 1930s and, 
from that time on, the days of the inventor left to his own 
judgment were gone. 

One evidence of system engineering was the standard set 
of symbols that came to be used in equations to designate 
variables, such as ff for time of flight and R2 for advance 
range. Likewise, there was a standardized vocabulary of 
concepts such as “advance range” (the range at time of 
predicted impact) and “time of flight’ (the time from firing 
to impact). As more and more corrections were incorpo- 
rated in the range keepers, even the equations took an 
increasingly standard form which was then imposed by the 
Navy across all manufacturers. Some of these equations are 
given by l'riedman' and the 1941 US Navy Academy 
book.” 

Another evidence of the use of system engineering is the 
top-down generation of specifications, beginning with the 
Bureau of Ordnance, with the manufacturers going into 
greater detail in the specifications. This procedure resulted 
in the systematic production of schematic diagrams, engi- 
neering drawings. training manuals, and other documenta- 
tion. 

Another hallmark of system engineering was the analysis 
of system performance errors: For each Ford Instrument Co. 
product there was calculated a full complement of “class B 
errors.” These were the deviations of the system’s answers 
from theoretical answers calculated from the exact equa- 
tions for specified cases. Analysis of these errors led to 
knowledge of where more accurate calculations were 
needed in the product. The next step was to develop an 
“error budget” that allocated allowable errors among. all 
contributing categories in a hierarchy. The error budget 
pointed to novel developments needed as well as to limits 
on errors of conventional equipment. 

Yet another aspect of system engineering was the analy- 
sis of errors of the enemy’s system, seeking weaknesses to 
exploit. By whatever means were used. the Japanese iden- 
tified opportunities for dive bombers, torpedo planes, toss 
bombers, kamikazes, and so on. These tactical weapons 
presented the ships’ fire-control systems with short-range, 
high-range rate, and/or high bearing and elevation rates, 
where the accuracy of the Gun Directors Mark 33 and 37 
fell off sharply.** That low performance is in contrast to the 
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reported high accuracy with slow 
targets, even at great ranges. (For 
example, the battleship Washing- Characteristic 
ton is said to have achieved nine 


hits on the Japanese battleship me ppucanen 
Kirishima, out of 75 rounds of 1f- 
inch shells at 19,000 yards range in 
the night battle of Guadalcanal in eaysnennien 
1942, where radar was used.) 

Construction 


The contributions of 
William H. Newell 


In 1926 the Ford Instrument Problem size 
Co., which was then working on its 
first antiaircraft director, got a new 
employee: William H. Newell, aged 16. He worked first in 
the shop making high-precision mechanical computing com- 
ponents and, a year later, transferred to the Test Depart- 
ment where he acquired the techniques of making mechan- 
ical analog computers perform to their limits. In the 
evenings for seven years he went to the College of the City 
of New York to study engineering. He advanced rapidly as 
a result of his nearly unique talents as an inventor, designer, 
and developer of mechanisms and indeed, like Hannibal 
Ford, entire computing systems. In 1943. at age 32, he 
became chief engineer. 


Design style 


Newell’s inventions. Newell (see Figure 9) has received 
80 patents in connection with his work. The subject matter 
was long classified, so the public has not known of his 
contributions. Any attempt to determine Newell’s accom- 
plishments by concentrating on patent dates is difficult be- 
cause the date of filing for a patent might have been much 
earlier than the date of issue due to secrecy orders prevent- 
ing responsive issue. 

Among Newell’s mechanical. hydraulic. and electrical 
inventions (see Appendix) were 31 devices of fundamental 
importance to analog technology. Included are devices such 
as a hydraulic computer: an irreversible drive involving 
wedges to lock two disks if direction starts to reverse, as in 
back torque from gun recoil: a torpedo director (Mark 2); a 
director for defense against horizontal bombing runs; a 
scheme for using trains of balls, with wheels and steering 
rollers, to integrate complicated trigonometric functions 
and solve the fire-control tracking problem; and a comput- 
ing device for predicting the deck angles of an aircraft carrier 
at the instant an airplane would be landing. 

Many of these inventions concerned ways to deal with 
inertia and friction loads on the driving mechanisms. They 
were essentially servos, then usually called “follow-ups.” 
that provided torque amplification while following a shaft 
angular position signal. These servos had a differential gear 
for comparing the output angle of the servo with the input 
signal angle, producing an error angle, which determined 
the signal to the drive to reduce the error-that differential 
gear was represented on schematics by a cross in a circle. a 
symbol which is still used on schematic diagrams for the 


Differential 


Solution of arbitrary differential 
equations sets (general-purpose 
computer) 


On “solid ground” in a building 


Originally spread out on a large 
breadboard for flexibility 


Laboratory 
practice 


Several differential and 
algebraic eouations 


Table 1. Differences between differential analyzers and fire-control computers. 


analyzers Fire-control computers 


Computing continuous aiming 
and fuzing of naval guns 


In a moving warship 
experiencing severe shocks and 
vibrations 


Designed into minimum volume 
for shipboard use 


Rugged, yet precise machine 
design 


instrument design 


Many differential and algebraic 
eauations 


error-determining subtraction in control systems of many 
types today. 

The Ford Instrument equipment often used an “intermit- 
tent drive,” a device that enabled one part of the equipment 
to drive another over only a limited part of its total travel. 
Ford had designed the first intermittent drive, but Newell 
improved the design, putting the whole drive on one shaft. 


The significance of Newell’s work. One of the hallmarks 
of Newell’s work has been that he took extra trouble to find 
the neat and simple way to do things, rather than go ahead 
with his first idea. A notable testimony to Newell’s and Ford 
Instrument’s skills was that Wernher von Braun selected 
them to build the mechanical and gyro guidance system for 
the first Redstone missile. Ford Instrument Co. built also 
the guidance system for the Jupiter missile. 

Newell’s work was done with originality and self-reliance. 
One might wonder if he got ideas from other organizations in 
those days of technical ferment. However, Newell has denied 
that he got ideas from MIT’s differential analyzers or Servo 
Lab work: In fact. MIT bought Ford components, and Newell 
believed that Ford Instrument was “ahead.” According to 
Newell. Bell Telephone Laboratories, the Naval Research 
Laboratory, the Office of Naval Research, the ENIAC project, 
and the university researchers, including such avid communi- 
cators as John von Neumann, Harold Hien, Jay Forrester, 
Claude Shannon, Marberl Wiener, Warren Weaver, and 
Vannevar Bush, had no effect upon his work. 

From 1965 to 1977, Newell worked for Perkin-Elmer, in 
Norwalk, Conn.. on challenging projects such as the space 
telescope, first on the senior technical staff and then as a 
consultant. But that is another story worth telling. 


Other mechanical analog computers 


At this point in the story, attention is turned from fire 
control to other specialized applications of mechanical ana- 
log computers. The author makes no attempt to describe the 
type generally known as a “differential analyzer” because it 
is already adequately described in other places -except to 
distinguish it from the computers used in fire control. Dif- 
ferential analyzers differed dramatically from _fire-control 
computers. as shown in Table 1. 
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These were the two distinct species that represented the 
high point of mechanical analog computer development, 
each in its own way. Williams’ felt that *‘the analog tradition 
reached its height in the differential analyzers.” This author 
disagrees that either species was superior. 


Torpedo mechanical computers. Torpedo data comput- 
ers for use by submarines were developed by the Arma 
Corporation in 1935. Arma had been building stable ele- 
ments and other gyroscope instrumentation for weapons 
since its founding in about 1920. The torpedo data computer 
automated much of the process of inserting data into a 
torpedo to establish its course, speed, and depth. It was 
primarily a mechanical computer with some electrical com- 
ponents. By World War II most submarines in the US Navy 
had a TDC Mark 3.'’ A simpler and more compact version 
of the torpedo data computer, the Mark 2, was developed 
by William Newell (see item 5 in the Appendix). 

Destroyers of that period carried Torpedo Director 
Mark 27, which contained a mechanical computer. A num- 
ber of approximations could be made, because the resulting 
errors could be ignored when torpedoes were fired in a 
spread. As a result, the equations were much less complex 
than those of the antiaircraft fire-contrel problem.'” As early 
as 1942, the Bureau of Ordnance conceived of a need for a 
system for computing and displaying the data of concern in 
antisubmarine warfare. The resulting product was the At- 
tack Director Mark 2, which contained a mechanical com- 
puter. Fifteen were delivered.” 

In the early 1950s, Arma built a mechanical analog com- 
puter (“coordinate conversion computer’) containing a 
gimbal system. Designed at MIT, it was one unit of a fire 
control system for use by the Navy in the Korean War. The 
torpedo itself contained several small mechanical analog 
computers. They were extremely delicate and complex, with 
the result that their effectiveness was reduced. These com- 
puters included the following mechanical devices: 


1. The course control system that activated a rudder. 

2. A computer to determine the course angle for colli- 
sion with the target. 

3. A depth-control system, relying on a diaphragm to 
measure depth (water pressure) and a pendulum to 
measure rate of change of depth. The pendulum was 
later replaced by a gyroscope to avoid the error due 
to longitudinal acceleration. The change was Newell’s 
idea.” (See item 23 in the Appendix.) 


Bombsight mechanical analog computers. Another 
highly specialized type of mechanical analog computer was 
developed for use in bombers. Bombsights were remarkable 
for their extremely small size and high precision. The 
Norden bombsights contained over 2.000 parts. Develop- 
ment began at the end of World War I and progress was 
rapid: The Bombsight Mark 3 was contracted for in 1922. 
the Mark 11 was accepted in 1931, and the Mark 15 was 
being tested in 1931.” Bombsights were also made by 
Sperry. 


One of the refinements to bombsights was the invention 
by Newell and Lawrence Brown that enabled a bomber to 
navigate by some identified visible point, when the target 
itself was obscured, and yet still bomb the target. 


Sights and directors for small guns. Major naval vessels 
had no small guns until after Pearl Harbor, when the large 
numbers of incoming aircraft had overwhelmed the fire-con- 
trol systems for large guns. As a result, a rapid evolution had 
to take place to provide something better than the open sight 
mounted on the gun barrel, which had been standard arma- 
ment against aircraft in World War I. 

A significant advance was made by the lead-computing 
sight developed in the 1930s by Charles S. Draper of MIT. 
Draper’s sight evolved from his earlier products of an air- 
craft instrument to display rates of turn and his tank gun 
sight. These devices used precessing-rate gyros mounted on 
the line of sight to the target, Each rate was multiplied by a 
suitable factor to produce a proportional lead angle, which 
was applied to the gun direction.” The overall precision was 
on the order of 2 percent. The Navy learned of the Draper 
sight belatedly: One was tested in July 1941, and the sights 
entered service in the fall of 1942 — built by Sperry and by 
Crosley. Eventually 85,000 of the Gun Sights Mark 14 were 
bought for naval vessels. 

The US Navy’s response to the need also included the 
development of some heavy machine-gun directors. Con- 
tracts for development were awarded to Ford Instrument 
for the Gun Director Mark 45, to General Electric for the 
Mark 46, and to Atma for the Mark 47 (the Mark 46 and 
47 never reached production). The Mark 45 was com- 
pleted as early as 1942: however, it was too complicated 
and heavy as a computer, and it was too crowded as a 
workplace, so production of it was stopped. It was re- 
placed by the Gun Director Mark 49, which also was 
being developed by Ford Instrument. The Mark 49 con- 
tained a gyro torqued hydraulically to pfirecess it, and it 
had hydraulic pick-offs. The Mark 49 was replaced by the 
Mark 51.‘* Located on a pedestal remote from the guns, 
it used a Draper sight to transmit train and elevation 
angle orders to heavy machine guns. It was manufactured 
by Sperry Gyroscope Co., beginning in January 1942.'- Its 
performance was poorest for surface targets, which had 
small angular rates as seen by the sight. 

Gun Director Mark 56 was designed at MIT. It utilized 
an unusual mechanical analog computer technology: [vur- 
bar linkages. By properly proportioning the bar lengths, one 
could design linkages to generate a surprising variety of 
functions. Some of the linkage computers were made by 
Ford Instrument Co. Vannevar Bush, in his role as one of 
the organizers of the National Defense Research Commit- 
tee. was able to do much for small gunfire-control develop- 
ments. and he had a hand in its production. 

In addition to the naval gun sights and directors men- 
tioned here for heavy machine guns. comparable or smaller 
systems were developed for use in aircraft, such as the 
largest bombers (B-29). There was. for example, a Mark 18 
Turret Gun Sight. which had a computing mechanism. It was 
followed by the Mark 23 in 1945,"" 
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Other analog mechanical computers. Flight simulators 
for pilot training have been in existence since the “Pilot 
Maker,” alias “Blue Box,” of Ed Link, developed in 1¥29,/ 
Link’s flight simulator contained a pneumatic analog com- 
puter that used principles he had learned in his father’s 
organ factory. A mechanical analog flight simulator was 
designed and built by Ford Instrument Co. in 1944, Later 
flight simulators were based on electric and electronic ana- 
log and then digital technology. Mechanical analog commpul - 
ers were used also in early guidance systems for missiles: 
‘rma did the inertial guidance for the Atlus missile. William 
Newell also invented a guidance system that worked without 
gimbals, integrating components of acceleration and veloc- 
ity to determine present position (see item 27 in the Appen- 
dix). 

The range of the German V2 rocket was determined by 
a mechanical analog computing device. It integrated accel- 
eration twice to get distance traveled; it also contained some 
linkages and differential gears to relate the twice-integrated 
acceleration to horizontal distance.” As the technology was 
refined, new applications were undertaken. Most of these 
and other mechanical analog computers were eventually 
superseded by electrical analog computers. 


The descendants of mechanical analog 
computers 


Mechanical analog computing evolved in two directions, 
branching into developments in AC analog computers and 
DC analog computers. 


AC analog developments. In about 1940 the market for 
tools for performing mathematical operations was quite 
small. Mechanical desk calculators served acceptably for all 
but the largest problems, such as fire control and exterior 
ballistics. When Thornton C. Fry wrote a survey article’” 
about the extent of the use of mathematics in industry, he 
had little to report outside the telephone and aircraft indus- 
tries. One could not then imagine the explosion of electrical 
and electronic technologies that would result in a flood of 
computers available at modest cost. 

The principles of AC (alternating current) electrical an- 
alog circuits had been known since Steinmetz in the 1880s. 
Currents entering a node were known to add. The charge 
on a capacitor was known to be the time integral of the 
current that had flowed through it. It was known that a 
servo-driven potentiometer could be “tapped” to yield a 
function or a product of two variables. {his technology was 
not developed, however, until Bell Telephone Laboratories 
found application for it in a developmental gun director 
early in World War II. 

The BTL project was to develop an AC analog gun 
director, the T-15. It was funded in November 1941, and the 
model was completed a year later and tested in December 
1942.” The T-15 was never put into production: it was, 
however. used for research with targets flying trajectories 
that were not straight lines. 

The T-15 led to a proposal to the Navy, in February 1942, 
to construct an AC analog version of the Ford Instrument 


Company’s Computer Mark !. A contract was awarded in 
September 1942 for development of this “Mark 8 Com- 
puter.” Although it proved to be faster than the Computer 
Mark | in completing the initial transient of acquiring and 
locking onto a target, the Mark 8 Computer was never 
produced. It had one other feature worth noting: a special 


A refinement to bombsights invented 
by Newell and Lawrence Brown 
enabled a bomber to navigate by a 
visible point, when the target itself was 
obscured, and yet still bomb the target. 


electrical integrator that was developed for it. 

Ford Instrument Co., under the direction of Harry Mc- 
Kenny and William Newell, developed an AC analog com- 
puter, the Mark 47, which replaced the mechanical analog 
Computer Mark 1. 

From 1945 to 1950 the Dynamic Analysis and Control 
Laboratory at MIT developed an AC analog computer, 
using ‘Will-cyele AC components in a guided missile flight 
simulator. This was an activity within Project Meteor. The 
flight table was mounted on four concentric gimbals so 
driven as to avoid gimbal lock under all conditions. 


DC analog developments. DC (direct current) amplifiers 
had been used since the post-World War I days of radio. 
They were highly developed in the 1930s by BTL, which 
used them for signal amplification in telephony. They were 
used also by George Philbrick at Foxboro, as early as 1937 
or 1938, for simulation of linear processes and control sys- 
tems.” Developments of amplifiers for use in simulation 
were made also by John Ragazzini et al. at Columbia Uni- 
versity in about 1940. Bell Telephone Laboratories devoted 
itself to the development of DC vacuum tube amplifiers for 
use in analog computers for fire control after about June 
1940. A patent. applied for in May 1941, was issued in June 
1946 as US patent 2404387 to C.A. Lovell, D.B. Parkinson, 
and B.T. Weber. Their contemplated systems used summing 
networks, potentiometer cards for functions, and an integra- 
tor using an amplifier and a capacitor. " 

In November 1940 Western Electric received a contract 
to develop a model of a DC analog gun director, the T-10. 
It was to use the BTL-developed DC analog technology. 
The model was tested successfully in December 1941.” 

The success of the T-10 led to a contract to build the 
production version, the M-9 Gun Director. It was delivered 
in December 1942, and it was placed in service in early 1943. 
It was used during the Vl “buzz bomb” attack on London 
to control the fire of 90-mm guns located along the English 
coast. During the month of August it shot down 90 percent 
of the buzz bombs that arrived, and in its best week it shot 
down 89 of the 91 that arrived. The M-9 (see Figures 10 and 
11) was aided by radar and proximity fuzes.-' A British 
version of the M-9 (the T-24. directing 4.5-inch AA guns) 
had its prototype completed by May 1%42.7° 
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Another offspring of the M-9 was the “M-8 Gun Data 
Computer,” which BTL developed for the US Coast Artil- 
lery Board for control of 6- to 8-inch guns firing at surface 
targets. The M-8 corrected for the parallax angles of differ- 
ent guns firing at the same target and also corrected for the 
earth’s curvature. It was never used in combat, because 
there were no targets for it.*” 

Lest it be gathered that all electronic analog developments 
in World War II were made by Bell Telephone Laboratories, 
note that the Arma Corporation developed, starting in the 
summer of 1940, an electronic analog antiaircraft computer for 
the Mark 47 Gun Director. It was to control 40-mm machine 
guns, but in 1941 it was changed to the 3-inch gun and was to 
be incorporated in the Mark 50 director. Deliveries of 43 units 
began in May 1943, but the computer had some serious diffi- 
culties: It weighed too much, and it was too complex for feasible 
mass production and for ease of maintenance. The system was 
further complicated by the fact that the electronic ballistic 
converter and [uz¢ order computer had to control 40-mm. 
L.Janch, 3-inch/S0), and 5-inch/38 guns.” 

The promise of BTL’s early electronic analog gun direc- 
tors encouraged other computer developments in World 
War II. One, the AN/APA-44, was a bombing and naviga- 
tion computer for aircraft. BTL also developed electronic 
analog flight simulators for pilot training for the PBM-3 
Martin Mariner patrol bomber, the Grumman Hellcat 
fighter, and the Consolidated Privateer patrol bomber.’t 

After World War II, Project Cyclone was established to 
develop a DC analog computer for general-purpose appli- 
cations. The work was done by the Reeves Instrument 
Corporation. Very soon there were competitive commercial 
products available from Electronic Associates, Inc., Applied 
Dynamics, Inc., and eventually about 30 more companies. 


Figure 10. M-9 gun director in action. The tracking unit with its two operators is in the foreground, while the computing units 
are in the truck. 


These “analog computers” became the tools of choice for a 
generation of control system designers, missile and aircraft 
designers. and analytical engineers in all branches of engi- 
neering for purposes of dynamic and often real-time simu- 
lation. These developments left the AC analog computers 
far behind in accuracy and other performance features. One 
of the key steps was chopper-stabilization of the DC ampli- 
fiers, which otherwise had a maddening drift. 

One of the people who worked almost anonymously 
behind the scenes in this period was Perry Crawford at the 
Naval Special Devices Division. He had a hand in the ad- 
vanced thinking underlying Project Cyclone. He also had 
some influence upon the course of Project Whirlwind, an 
early digital computer developed at MIT which is best re- 
membered for its magnetic core memory by Jay Forrester. 
Crawford had written two provocative theses at MIT*"** 
which contributed to the frontier thinking of the time toward 
electrical digital computers.’ 


The defeat of mechanical analog 
computers 


The beginning of the end for mechanical analog computers 
as the computers of choice in fire-control systems began just 
before World War II. They were then at their zenith. No 
competition was in sight. yet the computers that would replace 
them in less than a decade were already in development. 

Mechanical analog computers for fire control were much in 
demand as a result of the rapid growth of the US Navy in those 
days. Accordingly, the Bureau of Ordnance was anxious that 
Ford Instrument Co. might not be able to manufacture them 
fast enough to meet the need. There were critical skills, ma- 
chine tools. and materials that were in short supply, any one of 
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which could have produced a fatal bottleneck. It was only 
prudent that the Bureau of Ordnance then sought alterna- 
tives on a second-source-of-supply basis.*' 

The government’s expenditures for electrical and elec- 
tronic analog computers for fire control and aircraft simula- 
tion have been mentioned. This flow of money sufficed to 
fund the necessary research and development. The sudden- 
ness of the emergence of electrical and electronic analog 
computers is easily attributable to the equally sudden 
awareness of a need. 

It seems plausible that the lack of such funding and 
procurement desire in the previous years was _ responsible 
for the relative stagnation of electrical and electronic ana- 
logs. This stagnation existed in spite of the almost-ready 
availability of virtually all of the required electrical and 
electronic analog components. One of the reasons for the 
stagnation is that the mechanical analog people believed 
firmly that no electronic computer could survive the on- 
slaught of the shipboard shock and vibrations in battle upon 
vulnerable vacuum tubes and solder joints. Probably this 
thinking also kept electrical components, except the sturdy 
servos and synchros, out of mechanical analog computers. 

No one had realized the cost in battle due to the sluggish- 
ness of even the fastest mechanical computers in converging 
upon a target. This discovery was not made until speedier 
electrical analog competitors were developed and demon- 
strated. However, once discovered, this feature of the elec- 
trical analogs proved to be essential in dealing with a multi- 
plicity of very fast aircraft and missiles as targets. 

Another reason for the lack of effort to develop electrical 
analog computers until just before World War II was that 
the required parts (resistors, potentiometers, and capaci- 
tors) lacked sufficient precision for fire control. The neces- 
sary precision was, however, developed when the need 
materialized. 

During World War 11 the electrical analogs were on the 
scene and were being rapidly developed with funds diverted 
from mechanical analogs. Moreover, with production came 
cost reductions for electrical analog which could not be 
matched by the precision mechanical computers. Similarly 
the size and weight of electrical analog computers came 
down rapidly to be more than competitive. The scales were 
tipping in favor of the electrical analogs. By the time they 
tipped all the way, it had been a sudden process over only a 
few years. The shift of contracts to electrical analog com- 
puter manufacturers and the general reduction in level of 
postwar spending crippled the manufacturers of mechanical 
analog computers. 

Mechanical analog technology died back but has not, 
even yet, died out. It is still in use where precise mechanical 
results are required, such as in very large telescopes, printing 
presses, and movable antennas. Mechanical analog technol- 
ogy survives also in many more subtle ways. For example, 
the “schematic diagrams” of mechanical analog computers 
evolved into “analog diagrams” for DC electronic analog 
computer problems or systems (in general- or special-pur- 
pose computers, respectively). Similar diagrams are often 
used in control engineering, digital computer simulation 
technology, and Forrester’s “system dynamics.” The pres- 
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Figure 11. M-9 gun director (covers off). 


ent trend toward massive parallelism in digital computers 
also will continue the need for the analog type of diagram 
well into the future. 


The short reign of electrical analog 
computers 


While the AC and DC analog computers were replacing 
mechanical analog computers. their own eventual succes- 
sors -— the digital computers -were appearing and growing 
in capability. Since that story is well documented in the 
Annals of the History of Computing, it is not repeated here. 
Suffice it to say that electrical and electronic analogs had a 
much shorter reign than mechanical analogs. From Ford’s 
Range Keeper Mark 1 to the virtual stoppage of production 
of mechanical analog computers in the 1YASUs there was a 
reign of about 40 years. The electrical and electronic ana- 
logs, however, reigned supreme only about 10 years before 
they were surpassed and replaced by digital technology. 


large measure of the historical importance of mechan- 

ical analog computers stems from their service in naval 
fire-control systems from World War I to somewhat beyond 
World War II. Much of the credit for US naval fire-control 
systems stems from the design and performance of the Ford 
Instrument Company’s mechanical analog computer prod- 
ucts, including developments from Range Keeper Mark 1 to 
Computer Mark !. These computers were superbly accurate 
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despite their need to be rugged under the abuse of shocks 
and vibration in battle. 

The outstanding inventors and developers of the Ford 
Instrument computers were Hannibal C. Ford and William 
Newell. Their technical leadership, which spanned four de- 
cades, provided a unique corporate capability. 

Ford and Newell deserve to be recognized as mechanical 
geniuses at least on a par with Vannevar Bush. Bush has 
become the better known by far, because of his differential 
analyzers, because of his writings, and because of his visibil- 
ity as an administrator on the national level. In contrast, 
Ford and Newell worked exclusively on classified projects 
unknown to the public, modestly wrote nothing, and were 
administrators only within the company. They let their in- 
ventions and developments speak for them. 

It is unfortunate that the story of Ford and Newell has 
not been known and appreciated among engineers and the 
general public. The US Navy has had the facts all along, but 
it could not speak for many years because of the need for 
secrecy. The material could not be declassified until it no 
longer had current military importance. As a result, only 
those who were involved in the work have been privy to 
much of the story. 

Likewise, in the author’s opinion. mechanical analog 
computers for naval fire control deserve a featured place in 
the history of computing, as differential analyzers have 
enjoyed. 

The outlook for future mechanical analog technology is 
confined to some highly specialized opportunities where its 
advantages outweigh its disadvantages. These opportunities 
are most likely to arise for one or two components rather 
than complete computers. The glory lies in the past. 

Thus, the story of mechanical analog computers deserves 
a place in the history of computers. It is truly important in 
its own right and, in addition, the technology served as an 
early stepping stone toward today’s digital computers. UH 
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Appendix 
Among Newell’s mechanical. hydraulic. and electrical 
inventions were the following: 


1. A hydraulic computer. plus some hydraulic compo- 
nents, such as a device to generate a hydraulic pres- 
sure proportional to a displacement, and a hydraulic 


ea 


10. 


ue 


torquer and pick-off for a gyro in a gun director. 
Patents 2317293. Apr. 20, 1943: 2405052, July 30, 
1946: 2483980, Oct. 4. 1949: 25 13888, July 4, 1950: 
2533306. Dec. 12. 1950: 2550712, May 1, 1951; 
2504571. Oct. 2. 1951: 2766587, Oct. 16, 1956. 
Various rotary damping and/or inertia devices to be 
attached to a servo shaft to smooth the mechanical 
output with a low-pass filter. One of these, called a 
‘*k-motor,” acted only when the signal got rough, 
Patent 2400775. 

Poitras and Tear of Ford Instrument developed an 
arrangement making a follow-up motor’s speed pro- 
portional to error. thereby obtaining an exponential 
characteristic. making it a ‘velocity-lag servo.” This 
used a drag cup and gave an error proportional to 
velocity. To eliminate this error there was intro- 
duced a differential gear between the motor and 
drag cup with an inertia on the other differential 
input. which gave a smaller error proportional to 
acceleration. but no error proportional to velocity. 
Newell. in one application. used an air dashpat to 
obtain the velocity-lag servo ettect. 

An irreversible drive involving wedges to lock two 
disks if direction starts to reverse. as in back torque 
from gun recoil. This device prevents stick-slip oscil- 
lation when driving an inertia. whereas an “irrevers- 
ible’ worm drive does not stop stick-slip. Patents 
2266237, Dec. 16.194 |: 2402073. June 11, 1 94h. 


. A torpedo director (Mark 2). Newell simplified the 


mathematical basis. which enabled the size of the 
computer to be cut in half. Six of these systems saw 
service in World War II. Patent 2403542. July 9, 
tosh, 

A director for defense against horizontal bombing 
runs. By restricting its applicability. Newell was able 
to do it with a much simpler computer than was in 
use. Patents 2403543. July 9, 1946; 2403544. July 9, 
1 O46, 


. A combination of a coarse and fine synchro, using a 


cam-driven link to switch between coarse and fine. 
The patent application was filed in 1934, but the 
work had been done before that. Patent 2405045, 
July 30. 1946. 


. A single-ball integrator with a rack to eliminate 


tangent function effect. Patent 2412468. Dec. 10. 
1046. 

A scheme to prevent large inertial load on a hydrau- 
lic servo from overshooting. which involved intro- 
ducing a spurious signal to start slowing it down 
belere it reached the intended position. This was 
particularly important in synchronizing S-inch guns 
and in bringing heavier guns to a loading position. 
Patents 2427154. Sept. 9.1947: 2840992, July 1, 1954. 
A triangle mechanism to generate the square root of 
the sum of the squares of two input position vari- 
ables. Patent 2435818. Mar. 30. 1448. 

A scheme for using trains of balls, with wheels and 
steering rollers. to integrate complicated trigono- 
metric functions and solve the fire-control tracking 
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problem (related to the earlier fundamental work of 
Maxwell, Ventosa. Hele-Shaw. and Smith).‘’ Patent 
2528284. Oct. 3 !. 1950. 

An electrical servo (with Henry F. MieKenny). Pa- 
tents 2448387, Aug. 31,1948; 2546277, Mar. 21.1951. 
A printing press registration scheme using a photo- 
cell (with McKenny). Patent 2576529. Nov. 27, 195 }. 
An electronic analog resolver — given a magnitude 
R and an angle A. it computes the components R sin 
A and R cos A continuously while compensating for 
the magnetic distortion of the R input. Patent 
2646218, July 21. 1953. 


. A “rate control” system whereby splash- or hurst- 


point error data generated by a spotter topside 
would cause automatic continuous computation of 
corrections to target course and speed (patented in 
the name of Ford et al.). Ford had developed a rate 
control system that reversed the computation and 
found target course and speed, but in doing so inter- 
rupted the generation of the prediction problem. 
Newell used component integrators to generate cor- 
rections to the target course and speed from the 
spotting corrections without interrupting the conti- 
nuity of the fire-control solution. Friedman gives the 
equations.” Patent 2702667, Feb. 22. 1¥35., 


. A rhumb-line mechanical (later electrical) computer 


for Air Force navigation along a great circle from 
one given longitude/latitude to another. Thousands 
of them were built. Patent 2783942. Mar. 5.1957, 


. An offset bombing director to allow homing on an- 


other point when the target cannot be seen (with 
Lawrence Brown). Patent 2615170, Dec. 3. [YS7, 
A mechanical integrator with reduced friction §sur- 
face area. Patent 2693709, Nov. 9,144. 


. An “error reducer” unit for reducing greatly the 


pointing errors of main battery guns (developed in 
about 19M}, Patents 276358. Sept. 25, 1956: 
2800769. July 34). 1957. 

An electrical device containing tapped potentiome- 
ters for generating a class of functions of three vari- 
ables. Patent 2817478. Dec. 24. 1957. 

A computing device for predicting the deck angles 
of an aircraft carrier at the instant an airplane would 
be landing. Patents 2817479, Dec. 24.1957: 2888195. 
May 26. L954: 2888203, May 26.1959: 2978177. Apr. 
4. 1961: 2996706. Aug. 13.1461: 3174030. Mar. 16. 
1965. 

A parachute-release device. with Howard Brevoort. 
Patent 2834083, May 13. 1958. 

A device for squaring using a cone and cylinders 
(with S. Rappaport). Patent 2854854. Oct. 7. 19%. 
A computing module for correcting for the tilt of gun 
trunnions. Patents 2902212. Sept. I. ]Ya¥: 2920817. 
Jan. 12, 1960; 1967663. Jan. IO. 161. 

A depth control for torpedoes using a gyro to sense 
attitude. It avoided the error in the previous Uhlan 
gear design, which had been due to use of a pendu- 
lum for attitude sensing. During initial acceleration 
this gave a spurious attitude signal which caused a 
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deep and many times disastrous dive. Patent 
2920596. Jan. 12, 1960. 


A torpedo motion simulator for engineering pur- 
poses based on the torpedo equations of motion, 
including the water mass and inertia associated with 
the torpedo. Such a simulator was built for develop- 
ment purposes at Ford Instrument Co., possibly the 
first torpedo simulator. 


A “strapped-down” navigation system not using any 
gimbals (developed on a contract in 1958). In a 
personal communication, Newell said he considers 
this to be one of his potentially most important 
inventions. Patents 3049294, Aug. 14, 1962; 2087333, 
Apr. 30. | Y63. 


A scheme for developing an electric current from a 
hot rod and a magnetic field. This is the other inven- 
tion that Newell considers to be potentially most 
important. Patents 3075096, Jan. 22. 1963; 3084267, 
Apr. 2. ]63. 


Newell and Willard B. Constantinides developed a 
deck-tilt corrector which corrected gun angles ap- 
proximately for the level and cross level angles of the 
deck. 


The mechanical analog technology was extended in 
1945 for the development of a bomber navigation 
trainer, mainly by Willard B. Constantinidesof Ford 
Instrument Co. It solved the equations of motion of 
an airplane with far greater generality, realism, and 
precision than the contemporaneous pneumatic 
computers in the famous Link trainers, which dealt 
only with small linear perturbations about steady 
flight. To record the trajectory of the airplane as 
projected on the horizontal plane, the Ford simula- 
tor drove electrically and remotely a mechanical 
“crab” that drew a curve on a large sheet of paper 
on the floor. 


A scheme for using resistors (standard but trimmed 
to precise values of a 1 {HM]- 1 range) to obtain ampli- 
fier input gains, which was patented. 


In the foregoing list the items that were mainly electrical, 
as distinguished from mechanical or hydraulic. were nos. 12, 
13. 14. 16.20.29. and 31. 


Many more people than have been mentioned played 
notable roles under Ford and Newell. Certainly the follow- 
ing at least also deserve to be named here: Ray Jahn, George 
Crowther. George Hamilton. Charles Buckley, Walter Cxin- 
able (the nephew of H.C. Ford), John Kallenberg, Howard 
Brevoort. and Elmer Garrett. During World War II they 
were assisted by Charles Henrich, Charles Pond, Kenneth 
Crawford (brother of Perry). Rasmus Figenschou (of Nor- 
way). John Hauser. George Licske. Mrs. George Elder (née 
Athena Rosarkv). «4]01s Mertz. and the author and other, 
then junior, design engineers. 
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Abstract 


English version 


Computer simulations and computer aided design in the past decades have evolved into a valuable 
instrument, penetrating just about every branch of engineering in industry and academia. More 
specifically, computational fluid dynamics (CFD) simulations allow to inspect flow phenomena in a 
variety of applications. As simulation methods evolve, mature, and are adopted by a rising number 
of users, the demand for methods which not only predict the result of a specific configuration, 
but can give indications on how to improve the design, increases. 

This thesis is concerned with the efficient calculation of sensitivity information of CFD al- 
gorithms, and their application to numerical optimization. The sensitivities are obtained by 
applying Algorithmic Differentiation (AD). 

A specific emphasis of this thesis is placed on the efficient application of adjoint methods, includ- 
ing parallelism, for commonly used CFD finite volume methods (FVM) and their implementation 
in the open source framework OpenFOAM. 


Deutsche Version 


In den vergangenen Jahrzehnten haben sich Computersimulation und Computer gestttzte Design 
Methoden zu einem wertvollen Instrument entwickelt, welches nahezu jeden Bereich der technisch- 
und naturwissenschaftlichen Forschung und Wirtschaft beeinflusst. CFD Simulationen erlauben 
es, Stromungsphanomene in einer Vielzahl von Anwendungen zu untersuchen. Je mehr numerische 
Simulationsmethoden sich fortentwickeln und Anwendung finden, umso grofer wird der Bedarf 
an Methoden, die nicht nur das Ergebnis von spezifischen Konfigurationen vorhersagen konnen, 
sondern auch Hinweise zur Optimierung des Designs geben konnen. 

Diese Arbeit beschaftigt sich mit der effizienten Berechnung von Ableitungsinformationen auf 
CFD Algorithmen und deren Anwendung zur numerischen Optimierung. Die Ableitungen werden 
mittels Algorithmischen Differenzierens (AD) generiert. 

Ein besonderer Augenmerk dieser Arbeit liegt auf der effizienten Anwendung adjungierter 
Methoden, unter Berticksichtigung von Parallelismus, im Kontext von popularen CFD Finite- 
Volumen Algorithmen, und deren Implementierung im Open-Source Stromungsloser OpenFOAM. 
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1 Introduction 


1.1 Outline 


The focus of this thesis lies on the efficient generation of sensitivity information for computational 
fluid dynamics (CFD) applications, using the discrete adjoint approach. Methods and algorithms 
are presented in a general CFD setting, and are implemented in a discrete adjoint OpenFOAM 
framework. ‘The application of these sensitivities to specific optimization problems is demonstrated 
in several cases. ‘lhe presented optimization cases are to demonstrate the flexibility of the 
sensitivities obtained by the discrete adjoint framework, and do not necessarily represent the 
state of the art in numerical optimization. A thorough discussion of state of the art constrained 
optimization techniques would be outside of the scope of this thesis and remain as an avenue for 
future research. 

This thesis is structured as follows. Chapter 1 introduces notations and conventions used in this 
thesis. Related work and the most important contributions of this thesis are briefly summarized. 
Furthermore, a high level overview over numerical simulation and optimization is given. ‘The 
most important theoretical foundations, as well as the motivation for this thesis, are laid out 
in Chapter 2. Basics of the CFD method, Algorithmic Differentiation (AD) and optimization 
are covered. 

Building on the foundations, the application of AD to CFD algorithms is discussed in detail in 
Chapter 3. Starting from a black-box approach as a proof of concept, a variety of improvements 
are presented, which increase the efficiency of AD in the context of iterative CFD solvers. Parallel 
adjoint communication is incorporated into the CFD solvers to retain the primal parallelism, 
requiring adaptations of the linear solvers. 

Using these results, Chapter 4 discusses different strategies implemented to more efficiently 
generate adjoints for steady state simulations. Furthermore, the generated adjoint sensitivities 
are verified against tangents and continuous adjoints. Alternative optimization methods, such as 
parametric optimization are discussed and implemented. 

Extended case studies for both topology optimization and shape sensitivities are presented 
in Chapter 5. A scaling study on current HPC hardware, the RWTH compute cluster, was 
performed. The results showcase the scalability of both the primal and adjoint implementations. 

This thesis closes with an overview over related work, carried out or supervised by the author, 
as well as a summary and outlook. A brief developer documentation for the discrete adjoint 
OpenFOAM implementation can be found in the appendices. 

















1.2 Notation 


Here we introduce the most important notations and conventions, used throughout this thesis. 
Non-standard notations will be reintroduced once they first become relevant. 


Vectors: Bold letters, individual entries are numbered starting from zero, e.g. v = |vo, V1, v2]- 
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For spatial information, alternatively alphabetic indices v = [vz, vy, vz] might be used. 


Matrices: Upper case letters, individual entries are denoted with lower case letter and numbered 


starting with zero. E.g.: 
A= (‘oe a | 
aio 411 


Scalar product: The scalar (inner) product between two vectors x,y € R” is denoted by 
=) 
XY =x y= ry Lig. 


Outer product: The outer product between two vectors x,y € R” is denoted by 
x®@y= xy! cE RrxXn, 


Functions: Lower case letters or in calligraphy, e.g. f(x) or J(y). 


Placeholder: e stands for a placeholder variable, to which super or subscripts and modifiers can 
be attached, e.g. © = e). 


Unit systems: If not otherwise specified SI units are used. 





Unit system for bytes: For the designation of bytes, we use the binary multiple system, dif- 
fering from the SI definition, also known as TiB, GiB, MiB, KiB: 1TB = 1024'GB = 
10247 MB = 1024° KB = 1024? bytes. 


Spatial and temporal relations: In the context of FVM discretizations, spatial relations are 
indicated by sub indices, e.g. x;. Temporal (or pseudo temporal iteration) relations are 


indicated by upper indices, e.g. se = 4; xy.) 


Powers: To avoid confusion with the temporal indices, if not immediately obvious by context, 
we indicate powers by enclosing the target in brackets first, e.g. (y)?. 


Gradient operators: In the context of FVM discretization, the gradient operator is used as a 
spatial operator, ignoring temporal dimensions, if any: 








Op 
Ox 
Scalar Gradient: Vp = ae 
Op 
Oz 
O Ou, OUy Ouz 
Ox Ox Ox Ox 
: . ml _ | dO Ou, Oy Ouz 
Vector Gradient: Vu = V ®u= 35 ips iy |e Hie oe Oe 
Oo Ou, OUy Ouz 
Oz Oz Oz Oz 
OUz OU, OUg 
Ug Ox Oy Oz 
_ Oo 0 O)| _ | duy Ouy OJuy 
Outer product: u® V = | uy E- Dy $.| =|32 a oe 
iis Ouz OUz OUz 








. . : _ OUs OUy Ouz 
Divergence: V-u= 52 + By Oe 
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Ge 4 a Cp 





Laplace Operator: Au = V-u=-V (Va@u)= oe 4 om a cs ow 
Pu A. Pu + om 











1.3 Related Work & Contribution 


In the field of CFD and numerical optimization, a wide range of prior knowledge exists. While 
the foundations of fluid mechanics research range back several centuries |And16], the application 
of computational methods to discretize and obtain solutions to the governing equations became 
popular in the 20th century, with the advent of general purpose computing hardware |Sha04J. 
Compared to the history of CFD, the widespread application of adjoint optimization techniques 
to CFD problems is more recent, however first applications still date back to the 1970s (see 
e.g. |Jam03; GP00| for a short overview of the history of adjoint methods in the context of CFD). 
Most researchers apply either a continuous adjoint formulation, or a discrete adjoint formulation 
on the residuals obtained from solutions of the FVM systems |Gil+-03]. Algorithmic Differentiation 
(AD) is concerned with the generation of derivatives of algorithms, given as computer programs 
(for a historical discussion of AD see [GW08]). While the rules of AD can be applied by hand, for 
complex codes a tool driven approach is highly desirable. 

The application of AD to complex CFD tools is often limited to specific numerical kernels, 
tailored to a predefined goal. A full differentiation of code frameworks is seldom achieved, be 
it for lack of applicable tools or too extreme memory requirements imposed by the data flow 





reversal of the adjoint mode. A recent applications of an AD tool driven approach to a complex 
CFD code includes SU2 [Ecol18; ASG16]. Workflows based on source code transformation were 
used in |Zho+18] and [|MHM18}. 

This thesis pursues the approach to initially cover as much as possible of the used CFD 
framework by AD. The advantage of such an approach is twofold: First, an initial differentiated 
version of the problem can be obtained very rapidly, without much analytical insight into the 
underlying problem. Starting from there, possible optimizations can then be identified, applied, 
and evaluated. Second, a full AD implementation gives the flexibility to pursue a wide range of 
optimization tasks, without needing to adapt the underlying CFD framework to each of these 
applications. Starting from a naive black-box implementation of AD, treating the CFD algorithms 
as a general computer program, different tactics are employed to improve performance and lower 





the memory footprint. ‘This is achieved by exploiting prior knowledge, blurring the lines between 
fully discrete and continuous approaches. ‘To the author’s knowledge, this work is the broadest 
application of AD to the general purpose CFD tool OpenFOAM. Other adjoint solvers for 
OpenFOAM exist, in the form of continuous adjoint solvers and implementations of the discrete 
residual adjoint methods, possibly involving finite differences (FD) [OVW07; He+18]. 
OpenFOAM is particularly suited for this approach, due to its split architecture into a 
general CFD framework, (which was fully covered by AD), and individual solvers and utilities 
based on the framework (which were covered by AD as needed). The changes required for the 
introduction of AD into a complex CFD code base are discussed in detail in Section 3.2. Adjoint 
communication patterns ({Sch14]) were implemented, such that the parallel convergence of the 
primal is retained for adjoint calculations (Section 3.5), which is demonstrated on the RWTH 
compute cluster (Section 5.1). The symbolic differentiation of linear solvers (|Gil08], Section 3.4) 
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required the careful inspection of the FVM discretization matrices involved, in order to correctly 
implement the symbolic adjoints of the parallel matrix vector products (Section 3.5.5). 

The application of AD to iterative solvers poses challenges in terms of memory consumption. 
To overcome a memory bottleneck encountered, an existing adjoint vector compression technique 
was implemented ({NL18], Section 3.8.2). This induced a bigger than expected run time penalty. 
To overcome this, a novel optimization to the adjoint vector compression technique was developed, 
implemented, and studied (Section 3.8.3). This allowed to regain most of the performance of the 
unaltered adjoint vector implementation. 

The developed discrete adjoint solvers are verified against adjoints obtained by the continuous 
adjoint method (Section 4.3). The implications of a frozen turbulence assumption versus a full 
differentiation of the turbulence models are studied. The availability of a fully differentiated 
Spalart-Allmaras turbulence model facilitates the calculation of shape sensitivities of an airfoil 
(Section 5.2). In addition to topological and shape sensitivities, a parametric optimization setting 
is presented (Section 4.6), demonstrating the differentiation of a solver directly coupled to a 
mesh generator. A similar parametric approach was chosen in |Aur+16], however differentiating 
an external CAD environment and mesher. 

A novel connection between the FVM mesh formulation and the bipartite (partial) graph 
coloring formulation was developed, allowing to effectively obtain compressed Jacobians. ‘The 
method and resulting colorings are presented in Section 4.5.4. The obtained colors are used to 
efficiently calculate the Jacobians of the FVM residuals using tangent or adjoint mode. A similar 
approach is presented in [He+18], also implemented in OpenFOAM, however in this work the 
authors used different coloring algorithms and FD to determine the Jacobians. 

While many concepts are presented, such that they can be conveniently implemented in 
OpenFOAM, the methods developed and used in this thesis are applicable to a wide variety of 
CFD algorithms and optimization tasks. 

Previous publications of the author, related to the topics of this thesis, include |TN13; TSN15], 
and |TN18]. Parts of Sections 3.2 and 3.3 (black-box differentiation of OpenFOAM and checkpoint- 
ing) were discussed in |TN13]. Parts of Section 3.5 (parallel black-box adjoints using adjoint MPI) 
were discussed in |TSN15]. Furthermore, parts of Sections 3.4, 3.5 (symbolically differentiated 
linear solvers, embedded into SIMPLE algorithm, involving parallelism), and 5.1 (HPC study of 
3D Pitz-Daily case) were previously presented in |TN18]. 
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Figure 1.1: The V-model of product development, also known as Vee-diagram. Modeled 
after [Osb+05]. See also VDI 2206 |GMO3]. 


1.4 Numerical Simulation 


1.4.1 Introduction & Motivation 


Numerical simulations of physical problems have become an integral part of many modern 
design processes [Ott+03]. Simulation is able to augment classical design verification processes, 
relying on physical experiments, with numerical experiments. The repeatability of numerical 
experiments allows the efficient exploration of the design space for many different configurations 
and boundary conditions. Virtual product development allows to move the test and evaluation 
of preliminary designs towards earlier in the development cycle, avoiding costs like expensive 
tooling for pre-production models. Model candidates can be evaluated in a shorter time period, 
allowing to iterate through the stages of development at a faster pace. With correctly chosen 
models, numerical simulations allow to create observations for conditions which might not be 
readily reproducible or observable in a lab environment, such as: 


e reduced or zero gravity |CS93], 





e extreme temperatures or pressures |Anc+97], 





e extreme length scales, e.g. quantum level [HTK12] or astronomical scales [Aba+-03], 
e short (e.g. microseconds) |Hes+08] or long (e.g. centuries) [Cro00] time scales, 

e hazardous or restricted processes, e.g. nuclear explosions [KGG18], 

e or probabilistic quantum level effects [LW90]. 


Industry fields which heavily rely on numerical simulation to reduce product development times 
include the automotive |Tho98], aerospace |[SV16], and defense |NW11] industries. Statistical 
simulation approaches are prevalent in the financial sector |GGO6]. 

Product development, be it physical (mechanical) or virtual (software), is usually an iterative 
process |TN86]. Requirements are identified, incorporated into a design, and implemented. The 
implementation is tested and integrated into the systems context (e.g. a part into a car or 
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a module into a software framework). During testing and integration, more likely than not, 
issues are identified, that feed back into the requirements, starting off another iteration of the 
development process. This iterative design process is often illustrated with the V-model of product 
development. Many different variants of this model exist, one of them is shown in Figure 1.1. 

The incorporation of numerical simulations into product development is often called computer- 
aided engineering (CAE), in distinction, and as a supplement to, the more classical computer-aided 
design (CAD). Numerical simulation can help to reduce the time spent in each iteration of the 
product development cycle. 

In civil engineering, the finite element method (FEM) |Hug12] is the dominant method of 
predicting the stresses in materials and the resulting deformations. In fluid mechanics, the finite 
volume method (FVM) [Pat80]| is widely popular, but alternative methods, including FEM and 
Galerkin |CKS00| methods, are available. 

Multiphysical simulations aim to bridge the divide between different fields of science, e.g. by 
coupling fluid simulation with structural analysis (fluid structure interaction, FSI) |Zim06], or by 
coupling fluid flow and heat transfer (conjugate heat transfer, CHT). 





1.4.2 Computational Fluid Dynamics 


Computational fluid dynamics (CFD) is the application of computational methods to the field of 
fluid mechanics. Early applications date back to the beginnings of the 20th century, where solutions 
were obtained by hand or mechanical computers |Hun98]|, however wide spread application has 
only been realized with the introduction of general purpose computing devices |Wen08]. Today 
CFD methods are an important application for current high performance computing (HPC) 
resources. With the ever increasing performance of current HPC infrastructure, more complex 
simulations (multi-scale, multi-physics) and solution methods, such as direct numerical simulation 
(DNS) |Ors70] have become feasible. This thesis focuses on applications of the FVM method 
(introduced in the next chapter) in the context of CFD applications. However, other discretization 
methods are available, such as finite element methods (FEM) or probabilistic methods. The most 
common application of CFD is the solution of the Navier-Stokes partial differential equations, 
which allow to predict the flow of viscous fluids. For fluids, that exhibit a negligible amount of 
viscosity, the less complex Euler equations can be solved instead. This is particularly useful for 
compressible flows in transonic flow. 

In the context of FVM CFD simulations, a wide variety of commercial, academic, and open- 
source software codes exist. OpenFOAM (Open Source Field Operation and Manipulation) is an 
open-source software suite, which has a strong user base in both industry and academia, due to 
its versatility, good parallel scalability, and lack of licensing costs. Development of OpenFOAM 
started in the early 90s as a research project at Imperial College London |Wel-++98; Jas96|. It was 
first released to the public as a commercial code, called FOAM. Later the code was released as 
open-source under the GNU GPL-v3 |GPL] in 2004, forming OpenFOAM. Development remains 
very active, however development is currently fragmented in three different forks. The rights to 
the OpenFOAM trademark currently belong to ESI Group, which develops and distributes the 
OpenFOAM-plus fork. 

In this thesis the applicability of the discrete adjoint mode of AD to the CFD design optimiza- 
tion process is explored. As a demonstrator for these techniques a discrete adjoint version of 
OpenFOAM-plus, developed by the author, is used. 





1.4 Numerical Simulation 


For numerical optimization, gradient based methods are popular, as they offer better convergence 
compared to gradient free methods, requiring less evaluations of the underlying simulation models. 
The efficient computation of derivatives is its own research field, with different strategies for 
obtaining the derivatives. This will be discussed in detail in later sections. 


1.4.3 Topology Optimization 


As an example for numerical optimization in the context of CFD simulation, we will use the 
topology optimization technique throughout this thesis. 

The concept of topology optimization was first introduced in FEM analysis of mechanical 
stresses |[Ben89]. Topology optimization aims to find an optimal structure, relative to some cost 
function, within a given design space and boundary conditions. The difference between topology 
optimization and a classical shape optimization is that the topology optimization does not start 
from and adapt an initial design, which for a non-global optimization might introduce a strong 
bias toward a specific design, but is allowed to explore the full design domain. For example, a 
truss structure might be optimized to be as light as possible, while still withstanding a set of 
load conditions. 

Topology optimized designs often appear rather organic in shape and, if optimized without 
design constraints, might be hard to manufacture. However, with the increasing sophistication of 
additive manufacturing methods |Nin+15] and advanced CAD systems, highly optimized complex 
structures are increasingly common, e.g. in the aircraft industry [Emm-+11]. An example for 





such a structure, manufactured with additive manufacturing, is given in Figure 1.2. Topology 





optimization methods generally use many optimization parameters, therefore efficient calculation 
methods are needed. Ideally, the computational complexity should be independent of the number 
of parameters. This motivates the usage of adjoint methods, introduced in later sections. 





Figure 1.2: Structural part of an Airbus A350, optimized for weight. Part sintered from metal 
by additive manufacturing. Source: Airbus |Air18]. 
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In the field of structural FEM, different flavors of topology optimization have emerged, such as 
level set methods [SM13]. 

In contrast to the history of applications in FEM, the application of topology optimization 
to CFD problems is comparatively recent |BP03]. Topology optimization immerses a geometry 
into the available design domain, by selectively blocking cells not located inside the geometry 
for the flow, e.g. by some penalization. A big advantage of CFD topology optimization is that 
the same mesh representation can be used for all optimization stages, eliminating the need for 
expensive remeshing or mesh morphing. If needed, the domain can still be refined near the 
topology boundaries. A disadvantage of the naive implementation is that the immersed geometry 
is not separated from the rest of the design space by real walls, to which wall boundary conditions 
could be easily applied. This complicates the application of wall boundary conditions (e.g. heat 
transfer) and the evaluation of turbulent wall functions. Furthermore it reduces the accuracy 
of the solution. If higher physical accuracy near the walls is required, the immersed boundary 
method [Pes02| can be combined with topology optimization |Mit+08]. 

Topology optimization for ducted flows was introduced to OpenFOAM using continuous 
adjoints |OVW07; Oth08]. The introduction of penalty terms to the Navier-Stokes equations, as 
well as the derivation of the continuous adjoint equations are discussed in detail in Section 2.5. 








2 Foundations 


In this chapter the necessary foundations, needed to comprehend the later chapters, will be 
laid. A brief introduction to (computational) fluid dynamics is given, before the discretization 
with FVM is introduced. Algorithmic differentiation and general optimization methods will be 
discussed. A brief introduction to the AD tool dco/c++ will be given, which will be used later on 
to illustrate abstract concepts with code examples. Finally, the discrete adjoint residual approach 
is introduced. Additionally graph coloring techniques are presented, which will later be applied 
to more efficiently evaluate the discrete adjoint residuals. 








2.1 Navier-Stokes Equations 





The Navier-Stokes [Nav23; Sto51| equations are the most prevalent equations in CFD. They 
describe the conservation of momentum and mass for viscous fluids. In the following, the basics of 
general conservation laws are presented. ‘They are subsequently used to derive the Navier-Stokes 
equations for incompressible Newtonian flow. 








2.1.1 Derivation from Conservation Laws 


The fundamental laws of physics dictate, that certain physical quantities in a system remain 
constant, as the system evolves in time. Important conservation laws include the conservation of 
energy, momentum, and mass. 


Reynolds Transport Theorem 


In the context of fluid flow, the conservation laws specify the conservation of physical quantities over 
moving material volumes M, which travel along the fluid. However, for practical computation it is 
desirable to express these laws for control volumes V. Mass can travel over the system boundaries 
through inlets and outlets. An observed mass volume thus leaves the system after a relatively short 
period of time, making material volumes inconvenient. The Reynolds transport theorem |RBM03| 
provides the required connection between a material volume M/ and corresponding control volumes 
V. 

Let W be a quantifiable attribute of a flow (e.g. mass, momentum, energy) and let ~ = ce be 
the intensive value of VW (amount of ~ per unit mass m), that is 


v= | pam. 


For a material volume M, the total change of quantity WV is determined by the change of V in V 
plus the net flow of W into and out of the control volume through its control surface S. Let p be 
the density of the fluid, u € R° the velocity and n € R® the outward facing normal to the control 
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surface S. ‘Then the Reynolds transport theorem states that 


d O 
af eva =f Slovav+ | pvu-n as, 





dt Jas 
or by transforming the surface integral to a volume integral by applying the divergence theorem 
d O 
a [pea =f (Z(ov) +9 (pow) ) av. (2.1) 
M V 


Derivation of Mass Conservation for Incompressible Flow 





The principle of conservation of mass states that, without the presence of mass sources and sinks, 
the mass of fluid in a region M will be conserved, that is it does not change over time: 


= 
dt Im 


The mass conservation for a fixed control volume V can be derived from Equation (2.1). Choosing 
W = m the corresponding intensive quality is yw = 1 and 


Op Z 


For this equation to hold for any control volume V, the integrand has to vanish at each point 
veEQcCR?: 


—+V-pu=0. (2:2) 


For incompressible flows, the density is assumed to be constant throughout the domain in both 
space and time (dp(x,t)/dt = 0). Thus, Equation (2.2) can be simplified to the differential form 


V-u=—0. 


This equation is called mass conservation or mass continuity equation. From the equation directly 
follows, that the velocity field of an incompressible fluid is divergence free. 
Applying the divergence theorem gives the integral form 


[wnas=o. 
S 


which makes it obvious that all flow (mass) that enters a system must leave it again at some 
other point. 








Derivation of Momentum Conservation for Incompressible Flow of Newtonian Fluids 


Newton’s second law states: 
dmu 


dt = DU, 


where u is the velocity of an object, m is the mass, and f are forces acting on the object. From 
this, the momentum conservation equation can be derived by considering the velocity u as the 
conservation variable ~ in Equation (2.1) 


O 
wi [eu av + | pumas =e, 
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where f are the external and internal (viscous) forces acting on the system, replacing the material 
volume integration. ‘The forces on the right hand side will now be expressed with intensive 
quantities, by making the assumption of Newtonian fluids. For Newtonian fluids, it is assumed 
that the shear stress 7 inside the fluid can be expressed by the shear velocity du/dny as 


du 


7 Fan,’ 


where ny is the direction perpendicular to the flow and p is the (dynamic) viscosity. 

From this assumption the following relations for the stress tensor 7’ and the deformation 
tensor D can be derived (see e.g. [BW97]|). They only include intensive quantities and introduce 
the pressure p into the momentum equations. 


2 
T=—(p+5n¥-u) 1+ 2D 


D=-—(Vu+(Vu)'). 


KO | 


With those tensors, the acting forces can be split into internal viscous forces and external forces 


d 
— pu dV + f puu-nds=[ T-nas+ [ pbav. 
dt Jy s s V 


where b are the external body forces per unit mass. By introducing an infinitesimally small 
control volume V, one obtains the differential form of the momentum conservation equations 


O(pu) 
at 





+V-(puu) = V-T+ pb. 


For isothermal and subsonic flows, the assumptions of constant fluid density and viscosity are 
usually valid. With those two assumptions, the derivatives of p and yz vanish in both space and 
time, allowing to simplify the equations into 


0 1 
a t Ue V)u=vWu- Vp tb, (2.3) 


where v = j1/p is the kinematic viscosity. 


Navier-Stokes Equations for Steady Incompressible Flow 


Combining the conservation of momentum and conversation of mass into one system of partial 
differential equations, the governing Navier-Stokes equations for steady (Ou/Ot = 0) incompressible 
laminar flow read as follows: 


1 
(u® V)u=vV7u— ae +b (2.4) 


V-u=0. (2.5) 
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2.1.2 Navier-Stokes Equations for Topology Optimization 


As mentioned in Section 1.4.3, topology optimization in a CFD context can be implemented by 
blocking the flow in specific cells of the discretization, corresponding to regions which should be 
excluded from the design domain. A commonly used approach is to block the flow by introducing 
an artificial resistance for the flow in the momentum Equation (2.4): 


1 
(u@V)u=vV~u—- ol a (2.6) 
V-u=0. 


The new term au in (2.6) implements a momentum sink, which introduces a flow resistance and 
as such allows to penalize regions of cells considered counterproductive for the flow. [llustratively 
the resistance term qu can be interpreted as a porosity. ‘The pressure drop Ap over a porous 
medium of depth Az is commonly modeled by Darcy’s law |Whi86| as Ap/Aw = —(/kK)u, 
with permeability «. Therefore the momentum source is an implementation of Darcy’s law 
with p/K = a. 

The question in which regions to penalize the flow, that is to find an optimal field a, motivates 
the need for (adjoint) sensitivities of a given objective with respect to the (discretized) parameters 
QQ. 

In order to avoid momentum sources, which would accelerate the flow instead of penalizing it, 
the values of a should be constrained such that a > 0. To avoid too stiff discretization matrices 
and to obtain a steady converged state for the field a, the parameter should also be capped below 
a certain maximum value Qmax. 

The constrained optimization problem to find a feasible and optimal field a can be stated as 





minimize J(u(a), p(a),a) 
subject to r(u(a@),p(a),a) =0 
O<a< Omax 3 


where 7 is a scalar or multivariate objective and r are the residuals of the momentum and mass 
conservation equations. 


(a) no optimization (b) added porous medium (c) reconstruction 
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Figure 2.1: Typical topology optimization workflow, from baseline geometry (left) to final 
optimized geometry (right). 
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2.1 Navier-Stokes Equations 


A typical three step topology optimization workflow is depicted in Figure 2.1. Starting from 
an initial configuration of the domain (left), cells are penalized to improve the flow according 
to some objective (middle). To reduce discretization errors and to make the design parametric 
again, the geometry is transformed to a CAD representation, and remeshed. On this geometry 
the final flow state is calculated (right). 





2.1.3 Turbulence Models 


In contrast to laminar flows, turbulent flows are characterized by strong irregularities of all their 
properties. Most importantly they exhibit strong changes of their velocity and pressure in space 
and time |Dur08]. Those irregularities are chaotic in nature and occur on a variety of turbulent 
length scales. ‘Turbulent flows exhibit a high amount of dissipation due to the internal viscous 
shear stresses. 

Broadly speaking, turbulent behavior occurs once the flow has passed a critical Reynolds 
number. The Reynolds number Re is defined as 

Ree 
mM Vy 

where L is a characteristic length of the flow domain (e.g. diameter of a pipe, length of a car), 
and u is the scalar velocity magnitude of the fluid with respect to the obstacle. ‘The somewhat 
arbitrary choice of characteristic length L makes the Reynolds number only a rough indicator for 
the flow properties of the flow. A pipe flow with Reynolds number Rep > 4000 is considered to 
be fully turbulent, while a flow with Rep < 2000 is considered laminar. Flows in the region in 
between 2000 and 4000 are in a transition area, and are consequently called transition flows. 

The analysis of turbulent flows is a very relevant field, as most fluid flows occurring in nature 
or technical applications exhibit some degree of turbulence. 








The most intuitive, yet very expensive, way of calculating solutions to turbulent flows is to 
utilize direct numerical simulation (DNS). With DNS the Navier-Stokes equations (2.5) are 
solved without any further modeling of the turbulent nature of the problem. For DNS to give 
reasonable results, all length scales of the physical problem, from the smallest turbulent length 
scales to the macroscopic scale, have to be resolved directly, both in space and time, by the 
chosen discretization. This resolution requirement makes DNS challenging and even on current 
HPC hardware it is rarely used for complex geometry. One of its main uses is to derive and 
understand turbulence models and to simulate flows which are very hard to model otherwise, e.g. 
the laminar/turbulent transition region. With the transition of HPC to exascale computing, DNS 
methods will likely become more prevalent, as the lower algorithmic complexity of DNS should 
improve parallel scalability |Che+09]. 

The minimization of the computational requirements, while still reasonably capturing the 
physical influence of turbulence, has been a very active field of research over the last decades (for 
a historical overview see e.g. |Wil+98]). In the following sections a brief introduction of the most 
common one- and two-equation turbulence models for steady flows will be given. 





One Equation Model: Spalart-Allmaras 


A popular one equation turbulence model is the Spalart-Allmaras model |SA92]. It is popular due 
to its comparatively modest computational effort, involving the solution of only one additional 


13 


2 Foundations 





wo 
@ a a WY — Velocity u=t+u' 
S Kf . i] — Mean velocity u 
= V — Turbulent velocity u’ 
a) - 
co ss] 
50 MI 

| 
iS " ’ ry 
8 
SO TRPEI TITNY YTN HTP VP TPYN 


0 
Time (s) 


Figure 2.2: Visualization of the Reynolds decomposition. Function wu is decomposed into a 
smooth (filtered) part « and chaotic part u’. 


PDE. This model is particularly suitable for aerospace and turbomachinery applications, as it 
was developed for aerodynamic flows. 

It models the transport of the kinematic eddy turbulent viscosity v as a convection-diffusion 
equation: 


ae 5) 4. C2 aIIWa - C 7 
07) _y. (pDav) + pl Vall? + Cn.pS0 (1 — fr) - (Ga i-= fu) pm + Sp. 





The equation involves several empirically determined constants C’,a,k, as well as other closure 
equations. For these definitions, we defer to the literature [SA92; AJ12]. From the complexity of 
this governing equation it can be figured, that the derivation of a continuous adjoint turbulent 
model (see Section 2.5), while possible |[Zym-+09], is laborious and the implementation error prone. 





Two Equation Models: Reynolds-Averaged Navier-Stokes Equations 


The Reynolds-averaged Navier-Stokes equations (RANS) are time averaged equations of motion. 
They are used to transform a flow field, that is transient only at the turbulent length scale to a 
steady flow where the amount of turbulence is modeled by an additional set of variables. 

The velocity of a flow is split into a mean and a fluctuating part using Reynolds decomposition, 


u=u+u, 

where U is the mean (time averaged) velocity and u’ the fluctuating turbulent part. (In the 
literature the mean velocity is usually denoted with u, to avoid collision with the adjoint notation 
we use the differing notation u.) Figure 2.2 illustrates the superposition of a (synthetic) continuous 
differentiable function wu, with randomized values from a standard distribution of lower length 
scale u’. This example mimics the characteristics of the Reynolds decomposition. 

The effect of the turbulence can be modeled by an additional transport equation. The RANS 
momentum equation for steady incompressible flow can be given as follows (with 7 = {0, 1, 2}, 
Einstein summation over the indices 7, and x = |[%0, 21, £2| = |x, y, z]): 


ihe _ Op a O ( OU; - wu | 


Ox; (Ox; Ox; “Ox; 
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2.1 Navier-Stokes Equations 


For the solution of the RANS equations, additional closure terms are needed for the u;‘u,’ 
term, which is commonly referred to as Reynolds stress tensor. It is the closure of this term, 
in which the different RANS turbulence models differ from one another. Using the Boussinesq 
approximation [Pop00], the Reynolds stress tensor can be related to a turbulent eddy viscosity 1. 
In the following, two common closure models are presented, which enable to calculate 4, namely 
the k-e and the k-w models. 





k-e model: The k-e equations model the transport of turbulent energy k and turbulent dissipation 
(into heat) rate € [LS83}: 


Ok  O(ku;) 0 | 





| Si 21, big LG; = 





Ot Ox; 7 Ox; on OX; 

Oe  O(Eu;) O |™% Oe E e 
7 = =— |——— 6 24. Eig — € — Cae 
Ot Ox; Ox; E a am k a y : C2 k 


The eddy viscosity needed in the equations, as well as for the closure of the Reynolds stress tensor 
is calculated as 4% = C,k?/e. 

The k-e model is suited for the calculation of sheer free layer flows [Bar+97| and flows with 
low pressure gradients. However, it does not perform well for flows that exhibit large adverse 
pressure gradients |Wil+98]. 


k-w model: The standard k-w model |Wil+98], with default parameters reads as: 


Ok Ok du 9 8 f(A) Ok 
Ot “3 Ox; ~ 1 Ox; 100 Ox; 2 7 Ox; 
Ow Ow 5w Ou; 3 5. O ( 1 = 





a 482; 9k “az, 40° * Ba; 
The eddy viscosity needed in the equations, as well as for the closure of the Reynolds stress tensor 
is calculated as 4% = k/w. The k-w model is best suited for flows with wall effects. Extended 
versions like k-w SST (shear stress transport) [Men93] exist, which switch between k-w behavior 
near walls and k-e behavior in the free stream. 


Other Models 


If one desires to capture the transient effects of the turbulence or calculates a case for which the 
mentioned RANS methods are not well suited, other more computationally expensive methods 
are available. One particularly useful method is the large eddy simulation (LES) model. The 
LES model is a filtering approach, which removes the information of the flow field on the lowest 
turbulent length scale frequencies and models the effect of that information on the flow field by 
various approaches. A flow field filtered in such a way allows to calculate on a coarser mesh, 
compared to a DNS case. LES simulations are inherently transient, they thus consume a lot of 
computational resources. ‘The requirements are amplified when coupled with the calculation of 
the adjoint, as potentially very long iteration histories need to be reversed. ‘The detached eddy 
simulation (DES) is a combination of the RANS and LES approaches, that switches between a 
RANS model in regions of the flow with very small turbulent length scales (especially near walls), 
which can not be covered by LES without a prohibitively fine mesh, and a LES model in regions 
where the spatial resolution of the mesh allows to resolve the turbulence by LES. 
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2.1.4 Computational Mesh 


The calculation of solutions to the governing equations using FVM requires the discretization 
of the domain with finitely small control domains. ‘They need to cover the full computational 
domain, without overlapping, follow the shape of the boundary, and fulfill certain quality criteria. 
This domain discretization is commonly called mesh or grid (for structured meshes). For general 
CFD applications, meshes can be categorized into one of two classes: Structured or unstructured. 
Structured meshes allow the efficient determination of the cell neighborhood of a given cell without 
costly memory lookups. For example, in a 2D mesh, where the cells are numbered in a matrix 
like (row, column) fashion, the neighborhood of cell (away from any boundaries) (i, 7) can be 
obtained by (¢ — 1,7), (¢+1,7), (¢,7 — 1), (¢,7 +1). In general this leads to good cache locality of 
the code implementing the discretization, and sparse (banded) discretization matrices with known 
sparsity pattern. Structured meshes do not necessarily have to be equidistant or orthogonal 
(see e.g. Figure 2.3). However, the number of neighbors and connectivity for each cell is fixed, 
severely limiting the possibility to refine the mesh and the adaptability to complex geometries. 
This downside can be mitigated by allowing hanging nodes, resulting in a mesh which retains the 
good numerical characteristics of a structured mesh, while still allowing to refine near important 
geometry features. 

Unstructured meshes allow the decomposition of the computational domain into arbitrary 
polyhedral subvolumes. ‘Thus, the number of neighbors of a cell is not fixed, and the neighborhood 
can not be trivially constructed. ‘The mesh connectivity must be stored in a suitable data structure 
and looked up when the neighborhood needs to be constructed. This leads to more memory 
access and a non cache-local memory access pattern. 

The convergence speed and solution quality of many numerical solution algorithms depends 
on the mesh quality. Mesh quality can be characterized by different metrics, the importance of 
which varies between different solution algorithms. 














Cell aspect ratio: Ratio between the biggest and smallest area of a cells boundary box (best if 
ratio equals one). 





Cell non-orthogonality: Angle between the vector connecting two cells and the face normal of 
the face connecting both cells (best if equals zero). Compare to vector Sy in Figure 2.4(a). 


Cell non-conjunctionality: Distance from the intersection of the connection of two cell centers 
with the face connecting both cells (f") to the face center (f.) (best if equals zero). Compare 
to Figure 2.4(b). 

























































































































































































































































































































































































































































































































































































Figure 2.3: Three structured meshes. Structured equidistant mesh (left), structured mesh 
refined around region of interest (middle), structured non-Cartesian mesh (right). 
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2.2 Finite Volume Methods 


(a) Mesh non-orthogonality (b) Mesh non-conjunctionality 


Figure 2.4: Visualization of mesh non-orthogonality (left) and non-conjunctionality (right). 


2.2 Finite Volume Methods 


The finite volume method (FVM) was developed in the 1970s and 1980s as a way to discretize and 
solve partial differential equations (PDEs). The FVM is popular in CFD, as it allows to discretize 
directly on the physical domain without transforming to a computational domain first [|MMD-+16| 
(like the test function space required by FEM). It is able to discretize complex domains, granted 
that enough volumes are available to capture the features of the domain. It is related to the finite 
difference (FD) and finite element methods (FEM). For a comparison between FVM, FEM, and 
FD see e.g. [EGHOO]. 

The FVM is based on the concept of balance equations on discrete control volumes. In a first 
step the PDEs, given in differential form, are integrated and transformed into balance equations 
over the discrete control volumes. The resulting volume and surface integrals are transformed into 
discrete expressions by approximating them with numeric quadratures. In a second approximation 
step the quantities from the cell centroids are interpolated onto the faces and are numerically 
integrated over the face area. The surface integrals describe the flux of conservation quantity in 
and out of the volumes through their respective faces. 

The formulation of fluxes through the cell faces makes the method locally conservative (meaning 
that the flux leaving a volume is equal to the flux entering its adjacent volume). This makes the 
method well suited for problems where fluxes are of importance, such as fluid mechanics. It can 
be applied to both structured and unstructured meshes in arbitrary 2D and 3D domains. 


2.2.1 FVM on Scalar Transport Equation 


We will motivate the FVM with the discretization of the general convection diffusion equation. It 
models the transport of a scalar physical quantity (e.g. temperature) within a fluid field, which 
travels with given velocity u. ‘The quantity is convected downstream by the velocity field. In 
addition to convection, the quantity spreads in all directions, due to diffusivity. 

The transport of scalar quantity ~ in a fluid flow can be modeled by the following partial 
differential equation [|MMD-+16]: 


O 
“pe Ve(pud) = V-UVy)t Qh, 
SH ee 


transient term convective term diffusive term source term 


ig 


2 Foundations 


where p and u are the density and velocity of the fluid field transporting w, and p is the diffusivity 
constant for yw. The equation is closely related to the Navier-Stokes momentum equation. ‘The 
momentum can be stated as the transport equation, where w = u. The non-linearity introduced 
in the convection complicates the formulation of the FVM, thus for now we will treat ~ and u as 
separate entities. Starting from the PDE formulation, the construction of the FVM follows along 
the lines of the derivation of the governing equation, but backwards. 

Integrating the differential steady state form (O(py)/Ot = 0) over a (finite) volume Vo C 0 C R?® 
gives the following integral form: 


V-(pud)dV= | Ve (uVid)dV4+ | QY dv. (2.7) 
Vo Vo Ve 


Transforming the volume integrals for convection and diffusion to surface integrals by applying 
the divergence theorem: 





f (pu) -ndS = / (uVw)-ndS+ / QYdV. 

AVo AVo Vo 

This states, that, in absence of transient effects, the amount of quantity entering and exiting each 
finite volume Vc driven through the convective and diffusive fluxes, has to match the amount 
created /removed by the source term. From this integral formulation over cells and faces, we aim 
to introduce approximations, which transform the integrals on each volume Vgo into sums. The 
summands of the individual elements can later be assembled into a global linear system, which 
allows to solve for the field of the unknown quantity w. 

The surface integrals of the fluxes are approximated by replacing them with numerical inte- 
erations on the faces. In the discretized domain the values of the quantity are only explicitly 
defined at the cell centers. The values thus need to be interpolated from the cell center to the 
cell faces. ‘The most common option is to choose one integration point at the face center and 
multiplying it with the face area, yielding a second order accurate approximation (the integral of 
linear functions is evaluated exactly by this integration). We define the total flux J as the sum of 
the convective and diffusive flux as 








J” = puy —pVw. 


The integration over the flux through face f can then be approximated by 


f3-aS Ips; 
f 





where J is the flux calculated at the midpoint of face f and sf is the face area vector. 

The per face calculation of the fluxes from values at cell centers of the adjacent cells makes the 
cell centered finite volume scheme inherently conservative. ‘The flux leaving a control domain 
over one face exactly equals the flux entering the adjacent control domain (with opposite sign). 

For orthogonal meshes, the vector connecting two cell centers does pass through the center 
of the face shared by both cells. In this case the value of yw at the face center can be linearly 
interpolated from the neighboring cells as 





Tat) |; || 
~(1l- . 
PF ( fay) °° * aq? 
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Figure 2.5: ‘Two orthogonal cells, sharing one face. Cell centers are connected by d = d; + do. 


where d is the vector connecting the midpoint of cell C’ to the face center qf and d the vector 
connecting qc and qn (see Figure 2.5). 
The gradient in direction of the normal n = d2/||d2|| can be approximated by the difference 
wn — ¥C 
|d| 
Thus, the flux integral of the face can be evaluated by a linear expression, depending only on the 
quantities of w at the cell centres of cell C’ and its neighbor cell N: 





J¢-S¢ =acvc +anyn . 





For non-orthogonal meshes, the vector connecting two cell centers does not necessarily pass 
through the center of the face shared by both cells (see Figure 2.6). If the intersection point 
is used instead of the actual face centerpoint, the numerical interpolation to the face loses its 
second order accuracy. Furthermore, the face normal direction, along which the gradient has to 
be evaluated, does not align with the vector connecting the cell centers anymore. The flux can 
be corrected to better match the flux which would be obtained by using the correct values at 
the midpoint. However, as this correction contains non-linear terms it can not be added to the 
system matrix but instead has to be treated as a source term for the linear equation system. 


wr =— acwlc +anvn + f(vo, vn). 


Discretizing the remaining volume integral of the source term from (2.7) by a numerical integration 
over the cell 


Q’ dV & Qh Vo, 
Vo 


we can express the balance equation at each cell by the expression 


acvo+ SY) arbp=be, 


FXNB(C) 


where F' ~ NB(C) denotes all cells in the neighborhood of C’. The source term, as well as the 
non-linear parts of the non-orthogonal corrections, are accumulated into the right-hand side bc. 
From those balance equations one can lump together all cells into a linear equation system: 


Ayw = by, 


19 


2 Foundations 


Flow direcvicn. 











Figure 2.6: Cell centered finite volume. Flux is calculated through faces. Note the distance 
between face midpoint and the intersection between face and cell center vector due to non- 
orthogonality. 


where the matrix coefficients ac constitute the matrix diagonal Ago and the coefficients apr 
populate the off diagonal elements Aro and Agr. The matrix can be solved to obtain the desired 
quantities w. The linear equation system needs to be assembled and solved multiple times, if 
non-linear effects are included in either the right-hand side (due to the non-orthogonal correction) 
or the assembly of the matrix (i.e. the matrix entries depend on w, which is commonly the case 
when discretizing the momentum equation, where the velocity is both the desired quantity and 
driving the convection). 

The fluxes of each cell only directly depend on the values at the cell itself and the directly 
neighboring cells. ‘Thus, the matrix Ay, will in general be sparse. Figure 2.7 shows the sparsity 
pattern resulting from the discretization of the momentum equations for the motorbike tutorial 
case of OpenFOAM. 

This case generates a ng = 352570 cell unstructured mesh with nr = 1054817 internal faces 
leading to a 352570 X 352 570 sparse matrix with ny, = n¢c+2-nr = 2460120 non-zero elements. 
Straight from the mesher the bandwidth of the matrix is 349 965 (left in the figure), a renumbering 
with the Cuthill-McKee algorithm |CM69] reduces the bandwidth to 20875 (right in the figure). 
A reduced matrix bandwidth can lead to improved linear solver performance |Maf14|. For example, 
the incomplete LU factorization |CV97]|, utilized as a preconditioner in many iterative solvers, 
benefits from the lower fill-in generated by a mesh reordered with Cuthill-McKee. 











2.2.2 Common FVM Boundary Conditions 


In the FVM, fluxes over the domain boundaries are defined by boundary conditions defined on the 
boundary faces. A variety of boundary conditions exist to model diverse physical behavior. Here 
only the two most common choices are presented, namely the Dirichlet and Neumann boundary 
conditions. The former directly specifies the value of a transport quantity at the boundary face, 
the latter prescribes a flux. 
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Figure 2.7: Sparse FVM discretization matrix of the momentum equations for a large unstruc- 
tured mesh (motorbike tutorial case of OpenFOAM). Left in original node order from the 
snappyHexMesh mesher, right ordered with the Cuthill-McKee algorithm. 


Dirichlet Boundary Condition 


For a scalar quantity w, which is to be convected, e.g. at an inlet, the quantity on the boundary 
is set explicitly to a (possibly time and space dependent) specified value: 


Wp — Whiz - 


n A representation of a cell stencil in a structured Cartesian mesh, with a Dirichlet boundary 
condition applied to the west face, is shown in Figure 2.8. ‘The flux through the boundary face 
can be explicitly calculated with the specified value as 


dh = Mp - 











The flux only depends on the specified value and not on the cell central value wc. This corresponds 
to an upwind interpolation of yy, from a virtual cell center qy. 








Figure 2.8: Finite volume stencil around a cell C’ with a boundary face to the west. Flux wy is 
specified on the boundary face and enters the FVM discretization of the cell. 
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Examples for Dirichlet boundary conditions include: 
e fixed velocity profile at an inlet, e.g. u = Ujn - Np, 
e velocity u = (0,0,0) at a no-slip wall boundary, 


e and fixed pressure at outlet. 


In OpenFOAM the Dirichlet boundary condition is specified by the fixedValue keyword, as 
shown in Listing 2.1 for a constant inlet velocity of 1 m/s. 


dimensions Ber i il 96) eee eile 
boundaryField{ 
inlet{ 
type fixedValue; 
value uniform (1 0 0); 
} 
} 


Listing 2.1: Fixed velocity of 1 m/s at patch inlet, specified in case configuration file 0/U. 


Neumann Boundary Condition 
The Neumann boundary condition fixes the flux through a face to a specified value: 


bb = qlSoll ; 


where qy is the flux per unit-area and ||sp|| is the face area of the boundary face. The prescription 
of the face flux is equivalent to fixing the gradient in face normal direction Vy» - ny to a fixed 
value. As the mass flux m is considered to be constant, the quantity y~, must change accordingly 
to realize the desired flux. 

Common applications of the Neumann boundary condition are: 





e zero gradient pressure condition at an inlet Vp- nz, = 0, 
e zero gradient velocity condition at an outlet Vu- ny, = 0, 


e and symmetry plane. 


In OpenFOAM the Neumann boundary condition is specified by the fixedGradient keyword. 
The by far most often used application of the Neumann boundary condition is to prescribe a zero 
gradient condition, which can also be set by using the zeroGradient keyword. ‘This is illustrated 
for a zero gradient pressure condition on the inlet and domain walls in Listing 2.2 


dimensions [Oo 2 -230 0 0 O01; 
boundaryField{ 
inlet{ type zeroGradient; } 
walls{ 
type fixedGradient ; 
value 0; 
i 
i 


Listing 2.2: Zero gradient pressure condition at patch inlet, specified in configuration file O/p. 
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2.3 Semi-Implicit Solution Algorithms 


The SIMPLE (Semi-Implicit Method for Pressure Linked Equations) Algorithm |PS72] is one 
example for a class of solvers for systems of nonlinear partial differential equations (PDEs). The 
algorithm solves the steady incompressible Navier-Stokes equations by linearizing the equations, 
and discretizing them according to the FVM. ‘The linearization of the convection term makes 
an outer iteration loop necessary. The embedding of linear equation system solvers into a more 
general outer iteration loop is a common occurrence in CFD and more general simulation codes 
which involve nonlinear (differential) equations [Nau+15]. 

The SIMPLE Algorithm decouples the momentum equations from the mass conservation 
equations. ‘his allows to build and solve the linear equation systems for velocity and pressure 
independently, making the resulting linear systems considerably smaller and easier to solve. ‘The 
Navier-Stokes equations can also be solved fully coupled which makes the inner iterations more 
expensive but considerably speeds up the convergence of the outer iteration. 

For the solution of the momentum equation, the pressure is assumed to be known and accordingly 
only enters the equations on the right hand side. The momentum equations are solved component 
wise, giving a velocity field U* = (ug,...,U;,,-1) which fulfills them. For the velocity components 
outside of the implicit direction, the newest solutions from previous SIMPLE iterations are used, 
the coupling between the spatial directions is only introduced during the correction steps, 


*,2+1 __ L 
Ay, U, ee b,,, (U’) 
Au, Ue ww bu, (U") ’ 


where the right hand side vector by is a function of the velocities in the previous iteration 
step. At the beginning of the algorithm, the pressure, driving the velocity field, is only a guess. 
The velocities will in general not fulfill the (discretized) mass conservation equation. Therefore 
additional corrections are required. 

When solving the momentum and mass conversation equations separately, there is no equation 
which allows to solve for the pressure field p directly, as the momentum equation is already used 
to uniquely determine the velocities, while the mass conversation equation only directly depends 
on the mass fluxes and hence the velocities. ‘To close the equations, two correction terms for the 
pressure and velocities are defined. If the corrections can be related to one another, they can be 
used to determine a new pressure field. 

A velocity correction is wanted which corrects the velocity field, such that it fulfills the mass 
conversation at every cell: 





U=U+U'" 
where U* are the velocities fulfilling the momentum equations and U’ are the corrections needed 
to obtain velocities u consistent with the mass conversation equation. 

We assume that the velocity corrections can be uniquely determined from a separate pressure 

correction 

p=p'+p, 
where p* is the current guess for the pressure, and p’ is the correction required such that the 
pressures p drive the velocities to fulfill the mass conservation equation. 
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Tying the velocity corrections to the pressure corrections allows to assemble a linear equation 
for the pressure corrections 
/ 
App =b,, 


which can be solved implicitly. ‘The right hand side vector b, bundles the explicit contribution of 
the pressure correction equation. The obtained pressure corrections p’ can then be used to correct 
the pressures and also to explicitly correct the velocities to better solve the mass conservation 
equation (the latter correction is optional, but improves convergence). The alternating solution of 
the momentum and pressure correction equations is repeated until the flow field has converged. 

In the discretization of the governing equations OpenFOAM introduces the scalar quantity ¢, 
that describes the mass flux of fluid through a cell face (the convective part of the total flux J 
introduced in Section 2.2.1). The mass flux is required in the discretization of the pressure 
correction equation. It is defined as: 


o=pA(u-n)=p(u-s). 


For incompressible problems, the normalization of the Navier-Stokes equations with p leads 
to a normalized mass face flux @ = u-s. As a face flux, it is defined on the cell faces, as 
opposed to the cell centered quantities like velocity and pressure, giving the discrete mass flux 
field @ € R”*. Intuitively the mass fluxes could be interpolated from the cell centered velocities 
onto the faces, however this introduces numerical issues, commonly referred to as checkerboarding. 
Instead the following iterative procedure is chosen to update @¢ in each iteration of the SIMPLE 


algorithm |RC83]: 
H 1 
o= (A2 - “vp °S, 
Up Ap 





where H(u) is the product of the off-diagonal coefficients of a specific cell in the discretization of 
the momentum equations (see also Section 3.7.3) and a,» is the diagonal coefficient. 
Summarizing, the SIMPLE Algorithm consists of the following steps: 





1. Discretize the momentum equations and assemble linear equation system; 
2. Solve discretized momentum equations to obtain intermediate velocity field U: 


3. Compute the uncorrected mass fluxes @ at the cell faces; 





4. Discretize the pressure correction equation and assemble linear equation system; 
5. Solve discretized pressure correction equation to obtain pressure correction field p’; 


6. Calculate new pressure field p from p’, if desired apply under-relaxation: 
pt! =p'+a(p—p’), with 0<a<1; 


7. Correct the mass fluxes @ at the cell faces with the calculated pressure corrections; 
8. Correct velocities U to fulfill mass conservation, yielding U 
9. If desired, apply under-relaxation: U’T! = U' + a(U — U’), with 0 <a <1; 


10. Repeat steps 1—9 until convergence of pressure and velocity fields is obtained or maximum 
number of iterations has been exceeded. 
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The SIMPLE algorithm is popular due to its rather straightforward implementation. For better 
convergence with less under-relaxation, the improved SIMPLEC algorithm can be implemented. 
Due to the decoupling of the PDEs, both algorithms produce linear equation systems which 
are significantly smaller and less stiff than a fully coupled system. Due to the steady growth 
of computing capabilities, a fully coupled approach becomes increasingly feasible for practical 
applications. Such solvers are available in the FOAM-extend project |Jas+18]. All applications of 
AD, shown on the SIMPLE algorithm in the later sections, are feasible for those types of solvers 
as well [STN]. 


2.4 Mesh & Field Conventions 


2.4.1 Mesh Topology 


OpenFOAM uses the FVM approach, as outlined in Section 2.2, to discretize ODEs. The 
computational domain is discretized into finitely small volumes, also called cells. We define the 
following terminology to construct the cells from basic entities. 





e Let 2 be the computational domain 2 C R? and T = 00 the boundary of that domain. 


e Let Q be a set of distinct points Q = {q € (QUT )} (the obvious symbol choice of p is 
already taken by the pressure). 


e Let E be a set of edges, each connecting two points, E = {(qi,q;) | qi,aj € Q,4 #7}. 


e Let F be a set of faces. A face is made up of an edge cycle, containing at least three edges. 
The cycle is defined as a path ((qg, G1), (G1; G2);---5(Gn>Q9)), where the start and endpoint 
of the path are identical and each other point is unique. All points q; of a face lie on a 
common plane in 3D space, that is the face is planar. 


e Let Fr be the subset of F' containing all boundary faces. A boundary face is a face of which 
all points indexed by its edges lie in I’. 





Let Fp be a set of subsets of faces called patches. Fp = {F'p; C Fr}. Boundary conditions 
can be applied per patch. Thus the main use for patches is to group boundary faces, to 
which a boundary condition should be applied. 


e Let Fe be the set of interior faces F’ \ Fp. An interior face includes at least one point which 
lies in (2. 


e Let C be a set of cells. A cell c € C' is a space c C (2 which is enclosed by at least four 
faces without any gaps. Every edge of a cells face is shared by another face of that cell and 
potentially also by faces of other cells. 





e Let the number of cells |C| be denoted as nc, the number of interior faces |Fe:| as nr, and 
the number of boundary faces |Fp| as np.. 


Definition 1 (Volume centerpoint). 
We define the centerpoint qo, € R° of a cellc € C as the point (centroid) which coincides with 
the arithmetic mean of all points within the volume. For a formal definition see e.g. [Cox+69]. 
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Faces 


Cells 


Face Normals 


Points A 


Figure 2.9: Unit cube meshed with 64 hexahedral elements. Some cells are made translucent to 
show the interior cells and faces. Three instances of points, edges, faces, face normals, patches, 
and cells are marked. 


Definition 2 (Face centerpoint). 
Similarly we define the centerpoint qo Ss R? of a face f € F as the point (centroid) which coincides 
with the arithmetic mean of all points on the face. For a formal definition see e.g. [Cox+69]. 


In practice the face centroid of complex shapes is computed by decomposing the face into triangles, 
calculating the areas and centroids of each triangle, and calculating the final centroid from the 
triangle centroids weighted by their area. Assuming uniform density throughout the domain, the 
face/volume centerpoints coincide with the center of mass. 


Definition 3 (Face normal vector). 

The face normal vector nf € R° is defined as the vector, that is perpendicular to the plane 
containing all points of the planar face f € F. By definition it points outside of the domain 
enclosed by the face, and is normalized to length one, that ts ||n¢|| = 1. 


Definition 4 (Face area vector). 
The face area vector sf € R® is defined as the product of the face normal vector with the face 
area Ax: Sf= Arne, that 1s ls || = Ae 





The face area vector s can be used to efficiently calculate fluxes through a cell face. 

Figure 2.9 demonstrates a very simple structured mesh of the unit cube. The points are spaced 
equidistantly in the Cartesian directions at a distance of ;. Consequently the mesh consists 
of 5° = 125 points. The points are connected by 300 edges to form 240 faces. Of those faces, 
4-4-6 = 96 are boundary faces, that can be naturally assigned to six boundary patches coinciding 
with the sides of the cube. Each face can only be part of one patch, however it can be assigned to 
an arbitrary patch, depending on the required boundary conditions (which are set on a per-patch 
level). Finally, the faces form 4° = 64 hexahedral (cube) cells. 
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2.4.2 OpenFOAM Field Conventions 


Table 2.1 lists the OpenFOAM fields corresponding to the most important physical quantities for 
flows. The correlation between the notation for physical quantity and the corresponding discretized 
field is straightforward for all fields, except the velocity. As the velocity u € R° is already a vector 
quantity, we define the velocity field as U = (uo,x, Uo,y, 0,25 ++ +5 Uno—1,as Ung—Lys Ung—1z) € Ree, 
If the field of an individual velocity dimension is required, we denote it as U;, U,, U, € R”"° 
respectively. This notation allows to discern between physical quantity and the discretized field, 
and is also consistent with the OpenFOAM notation for the velocity field, which is U. 





Table 2.1: Relation of physical quantities and their corresponding OpenFOAM fields for com- 
pressible and incompressible flows. 


Physical Quantity Symbol Field ee Data type Y Senne sae ae 
Velocity u U U volVectorField m/s m/s 
Pressure p p Pp volScalarField m7? /s? kg m/s? 
Mass flux o) 0) phi surfaceScalarField m°/s kg/s 
Turb. kinetic energy k; k k volScalarField m? /s? m? /s? 
Turb. dissipation rate E é epsilon volScalarField m?/s° m7? /s* 


For incompressible flows, OpenFOAM scales the pressure with the inverse of the constant density 
p, tmaking the density disappear from the momentum equations (2.3). The unit of this scaled 
pressure, referred to as kinematic pressure, is consequently m*/s? instead of the more common 
kgm/s*. Also the face flux is calculated as u-s for the incompressible case while for the 
compressible case p(u-s) is used. 

Physical quantities in OpenFOAM have to be defined in a consistent unit system and arithmetic 
compatibility of different fields is checked at run-time. While different unit systems, such as 
imperial units are possible, SI units are most commonly used. To enable the run time check of 
units, the dimensions of a field has to be specified in its input dictionary by a vector specifying 
the cardinality of the individual units in a specific order. ‘The order of the unit specification is 
shown in ‘Table 2.2. Listing 2.3 gives an example for a velocity field with units length per time. 








FoamFile{type volVectorField;} 
dimensions [OWA -1 8 0 0 O]; 
internalField uniform (1 0 0); 


Listing 2.3: Definition of a field storing vectors with unit length over time, i.e. velocities. 
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Table 2.2: Order of units in the OpenFOAM unit system and corresponding SI unit |OF18]. 


No. Property SI unit 
1 Mass Kilogram (kg) 
2 Length Metre (m) 
3 Time Second (s) 
4 Temperature Kelvin (K) 
5 Quantity Mole (mol) 
6 Current Ampere (A) 
7 Luminous intensity Candela (cd) 


2.5 Continuous Adjoints 


In the context of solving PDEs, the application of AD leads to a discretize first — differentiate 
later approach. That is the governing (primal) equations are first transformed from a continuous 
problem to a discrete one (defined on finitely small control volumes), optionally linearized and 
then solved. The derivatives of the governing equations are obtained by applying AD to the 
implementation of the discretization process. AD can be introduced at different levels of the 
discretization process, ranging from differentiating the whole discretization and solution process 
(discussed in Section 3.2.2) to only differentiating the calculation of residuals (see Section 4.5) 
and supplying additional analytical insight to obtain the full derivatives |Lot16]. We call this the 
discrete adjoint approach. 

In contrast, the continuous adjoint approach yields a differentiate first — discretize later 
setting. That is from the primal PDEs a corresponding set of adjoint PDEs, along with adjoint 
boundary conditions, is derived symbolically (usually by variational calculus). The resulting 
adjoint PDEs are discretized and solved separately from the primal equations (however primal 
variables may appear in the adjoint equation, resulting in a coupling between primal and adjoint 
equations). Therefore, the discretization of the adjoint equations can be tailored to the physical 
properties of the adjoint, e.g. by employing upwinding of the adjoint convection equation. In the 
context of the adjoint Navier-Stokes equations, the adjoint convection direction is opposed to the 
primal (different sign, see derivation in following sections). A visual comparison of the different 
approaches to obtain the final sensitivities is given in Figure 2.10. 

A drawback of the continuous adjoint method is that the obtained derivatives are not necessarily 
consistent to the primal, as implemented. For some PDEs, the adjoint equations are ill-conditioned, 
leading to bad convergence of the solution algorithms. Also the derivation process of the adjoint 
equations is complex and error prone. For complex, e.g. turbulence, equations, a closed symbolic 
derivation might not be possible at all [Car+10] without changes to the primal equations or 
boundary conditions |Kav-+15]. 

While this thesis is focused on obtaining derivatives using AD, we will briefly introduce the 
continuous adjoint equations, specifically to obtain topology sensitivities. We will use these 
methods to verify the AD implementation in later sections. This introduction is based on the 
derivations in [Oth08]. 

The solution of the equations listed above, including topology optimization using fixed stepsize 
steepest descent, is implemented in the standard OpenFOAM solver adjointShapeOptimization- 
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problem problem problem 


primal adjoint adjoint 


solution solution solution 


Figure 2.10: Comparison of solution procedure for the discrete and continuous adjoint. ‘The 
continuous adjoint differentiates on the differential equation level and then discretizes and 
solves the obtained adjoint equations. In contrast the discrete adjoint discretizes the primal 
equations first and then obtains the adjoints by applying AD to its implementation. Solid 
arrows connect the building blocks of the continuous adjoint, dashed lines the ones of the 
discrete adjoint. 


Foam |OVWO7|. A comparison between the continuous and a corresponding discrete adjoint solver 
can be found in Section 4.3.3. 


2.5.1 Derivation of the Topological Sensitivity 


An optimization problem in CFD can be stated as the task to minimize an objective J, e.g. 
pressure loss in a system, under the constraint that the physical laws of fluid flow are fulfilled. 
Introducing the residual vector r = (71,172,173, 74) as the residual of the Navier-Stokes momentum 
and mass conservation equations, we can state the (not yet discretized) optimization problem as: 
minimize /(a,u,p) 
7 (2.8) 
subject to r(a,u,p) =0. 


In residual form and neglecting external body forces the Navier-Stokes equations read as 


(r1,72,73)) = (u@® V)ut+ Vp—vV7u+au 
TP = —V-u. 
Constraint optimization problems can be reformulated into algebraic equations without constraints 


by introducing Lagrange multipliers. Lagrange multipliers are additional unknown variables 
which are introduced for every constraint equation. 





This allows to reformulate the general optimization problem f(x) under constraint g(x) = 0 from 
minimize f(x) 
x 


(2.9) 
subject to g(x) =0. 
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to the transformed problem 
A(x, A) = f(x) +A (g(x) , 


by introducing the Lagrange multiplier 4. 
A solution (x, A), for which the derivatives of A w.r.t. both x and 4 vanish, is a candidate for 
the solution of the constraint optimization problem (2.9) 


ON(x, A) 
Ox 
ON(x, A) 
OX 
By applying the Lagrangian multiplier approach to the constraint topology optimization prob- 
lem (2.8) a modified cost function LD is obtained. 


=0. 


c= 7+ | (a,p)-r a0 
Q) 


Here the Lagrange multiplier is defined as (U, p) = (Uz, Uy, Uz, p) and ensures that the residual r 
vanishes at each location inside the computational domain (2. The multiplier u is called adjoint 
velocity, the multiplier p adjoint pressure. 

By applying variational calculus and assuming some restrictions on the feasible cost functions, 
the following relation for obtaining the desired sensitivities O£/Oa@ can be found [Oth08]: 


OL. 
Oa; 7 








(a; : u;) Vj. 


For this relation to hold, the Lagrangian multipliers must be specifically chosen, to ensure that 
the variations of £ with respect to u and p vanish. The values for the adjoint multipliers u and p 
can be found by solving the following additional PDEs: 
_ (va * (vu)") u=1V2a— Vp—ai 
V-u=0. 


Comparing to the primal Navier-Stokes equations, the convection of the adjoint velocity in the 
opposite direction of the primal flow can be clearly seen in the negative sign of the convective 


term (va +f (vu)") u. The equations are linear in U, however the matrix vector product (dot 
product on a per equation level) wu’ u introduces mixed terms, which require an iterative solution 


if a segregated solver is used. That is, the convection is discretized as (va't! + (Vi')") u’. 


2.5.2 Derivation of Adjoint Boundary Conditions 


The boundary conditions for the continuous equations depend on the chosen cost function J and 
have to be individually derived for each desired cost function. The boundary conditions for the 
use with ducted flows and total power loss, as defined in (2.10), are given in Table 2.3. 


J = -| (>+ sllul?) u-nd- (2.10) 
r 2 
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Inlet Wall Outlet 
WM: iy = Uae U, =0,u4=0 Vu-n=0 
p Vp:-n=0 Vp-n=0 P = Poutlet 
GD &%=0,t,=Un %=0,U,=0 unl: —u:) +V(n- A): =0 
p Vp -n=0 Vp-n=0 p=U-Ut tnt, +y(n- A), + —fu? — v2 


Table 2.3: Primal and adjoint boundary conditions for inlet, walls, and outlet in ducted flows. 
Primal /adjoint velocities normal /tangential to the patches are denoted by un, = u-n, uz = u-t, 
Cg = Th ay = 14: 


2.5.3, Continuous Shape Sensitivity 


By making an assumption connecting the adjoints to shear forces, the adjoint momentum equations 
can also be used to derive a continuous optimization procedure for surface shapes |Oth08]. For the 
detailed derivation of those relations, consult |Oth08]. More advanced derivations for different cost 
functions, as well as turbulence models are found in a variety of publications by Giannakoglou et al., 
e.g. [Zym-+09]. For a specific amount of outward facing movement in surface normal direction /, 
at an arbitrary point on the surface, the sensitivity can be computed as 

dL di 

— = —Apy(n- V)t- (n- V)ur, (2.11) 

db 
with A representing the surface area affected by the move of the surface by 9. ‘The shape 
sensitivities thus depend on the gradients in normal direction of the primal and adjoint velocities 
tangential to the walls. 

Shape sensitivities give an indication which nodes have to be moved inward or outward respective 

to their face normals. A comparison between this approach and a discrete adjoint solver which 
directly uses the individual points of the mesh as parameters is shown in Section 4.4. 


2.6 Sparse Matrix Storage 


2.6.1 General Sparse Storage Schemes 


In a wide variety of (technical) applications, the matrices obtained by the discretization of 
nonlinear PDEs are sparse. For FVM, the matrices are sparse, because the stencils used to 
approximate the spatial and temporal derivatives are only influenced by a limited number of 
values in the direct neighborhood of the derivation point. A matrix is called sparse if the number 
of non-zero elements is low, compared to the number of zeroes in the matrix. ‘That is for a 
matrix A = (a;;) € R”*”: 











where Nnz = ||{ai; | aij A O}|| is the number of non-zero elements in the matrix A. 

One distinguishes between structurally zero elements of a matrix and zero valued elements, with 
the former being a subset of the latter. For a matrix which is generated by some fixed algorithm 
(code), structurally zero elements are elements which are zero regardless of the values used to 
calculate them (as long as changes in the inputs do not induce changes in the control flow of the 
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algorithm). Additional zero valued elements can be created through numerical operations such 
as multiplication by zero. The sparsity pattern, that is the positions of all structural non-zero 
elements, of Jacobians/Hessians can be obtained by the propagation of pattern sets |Varl1]. The 
determination of sparsity patterns in OpenFOAM is presented in detail in Section 4.5.2. 

Exploiting the sparsity of such matrices is beneficial for a multitude of reasons (in the following 
we assume square matrices, or at least O(m) = O(n)): 





e Storage of the dense matrix is costly at O(n), it is not uncommon for the dense matrix to 
not fit into memory at all. With sparse storage schemes (see below) the memory can be 
reduced to O(nnz). 


e The multiplication of a sparse matrix with a dense vector can be sped up from O(n”) to 
O(nnz) by eliminating memory access and operations on zero entries. 


e Direct solution of linear equation systems (e.g. using LU-decomposition) are sped up by 
exploiting sparsity. 





Sparse matrix storage schemes aim to reduce the memory required to store the matrix coefficients, 
while retaining all information of the matrix and providing efficient access routines to the individual 
entries. Popular choices for sparse matrix storage include the compact row storage (CRS) scheme 
and the coordinate format. 

We will illustrate both schemes using an illustrative example. Let matrix A € 
individual (structural) non-zero matrix entries a;; be defined as 


R**4 with n,, = 8 


ao ao, O ODO 
0 Q11 =Qa42 0 

020 0 99 0 

30 0 0 33 


A= 


The CRS scheme stores the non-zero elements of the matrix in left-to-right and top-to-bottom 
order in vector v (row-wise storage). Consequently, entries of the same row are always adjacent. 
The corresponding column indices of the nonzero elements are stored in vector cy. ‘The vector r; 
holds for each row the position of the first non-zero entry in this row in the vectors v and e;. 
Thus, the memory requirement depends on the number of non-zeroes and the number of rows: 
MEM = nynz - sizeof (value_type) + (nnz +m) - sizeof (index _ type). 








UC = |ao0; A201, 411, 412, 420, 422, 430, a33] 
c; = (0, 1,1, 2,0, 2,0, 3]” 
a — (0, 2,4, 6] : 





The coordinate format stores the non-zero elements in vector v, the row indices of the nonzero 
elements in vector r; and the column indices in c;. Consequently, the memory requirement is 
nz sizeof (value_type) +2-nnz - sizeof (index _ type). 
ig 
Vv = |a00, 201, A411, 412, 420, 422, 430, 233] 
rz = (0,0,1,1,2, 2,3, 3)” 
ce; = [0,1,1,2,0,2,0,3]" . 
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The coordinate format is not unique, in that the entries in v can be stored in arbitrary order. In 
implementation it is advisable to store the entries of v in a cache efficient order. I.e., if matrix 
vector products are required, v should be stored row-wise, such that subsequent entries can be 
efficiently read from cache. 

For nnz > m (for practical applications usually n,, >> m), the CRS scheme occupies less 
memory than the coordinate format. The adjacency of the nonzero entries of a row and the 
explicit availability of the column indices makes the CRS format well suited for performing sparse 
matrix vector multiplications. However, it adds a layer of complexity to the retrieval of individual 
matrix entries. A matrix stored in CRS format can not be trivially transposed. A matrix stored in 
coordinate format can be transposed by simply switching the row and column vectors r; and ¢;. 





2.6.2 OpenFOAM LDU Format 


OpenFOAM uses a variation of the coordinate format to store its matrix coefficients. ‘The specifics 
of the storage scheme will become important when symbolic adjoints are applied to the embedded 
linear solvers. In the context of CFD simulation, the diagonal of the FVM discretization matrix 
is always dense, as the discretization using finite volumes always yields a central coefficient. 
Furthermore, the discretization matrix A € R”"°*"¢ is structurally symmetric and square. 
The diagonal coefficients of the matrix are stored in a dense vector d = [ayy |0<i< no]. A 
diagonal entry a;; can be looked up directly by accessing d;; thus no additional row and columns 
indices need to be stored. All other non-zero entries are stored in vectors J and u, where 
l= aj; | aij A 0, 2 > J] are the coefficients below the diagonal and u = |a;; | aj; #0, 1 < J] are 
the coefficients above the diagonal. In the implementation, the vectors J, d, and uw can be obtained 
by calling the access functions lower(), diag() and upper() respectively. Note, that to resolve 
the symbol clash between velocity u and upper entries u, all vectors corresponding to the LDU 
format are typeset in bold italics. 

For symmetric matrices, only one of the vectors 1 and wu needs to be stored. In addition to 
the non-zero values stored in l and/or u, the row and columns indices of the matrix entries need 
to be stored. ‘The indices are stored in addressing arrays L € N”"¥ and U € N”¥. The indices 
are ordered, such that EL is monotonously increasing and subsets of U with identical LE are also 
monotonously increasing (see following example). This ensures good caching performance for the 
evaluation of matrix vector products. The addressing arrays can be obtained from the lduMatrix 
class by calling lowerAddr() and upperAddr() respectively. 

Due to the structural symmetry of the finite volume discretization, entries of the lower part are 
given by 





QU;,L;, = bis 
and the entries of the upper part by 
QL;,U; = Uji : 
As an example let 
do Ug U1 0 0 0 
lo dy uz uz 0 O 
0 


A= ly lo dy U4 U5 
0 ls la dg Ug U7 
0 0 lr l¢ da UZs 
0 O O ty lg ds 
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be a 6 x 6 banded matrix storing 24 non-zero entries. Then the LDU format of A is given by 


| 
ldo dy dg ds dg ds5| 
= |u0 Uy U2 UZ Ug U5 USE U7 us | 
iL ft 22 3 2 4 
23 3 4 4 5 5]. 


For the solution of linear systems arising from the FVM discretization, boundary conditions need 
to be applied to faces which only connect to one cell. The boundary coefficients are stored in 
two additional vectors B and I, called boundary components and internal components. The 
coefficients in J are the coefficients which correspond to the influence of the boundary conditions 
onto the central coefficients of the matrix stored on the diagonal. In contrast to the I,d and u 
coefficients, those coefficients are not necessarily identical for all dimensions of a fvVectorMatrix. 
The coefficients of I for the correct dimension are only added to d on demand, before calling the 
linear system solver, and removed again after the solver has finished. The coefficients in B can 
be imagined as virtual cells outside of the computational domain, which arise from the boundary 
conditions. ‘They are added to the right hand side of the equation system on demand. 

For later reference, Listing 2.4 gives a truncated overview of the relevant lduMatrix.H and 
lduAddressing.H header files. 
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1/class lduMatrix { 

2| private: 

3 //- LDU mesh reference 
4 const lduMesh& lduMesh_ =; 

5 //- Coefficients (not including interfaces) 

6 scalarField *lowerPtr_, *diagPtr_, *upperPtr_ 5; 
(A jell tae = 

8 //- Abstract base-class for lduMatrix solvers 
9 class solver { 


10 PEObeCcted: 

11 const FieldField<Field, scalar>& interfaceBouCoeffs_=; 

12 const FieldField<Field, scalar>& interfaceIntCoeffs_=; 

13 lduInterfaceFieldPtrsList interfaces_; 

14 eee 

15 Dilbderc:: 

16 const FieldField<Field, scalar>& interfaceBouCoeffs() const; 
17 const FieldField<Field, scalar>& interfaceIntCoeffs() const; 
18 const lduInterfaceFieldPtrsList& interfaces() const; 

19 ee 

20 Ips 

21 

22 //- Return the LDU addressing, access to coefficients 

23 const lduAddressing& lduAddr() const; 

24 scalarField& lower(); 


25 scalarField& diag(); 
26 scalarField& upper(); 


a7 

28 bool hasDiag() const; 

29 bool hasUpper() const; 

30 bool hasLower() const; 

31 bool diagonal() const; 

32 bool symmetric() const; 

33 

34 //- Init the update of interfaced interfaces for matrix operations 
35 void initMatrixInterfaces([...]) const; 

36 

37 //- Update interfaced interfaces for matrix operations 
38 void updateMatrixInterfaces([...]) const; 

39 ee 

40| }; 

41 


42, class fvMeshLduAddressing : public lduAddressing { 
43 |one hv auie ss 

44 labelList::subList lowerAddr_; 

45 const labelList& upperAddr_ ; 


46 elie ee 

AT //- Return lower addressing (i.e. lower label = upper triangle) 
48 const labelUList& lowerAddr() const; 

49 


50 //- Return upper addressing (i.e. upper label) 
51 const labelUList& upperAddr() const; 


Listing 2.4: ‘Truncated LDU description of class lduMatrix. 
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2. Differentiation of Computer Programs 


In this section the fundamental methods to obtain derivatives of arbitrary numerical code are 
presented. After the derivation of finite differences, first- and higher-order models of AD are 
introduced. 


2.7.1 Finite Differences 


Finite differences (FD) are a popular method to approximate the derivatives of uni- or multivariate 
functions. FD introduces approximation errors and, when implemented with floating-point 
precision, also additional numerical truncation and rounding errors. 

The derivative of a continuous function f : R > R at the location x9 is defined as 


7 f (zo + h) — f(x) 
dx h—0 h 


If this limit exists, the function is called differentiable at the location xg. For finite values of h, 
this definition can be used to approximate the derivative: 


~ f(vo +h) — flxo) 
7 (#0) ® : ; Cay 


This finite difference is called forward difference. Analogously the backward difference is defined as 


ai ~ f (to) = f(o = h) 
dx h 


Both approximations introduce an approximation error of O(h), as will be shown in Theorem 1. 
A more accurate approximation, scaling with O(h7), can be found at the cost of one additional 
function evaluation (assuming that f(x) is already known and therefore evaluation at x9 does 
not incur additional cost). 





LF (9) ys Co Ww — Fo — F) 
da" 2h : 
The accuracy of the FD schemes can easily be proven using Taylor expansion, as shown in the 


(2.13) 


following three proofs. ‘The convergence properties will be used in a later section to verify the 
AD implementation. 


Theorem 1 (Onesided FD). 
Let f be a function R > R which is differentiable at all points in the interval |x, xo]. Then the 
approximation error of the forward difference (2.12) scales with O(h) for h > 0. 


Proof. The ‘Taylor expansion of function f at x = x) +A truncated after the third term of the 
infinite sum evaluates to 
df hedey 


f (xo + h) = f (xo) + h— (20) = FD daz (0) Ss O(h?) 


Subtracting f(a) on both sides of the equation and dividing by h gives 


f(zo +h) — f(vo) _ df | hdéf 
h dx 2dx? 
f (xo + h) — f (xo) 


(ro) = —— + O(h). 


(xo) + O(h7) 


o 


=> 
dx 


36 


2.7 Differentiation of Computer Programs 


The proof for the backwards difference directly follows from the Taylor expansion 


d h? d? 
Fao — h) = f(a) — hE (a) + SF (a0) + O89). 


Theorem 2 (Central FD). 
Let f be a function R > R which is differentiable at all points in the interval [aq — h, xo + hl. 
Then the approximation error of the central difference (2.13) scales with O(h?) for h > 0. 


Proof. The Taylor expansions of function f at « = 79 +h and x = x — h, truncated after the 
third term of the sum evaluate to 


d hed 

Feo +h) = Feo) + HSE (0) + 5 T(00) + O(N) 
d h? d? 

Feo —h) = feo) — WSL (0) + 5 TF (r0) + O(N) 


Subtracting the second equation from the first, the second order term vanishes and leads to the 
desired approximation: 

d 
f(to +h) — f(xo —h) = onc 


a6 


(xo) ae O(h?) 


es (a) _ Flo +h) — fleo — fh) 4+ O(h2). 


LI 


The definition of the finite difference can be straightforwardly extended to the multivariate 
case f : R” > R, giving approximations for the 7-th directional derivative as 











i (xo) = F(xo + hei) ~ (xo) = ba — Fo) + O(h) 
cE (9) = Pa = FoF) + on) 
a (xy) v f (Xo + sacle Bt — he; ) 4 O(n?) . 


FD can be applied to obtain second and higher order derivatives, by eliminating the first order 
derivative term from the Taylor expansion. This can be interpreted as reapplying the first order 
FD model to both evaluation points of the first order central difference. 


Theorem 3 (Second order FD). 
The second derivative f of a univariate function f(x): R— R, which is twice differentiable at all 
points in the interval |x — h,xo +h] can be approximated as 


VT py 0 2) ee 
dx? 0" h2 


The approximation error scales with O(h?). 
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Proof. As before the Taylor series of f(ao +h) and f(a2o — h) are 


d h? d? hed? 

f (xo ++ h) = f (xo) ++ nT (09) ++ 7 SE (a0) za et! ++ O(h*) 
d he ae gel 

Fao — h) = f (a0) — ASE (00) + 54 (ao) — SF 4 O(n"), 





Adding both equations and subtracting 2 (29) from both sides yields the desired result: 


F(vo+h) + f(00— h) ~ 2f (a0) = PL (eg) + O(H') 


sl _ F(to +h) — 2f(xo) + f(z — A) 


4,2 (#0) = h2 Si O(h7) ‘ 


L] 


FD allows the evaluation of the Jacobian of a multivariate function f : R” > R™ at cost O(n) - 
cost( f), irrespective of the output dimension m. The truncation error can introduce numerical 
noise. The stepsize h needs to be tuned, such that it is low enough to give a reasonable 
approximation, but high enough not to encounter numerical issues due to machine precision. 
This is challenging, especially for multivariate functions with partial derivatives which differ in 
order of magnitude. Advantages are the comparatively simple implementation and the ability to 
differentiate complex models, to which the source code must not necessarily be available. 


2.7.2 Algorithmic Differentiation 


Algorithmic Differentiation |GW08; Naul2| (AD), sometimes also called Automatic Differenti- 
ation |Bar-+-00], names the process of generating derivatives of a given (numerical) computer 
program, calculating the sensitivity of one or several outputs w.r.t. a set of inputs. Conceptually 
AD implements the evaluation of the chain rule on the sequence of operations connecting the 
inputs to the outputs. 

Let f(x) be a function which maps a vector to a scalar f : R” — R, and which is at least twice 
continuously differentiable (C*). Then the gradient V f(x) : R" — R” of that function is defined 
as 








where e; denotes the 7-th Cartesian unit vector. 
The Hessian of f is given by the symmetric matrix H € R”*” of second order partial derivatives, 
where each entry fj; 1s given by: 


af 


e=he = . 
i "9 Orda, 





H| 


Let g(x) € C' be a different function which maps from a vector to a vector g: R" — R™. Then 
the Jacobian J of g is am X n matrix where each entry is given by: 
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As can be seen from the above definition, the gradient is the special case of a single column 
Jacobian. 

AD allows to evaluate the derivatives of functions implemented as computer programs with 
machine precision accuracy. First order AD assumes that the function is differentiable at least once 
at all points of interest. AD relies on the fact, that each computer program can be decomposed 
at run time into a single assignment code (SAC), which we define below. 








Definition 5 (SAC). 

Each computer program implementing numerical functions can be decomposed into a sequence of 
elemental functions ~; and assignments, mapping n independent inputs tom dependent outputs 
with p intermediate variables: 


forj=n,...,.n+tptm-l1: 
Vj = Yj (Vi)ixjs 


where 1 < j denotes a direct dependence of the variable v; on v;. The result of each elemental 
function y; 1s assigned to a unique auxiliary variable v;. The n independent inputs 7; = vj, 
fori =0,...,n—1, are mapped onto m dependent outputs yj; = Un+p+j, for J =0,...,m—1. 
The values of p intermediate variables uv; are computed fork =n,...,n+p—1. If not otherwise 
specified we restrict the functions admittable as elemental functions to unary or binary functions, 
lamiting the number of arguments for each elemental function to at most two. 


Definition 6 (DAG). 
A directed acyclic graph G = (V, E) is a directed graph which contains no cycles, i.e. there is no 
directed path starting and terminating at the same node [TS11]. 





A SAC can be conveniently represented as a DAG, where the n + p+ m™ nodes are the inputs, 
output, and intermediate variables, uniquely defined by the elemental functions. The edges model 
the dependence of the elemental functions on intermediate values and the inputs: 





Ov; 
Ov; 





(y,u)En © 


£0. 


We label the edges with the partial derivatives, to aid the calculation of the full derivatives by 
executing the chain rule on the paths in the graph. An example program, its transformation to a 
SAC, and the corresponding DAG are shown in Figure 2.11. 

Tangent Mode of AD 


In this and the following definitions for AD models the notation from [Nau12] is used. To declutter 
the indices, later also the notations from |GW08] are employed for first order derivative models. 


Definition 7 (First order tangent model). 
Let f : R" > R™ with y= f(a), 2, 2) © R” and y, yy © R™. Then the first order tangent 
model Aw > IR” x R” > R” X R” of f calculates the primal as well as the Jacobian, evaluated 


in direction x): = o 
Y\ — oa) (0) .\ _ (VF(@)- 2 
(% ) =F (2.2) = (OS) 
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The superscript e“) denotes the first tangent direction. The motivation for this notation will 
become apparent when higher derivatives are introduced. To make notation more compact, and 
to broaden compatibility to existing literature |GW0O8], for first order tangents we define the 
following equivalent notation: 


5.— ()). 


With the gradient in direction of a tangent x, the full Jacobian can be assembled at cost 
O(n) - cost(f), by evaluating the tangent model with all unit vectors x = e; € R”, giving one 
column of the Jacobian for each evaluation. Here O(n) includes the overhead of the tangent 
model evaluation ri , compared to a passive primal evaluation of f. 

The tangent model is now applied to each assignment of the SAC. Assuming differentiability of 
all elemental functions at their respective evaluation points, the tangent model of AD augments 
each elemental assignment with its tangent as follows: 





forj=n,...,nt+t+p+m-1 


Op; . 
=>) Do, Oj (2.14) 


tJ 





Uj = Pj (Vi)ixj- 
Here the variables v are the tangents associated with the primal values v. ‘The directional 
derivatives are evaluated alongside the primal elemental functions, propagating them from the 


inputs to the outputs. 
As an illustration the tangent model for the function 


y = (fo + 21)(1 + £9) 


is shown in Figure 2.11. Transforming this function to a SAC gives the following sets of inputs, 
output, and auxiliary variables: n = ||{vo, v1}|| = 2, m = ||{va}|]| = 1, p = ||{ve2, v3}|| = 2. This 
SAC can alternatively be represented as the DAG shown in the figure. An auxiliary variable t is 
inserted into the DAG. Its partial derivatives are defined, such that the tangent model 





; du, : 4g 4 
V4 = = = Vva(v0, 1) + Yo, 1] 
dt 
is created by multiplying the partials along the paths connecting t to v4. ‘The SAC is transformed 
according to the rules of Equation 2.14, resulting in a program augmented by the tangent 
statements. The resulting calculation is shown on the right of the figure. It calculates the 


tangent y = v4 as well as the primal result y = va. 


Adjoint Mode of AD 


Definition 8 (First order adjoint model). 

Let f :R” > R™ with y= f(a), © € R”, and y € R” be an at least once differentiable function. 
Further let a) € IR" and yy € R™ be adjoint variables corresponding to the primal variables x, y. 
Then the first order adjoint model f(1) : R’™ x RY > R" x R™ of f calculates the primal as well 
as the product of the transposed Jacobian with Y1): 


("0 = fay (¥a,#) = ‘ier ¥en | : 
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V2 = V1 * V9 + Uo * VY 
V2 = Ug * Vi 
U3 = V2 * U1 + V1 * V2 








V2 = U9 * UI U3 = U2 * UI 
V3 = U2 * VY V4 =1*va+1* v3 
U4 = U3 + V9 U4 = U3 + V2 

(a) DAG (b) SAC (c) Augmented SAC 


Figure 2.11: Illustration of the tangent model for y = (2 -21)(1 + x2), with DAG annotated 
with partial derivatives on the edges (left), primal SAC (middle), and SAC augmented to 
calculate the first order tangent model (right). 





The full Jacobian can therefore be calculated at cost of O(m) - cost(f) relative to the primal 
function evaluation. For the common case of a scalar output y, the gradient can be obtained at 
cost O(1) - cost(f). Compared to the factor O(n) - cost( f) of tangent mode this lower complexity 
is the prime motivating feature of the adjoint mode. 





The subscript e(;) denotes the first adjoint direction. Similar to the tangent mode we define 
the following equivalent notation for first order adjoint models: 


:— (1) ‘ 


Again, the adjoint model is now applied to each assignment of the SAC. In adjoint mode, a 
forward evaluation of the original program is succeeded by the propagation of adjoints for all v; 
in reverse order, that is, fori =n+p—1,...,0: 


for 7 =n,...,n+pt+tm-l 


forward section, 
Uj = Di (Ui )ias 








fori=n+p-—1,...,0 (2.15) 
_ Op; _ reverse section. 
= Do ay 8 
Jt 


Here the variables v are adjoint variables associated with the primal values v. The adjoint sensi- 
tivities are evaluated only after the primal elemental functions have been evaluated, propagating 





them from the outputs back to the inputs. In practice the sum resulting in U; is not evaluated all 
at once, but v; is incremented by the incoming partial derivatives one at a time, motivating the 
incremental nature of adjoint code [GWO8]. 
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V2 = U9 * Vi 
U3 = V2 * Ui 
V4 = U3 + V2 





V2 = U0 * UI U3 = U4} V2 = U4 

V3 = VQ * UY Vo+t = V1 * U3; V1 = U2 * U3 

U4 = U3 + V2 Vo = V1 * V9; Ui + = V9 * V2 
(b) SAC (c) Augmented SAC 


Figure 2.12: Illustration of the adjoint model for y = (a - x1)(1 + x2). Left: DAG with adjoint 
extension s, annotated with partial derivatives on the edges. Center: SAC. Right: SAC 
augmented to compute the first order adjoint model. 


Note that the v; computed in the forward section are potentially required as arguments of local 
partial derivatives within the reverse section. They are read in reverse with respect to the original 
order of their evaluation. The additional persistent memory requirement of the adjoint code is 
O(n+p+m). This data flow reversal is the main challenge in adjoint AD. It is responsible for a 
naive implementation of AD typically not being applicable to large-scale numerical simulations. 
The available persistent memory may simply not be large enough [Nau12]. 





In Figure 2.12 we show the application of adjoint mode to the function already considered for 
tangent mode. An auxiliary variable s is inserted into the DAG. Its partial derivative is defined, 
such that the adjoint model 


ds 


PRC = Sho, 21] 


= Vv4 (vo, oe “U4 


is created by multiplying the partials along the paths connecting [v9,v1| to s. Note how the 
forward evaluation of the program is now spatially separated from the calculation of the adjoints. 
The adjoints are evaluated in reverse order, propagating the adjoints of the outputs y = Us 
back to the inputs % = Up and 1 = v;. ‘The intermediate v2 is required to calculate v1 after 
the primal evaluation has already ended, highlighting the additional memory cost added by the 
adjoint method. As variables v1; and v2 have more than one outgoing edge, their adjoint value is 
influenced by multiple paths of the data flow reversal, motivating the incremental nature of the 
adjoint propagation. The auxiliary variable v3 is not required in the reverse section as it is only 
used linearly by other assignments and thus vanishes when evaluating Equation 2.15. 
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Higher Order Derivative Models 


Derivatives of second and higher order can be obtained by recursively applying the first order 
models onto models obtained by either tangent or adjoint mode. In the following, only the second 
order models are presented, as they can be used for a variety of optimization tasks, and can 
further be used to verify adjoints versus tangents, as presented in Section 4.4. For third and 
higher order models, please refer to literature, e.g. [Nau12]. 





Definition 9 (Tangent over Tangent Model). 

Let f : R” > R with y = f(a), c € R”, andy © R be an at least twice differentiable function. 
Applying the first order tangent model to fi (1) (a! (1) , x) yields the second order tangent model 
fG 2) ( gl) gl?) gpG2) | x) : IR” x R” x R” x R° —> RXxXRXRXR, which calculates the following 
relations: 


ylPN ah" 0? F(a) - af) + Vf (a) - at?) 
yh) 7 V f(z) a) 
yO | V f(a) - a 

y f (2) 


One entry h;; of the Hessian can thus be obtained from y\b2) by seeding X(1) = €:, X(2) = @;, 
and X(1,2) = 0. The full Hessian can be obtained at cost n(n + 1)/2- cost (f)) by exploiting 
the symmetry of the Hessian. For sparse matrices, the cost can be further lowered by coloring 
approaches (see Section 2.11). 


Definition 10 (Tangent over Adjoint Model). 
—_ the first order adjoint model to f1) (y Y(1)) £ wu) yields the second order tangent over adjoint 


model i. (a! 2) g, yy Yay) | > IR” x R” x R kX R- R” X R"” X RXR, which calculates the 


following relations: 


oD \ (yay: V? F(a) +2 ne: VG) 
a) _ Vi(x)* - ya) 

y? V f(a) - a?) 

y f(x) 


2) 


The i-th row/column of the Hessian can thus be obtained from oa by seeding x?) = e,, yay =1 


and ya} = (0. The full Hessian can be obtained at cost n- cost ( ran Again the number of 


evaluations of te can be improved by coloring, seeding multiple directions of x‘) at once. 


Definition 11 (Adjoint over Adjoint Model). 

Applying the first order adjoint model to f,1) (4 Y1), £ x) yields the second order adjoint over adjoint 
model f(1,2) (a £(1,2), £, Y(1); y(2)) | IR” xX R” x R x R-> R X R” X R” X R, which calculates the 
following relations: 


Y(1,2) V f(x) +x L( 1,2) 

Za) | _ V f(a)" ya) 

(2) yay: Vf (a) - 2 B12) + V F(a)" - y(2) 
y Fa) 
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The i-th row/column of the Hessian can be obtained from X(2) by seeding x1) = e;, ya) = 1, 
and yg) = 0. The full Hessian can be obtained at cost n - cost ( faa). The cost is thus identical 
to the tangent over adjoint model, and no further gain in complexity is realized by applying 
the adjoint model twice. In practice the choice of which adjoint model to use depends on the 
implementation of the AD tool and caching considerations on the executing machine |Lot16]. 


Definition 12 (Adjoint over Tangent Model). 
Applying the first order adjoint model to fae (a), x) yields the second order adjoint over tangent 
model i (2, x, yay Yay) > IR” x R” xX R xX R - R” X R” XR X R, which calculates the 


following relations: 


m2) \ (Ya) V7 F(a) + Vila)? “va 
1 
ey = V f(a)" Ya) 
7 f (2) 
The 7-th row/column of the Hessian can thus be obtained from xg) by seeding x) =e, es =], 


and y2) = 0. The full Hessian can be obtained at cost n - cost ( oe) 


The adjoint over tangent model is rarely used in practice, due to the increased storage costs 
required for the backward propagation compared to the tangent over adjoint model. 


2.7.3 Tool Driven Derivative Generation 


Broadly speaking AD tools follow one of two approaches, source code transformation or operator 
overloading. Both approaches are briefly presented below. 


Source Code Transformation 


Source code transformation tools parse the primal source code and transform it by applying 
the tangent or adjoint models, Equations (2.14) or (2.15), onto the individual statements, 
producing a new source code which can then be compiled/executed and optimized by a regular 
compiler /interpreter. The generated source code looks and performs much like a code differentiated 
statement by statement by hand, but allows greater flexibility and more rapid code development. 

The source code transformation approach requires that the tool has access, and is able to 
parse, the source code. This makes the code transformation of programs written in complex, 
object oriented languages like C++ challenging. Especially templated codes, where substantial 
parts of the code are only instantiated at compile time, are not well suited to differentiation 
with source code transformation tools. Hybrid approaches are possible. For example, source 
code transformation can be applied to numerical kernels that are limited to C syntax and some 





subset of the C++ standard. ‘The remaining program wrapping the numerical kernels can then 
be differentiated by operator overloading. The resulting partial derivatives of the different code 
blocks can then be multiplied together using the chain rule. 

General purpose AD tools which use the source code transformation approach include TTAPE- 
NADE |HP13] (C and Fortran), dec [F6r14] (C) and Tangent |Gool7| (python). Furthermore 
domain specific tools exist, e.g. DolfinAdjoint [Far+13] for the Dolfin [LW10] / FEniCS |Aln+15] 
packages for solving differential equations using FEM. 
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#Hinclude <iostream> 


struct ADtypetf 

double v; // value component 

double t; // tangent component 

ADtype(const double& v, const double& t = 0.0) : v(v),t(t) f}; 
I} 


ADtype operator*(const ADtype& x1,const ADtype& x2){ 
Lelunm Abi ype cel. v+x2 Vy tl Vel tt et ty): 

i; 

ADtype operator+(const ADtype& x1,const ADtype& x2){ 
Pevucny AD Ly pe cle yoo gel toh Opt) 

Ir 


int main(){ 
ADtype x = 2; 
x.t=1; // seeding 
ADtype y = x¥*x+x; 
Stid@ Cowbes< 2 ye te Cn da Oe 


t 


Listing 2.5: Implementation of a basic operator overloading tool. A custom type calculates 
tangents of programs containing additions and multiplications. Application is demonstrated 
by calculating the tangent y = 2x7 +1=6o0f y= 2-x2+~2 at location x = 2. 


Operator Overloading 


To circumvent the issues faced by the source code transformation approach, the operator over- 
loading approach uses the operator overloading features present in many modern programming 
languages. We will focus on the implementation, as it applies to C++. Operator overloading 
allows to replace the intrinsic implementations of the basic numerical operators (+,—,...) and 
functions (sin, exp, pow), implementing custom behavior. An AD tool implements one or mul- 
tiple custom data types that are used to replace the floating point data types of the primal 
calculation. The custom data type implements the necessary operators to, in addition to the 
primal calculation, propagate tangents/adjoints through all (unary or binary) elemental functions 
occurring in the code. 


As an illustration of this concept a very basic C++ operator overloading tool, that is able to 
calculate tangents of programs containing assignments, additions, and multiplications is shown 
in Listing 2.5. It is applied to the expression y= x%-r%+ 42. 

Operator overloading tools are able to, with minor exceptions, cover the whole language 
standard of C++ and are thus applicable to heavily templated code bases. Codes treated by those 
tools require only minimal code changes (some of which are mentioned in Section 3.2.3) and are 
by definition always up to date with the primal code base, as the instructions calculating the 
derivatives are generated at compile and run time alongside the primal. 





Operator overloading introduces a run time penalty, which is more pronounced in some languages 
than in others. For compiled languages, most of the overhead can be offset by compile time 
optimization, most significantly function inlining. The memory overhead of the adjoint is generally 
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more pronounced than for codes generated with source code transformation, as a representation 
of the SAC has to be stored at run time. The performance and memory consumption can be 
improved by (statement level) preaccumulation, briefly discussed in the next Section. 

Operator overloading tools include ADOL-C [WG12] (C,C++), CoDi-Pack |SAG17] (C++), 
dco/c++ [LLN16] and AdiMat |BBV06] (Matlab). 

Novel approaches are currently in development, further reducing the run time overhead 
by exploiting advanced templating features of C++, at the cost of requiring more code alter- 
ations |Lep+17|. This allows to extend preaccumulation from the statement level to whole 
code blocks. 


2.8 Introduction to dco/c++ 


In this section we will give a brief overview of the architecture and interface of the operator 
overloading AD tool dco/c++ [LLN16]. Brief example drivers, usable to obtain gradients, Jacobians, 
and Hessians are given. 


2.8.1 Scalar and vector tangent mode 
The AD tool dco/c++ implements a generalized tangent scalar data type 
template<class T> dco::gtis<T>:: type; 


which carries a tangent component of type T alongside the value component of the same type. For 
a first order tangent model, this type is instantiated with a floating type data type (i.e. float or 
double). It can be initialized with a zero tangent by assigning a primal value. 


dco::gtis<double>::type x = 42; 


The tangent type carries no further data; thus sizeof (dco: :gt1is<T>::type) == 2*sizeof(T). 
The derivative components of all dco/c++ types can be accessed by the interface routine 
dco::derivative(). Here it returns a reference to the tangent component of the passed variable. 


dco::gtis<double>::type x; 
double t = dco::derivative(x); MA const access 
digo: derivative (x) = 1-0. //¥non const access 


The value component of a dco/c++ type can be accessed by the dco::value() routine. ‘This 
allows to alter the value without changing the tangent /adjoint and to convert a dco/c++ type to 
a passive type, discarding its derivative information. 


dco::gtis<double>::type x; 


double v = dco::value(x); // const access to value component 
dco::value(x) = 42.0; // non const access, tangent unmodified 
42 0: // overwrites tangent with 0.0! 


Complementing the tangent scalar type gtis<T>::type dco/c++ also defines a vector type 
gtiv<T,d>::type. This vector type carries a vector of tangents t € R®% alongside the pri- 
mal value. ‘The tangent model is applied to each vector entry individually, allowing to evaluate d 
seed directions at once. ‘The vector mode reduces the number of elemental function evaluations. 
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gtiv<double,d> sin(const gtiv<double ,d>& x){ 
gtiv<double ,d> jy; 
dco: value GG) = Sani(x); 
double partial = cos(dco::value(x)); 
for€int i=0; i<d; itt) 
dco::derivative(y) Li] = partial*dco::derivative[i]; 
Fevurn y; 


Listing 2.6: Implementation of sin operation for tangent vectors of length d. 


To evaluate n seed directions, instead of n calls to the scalar tangent model, only |n/d] calls to 
the vector tangent model are needed. Further, it improves floating point performance, due to the 
increased cache locality introduced by the loop evaluating the individual tangents. The benefit 
due to caching varies from code to code. For maximum performance, the vector size is fixed at 
compile time and implemented as a plain array. A sensible vector size is e.g. 16. Tangent vectors 
which are too long consume excessive amounts of memory, and may decrease cache efficiency 
due to the vector not fully fitting into one cache line. ‘The size of a vector type in memory is 
sizeof (dco: :gtiv<T>::type) == (d+1)*sizeof(T). 

Accessing the values of the tangents follows the same interface as the scalar tangent type, with 
the difference that dco: :derivative returns a reference to the tangent vector, from which the 
desired element can be accessed using the usual vector operations. 





dco::gtiv<double>::type x = 42; // init tangents to zero 
double t = dco::derivative(x) [0]; // const access 
dco o derivative (x) [1l))— 120- /Q n@maconst access 


An exemplary implementation of the sin operation, using the interface introduced above, is shown 
in Listing 2.6. 

The general purpose driver shown in Listing 2.7 calculates the gradient of a multivariate 
function f : R” — R, assumed to be implemented (externally) as T f (std::vector<T> x), 
requiring 7 calls to the function f. 

To find a sensible vector length d, the driver is tested for a problem size of n = 2'° and tangent 
vector sizes ranging from 2 to 1024. The functions considered are 


n—1 n—1 
y=) v;,° 2; and y=) sin 2. 


The results are shown in Figure 2.13. ‘The results shown for d = 1 are obtained by the scalar 
tangent types and the curves are normalized, such that the run time of scalar execution equals one. 
The products x; - x; are computed cheaply. ‘Thus, the run time is dominated by memory lookup 
of x;, and caching effects are clearly observable. Here the optimal tangent vector size is 32, giving 
a speed up of approximately factor 2.5, compared to the scalar tangent version. ‘Trigonometric 





functions are much more expensive to compute (here by a factor of roughly 25), therefore the 
saved calls of the sin function for the primal and cos function for the partial derivative have a 
bigger impact. Again, a vector size of 32 is a good choice, however the minimum time is achieved 
by choosing a vector size of 256, resulting in a speed up factor of 30. 


AT 


Oo an ana FF Ww NY 


Oo NY BF Be Be Be Be Be Be Be Be 
Fe Oo Oo Wan Dn wo FPF Ww YY fF OO 
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#include "dco_cpp_dev/src/dco.hpp" 
#include <vector> 
using namespace std; 


typedef dco::gtis<double>::type tis_type; 
const int vector_size = 16; 
typedef dco::gtiv<double ,vector_size>::type tiv_type; 


template<typename T> 
T f(€vector<T> x); 


// scalar driver 
vector<double> calc_grad_f_tis(const vector<double>& xd){ 
const int n = xd.size(); 
vector<double> grad(n) ; 
vector<tis_type> x(xd.begin() ,xd.end()); // copy passive values 
fre Gre Oe mess) 
dco::derivative(x[i]) = 1.0; // seed i-th unit vector 
tis_type y = f(x); // augmented primal calculation 
gradli] = dco::derivative(y); 
dco::derivative(x[i]) = 0.0; // reset seed vector to 0 
Ir 


return grad; 


// vector driver with vector size 16 
vector<double> calc_grad_f_tiv(const vector<double>& xd){ 
const int n = xd.size(); 
GONst int d= vector size; 
vector<double> grad(n) ; 
vector<tiv_type> x(xd.begin() ,xd.end()); // copy passive values 
for(int i = 0: i < ceil(n/@®: it) 
// increment through local and global indices 
for(int j=itd; j < min((Cita® «d@a); j++) 
dco::derivative(x[j]) [jZ%d] = 1.0; // seed g-th unit vector 
tiv_type y = f(x); // augmented primal calculation 
for(int j=i*td; j < min((iti)*d,n); jr++){ 


grad[j] = dco::derivative(y)Ljjd]; // extract tangents 
dco::derivative(x[j]) [j%d] = 0.0; // reset seed vectors to 0 
i 
Ir 
return grad; 


t 


Listing 2.7: Driver calculating the full gradient of a multivariate function f : R” — R using 
tangent mode of AD. 
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Figure 2.13: Run time of the tangent vector benchmark for varying tangent vector size d. Run 
time is normalized to the execution time of the scalar tangent. 


The tangent mode of AD does not need any AD specific interface routines, other than the 
access functions dco::value() and dco: :derivative(). 


2.8.2 Adjoint mode 
Concepts 


For the calculation of adjoints, dco/c++ uses efficient data structures to store the information 
required for data flow reversal. First, we introduce the graph data structure, with which the 
data flow reversal can be described. Second, we show how dco/c++ stores this information 
in its internal representation. ‘This will become important for the optimizations applied in 
Sections 3.4, 3.7, and 3.8. 

Every elemental assignment in the code can be expressed as a DAG, modeling the SAC generated 
by the assignment. For an elemental assignment, the output is by definition a scalar, connected 
to one or multiple inputs. The SAC, consisting of only basic unary or binary operations, can be 
conveniently differentiated with the chain rule by overloading the operators. A traditional operator 
overloading approach, e.g. applied in ADOL-C [WG12], operates on data structures similar to the 
DAG with all intermediate nodes included in the graph representation. By implementing operator 
overloading via a template expression engine, the elemental gradients of a single assignment 
in the code can be assembled during the augmented forward run by multiplying together the 
partial derivatives created by the interior edges of the DAG. This technique is commonly referred 
to as statement level preaccumulation |GWO8]. For the specific implementation in dco/c++, 
refer to [LLN16]. 

On the graph level, preaccumulation transforms the DAG into a bipartite graph connecting the 
inputs to the scalar output. ‘This significantly reduces the storage requirements for the data flow 
reversal, as no intermediate nodes and edges of the SAC need to be stored. 

An edge e € E in the bipartite graph structure (V, FE), pictured in Figure 2.14, corresponds to 
a preaccumulated derivative of an output with respect to a specific input. The values of the edge 
labels can be calculated during the augmented forward section, as they do not depend on any 
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Figure 2.14: Illustration of the adjoint model for code consisting of two elemental assignments. 
For each assignment in the original code, the individual edges created by the corresponding 
SAC can be transformed into a bipartite graph using preaccumulation. 


values created in the adjoint propagation phase. A vertex v € V represents an adjoint variable. 
During the reverse propagation phase, also called interpretation phase, the adjoint information 
is propagated through the bipartite graphs modeling individual assignments y = f(x). The 
edge labels storing the preaccumulated partials Oy/Ox; are multiplied with the corresponding 
adjoints y of the left hand side of the primal assignment. ‘The resulting products are used to 
increment the adjoints of the inputs x; which are accessible via the edges of the graph: 


m= seg Vie 10,39, n} . 








The values of the adjoint variables are stored in a contiguous vector in RAM, called the adjoint 
vector. The preaccumulated partial derivatives are stored in a graph like structure, storing 
the partials as well as the position of the element in the adjoint vector which needs to be 
incremented during the reverse propagation phase. We call this data structure the adjoint stack, 
as it continuously grows during the augmented forward run and gets evaluated sequentially in 
opposite order during the reverse propagation phase (last in, first out). In actual implementation, 
the stack is implemented as a vector instead, to allow reinterpretation. We call the combination 
of the adjoint stack and adjoint vector the tape. 














typedef std::vector<double> adjoint_vector; 
struct partial_edgef 

double partial_val; 

int target_idx; // entry of the adjoint vector which will be incremented 
}; 
typedef std::vector< std::vector<partial_edge> > partial_edges; 
void interpret_tape(const partial_edges& s, adjoint_vector& v){ 

for(int if— smswm@e()-1; i>=0;1--) 

for (congt #oautial_edge& p : slil) 
v[p.target_idx] += p.partial_val * vlil; 

t 


Listing 2.8: Conceptual implementation of the tape interpretation. 
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Figure 2.15: Storage of preaccumulated partial derivatives and the target index in the adjoint 
vector (left), and adjoint vector (right). The preaccumulated partials are used to increment 
the adjoints during the reverse sweep. 





For each assignment in the code (or in the SAC without preaccumulation), an entry on the 
stack will be generated, storing all partial edges corresponding to this assignment. A conceptual 
implementation of the tape interpretation is given in Listing 2.8, the corresponding implementation 
of the operators generating the tape entries is given in Appendix D. 





For a graphical representation of the tape created by the SAC from Figure 2.14, as well as 
the incrementation process, see Figure 2.15. Note that the position of the variables which need 
to be loaded from the adjoint vector in order to increment a specific adjoint are not necessarily 
close together. For example, a simple iterative program dependent on a static parameter would 
need to increment the adjoint of the parameter, stored at the first position of the adjoint vector, 
for each iteration of the loop. Furthermore, one adjoint of the iteration variable x needs to be 
incremented, which will be stored at some (decreasing) distance to the adjoint of the parameter. 
This leads to random access memory patterns (which for this simple example could be handled 
by multiple cache lines, but will fail for more complex iterations) and also makes it difficult to 
offload chunks of the adjoint vector to secondary storage, as entries in multiple chunks may need 
to be incremented for the adjoint of a single expression. 

The tape only stores the preaccumulated partials for a specific state of the primals. ‘To evaluate 
the adjoints for different primal values, the tape has to be newly recorded, requiring a full 
re-evaluation of the program. Tools which store the full SAC (e.g. ADOL-C or optionally in 
CoDiPack) will only evaluate the Jacobians during the reverse interpretation sweep, allowing to 
evaluate the adjoint model for different primal states without re-recording. This approach comes 
at the expense of significantly higher memory consumption and lower performance for a single 
evaluation of the adjoint model. In contrast, different adjoint seeds can be evaluated with an 
already recorded tape. 











An example tape representation directly exported from the internal dco/c++ data structure is 
shown in Figure 2.16. This tape is created by three iterations of the Babylonian iterative root 
finding algorithm. It calculates ,/a with a = 2 from an initial guess of « = 2 using the iteration 
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(1, 0.353542) 


(2, 9.61169e-05) 


(3, 0.0017301) 





Figure 2.16: Internal representation of the tape, generated by Babylonian root finding algorithm. 


procedure x = 0.5*(a/x+x). The edge labels are the partial derivatives stored in the tape, the 
vertex labels are a pair of the location of the adjoint in the adjoint vector and the value of the 
adjoint after the reverse propagation has finished. ‘The adjoint of the parameter a is located at 





the first entry in the adjoint vector and evaluates to 0.353542 ~ . The Babylonian root finding 
algorithm is discussed in further detail in Section 3.8.2. 


dco/c++ Adjoint Interface 


The basic operations dco: :derivative() and dco: :value(), already introduced for the tangent 
mode, still apply for the adjoint mode. Additional routines, corresponding to the management 
and interpretation of the tape, are needed. dco/c++ provides a global tape, which is accessible 
through the global_tape pointer at any point in the program. Specialized tapes with local 
scope can also be created, however this feature is not needed in the context of this thesis. The 
global tape needs to be allocated at the beginning of the program and is alive until it is explicitly 
destroyed or the program exits. 


#include <dco.hpp> 
int main(){ 
// allocation 
dco::gais<double>::global_tape = dco::gais<double>::tape_t::create(); 


some_calculation(); 
// destruct ien 


dco::gais<double>::tape_t::remove( dco::gais<double>::global_tape ); 


t 





If not otherwise specified, the tape is allocated as a chunk tape, meaning that the adjoint stack 
does not have a fixed size and grows on demand. New space for the stack is allocated in chunks, 
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that are not necessarily adjacent in memory. An alternative is the usage of a fixed size tape, 
which yields slightly higher performance at the expense of less flexibility. 

Variables corresponding to inputs, for which derivatives are desired, need to be registered in 
the tape. At this point they will be assigned a positive tape index, uniquely identifying them to 
avoid name aliasing. ‘The tape index can be used to look up the corresponding adjoint from the 
adjoint vector. 

The registration process helps to reduce the run time and memory overhead of the adjoint 
by performing an activity analysis |HNPO5|. That is, by default variables are considered to be 
passive and do not enter the DAG with any partials. A variable becomes active, once it gets 
assigned the result of an elemental function involving an active variable. 





dco::gais<double>::type x = 21; // x still passive, dco::tape_index(x)=0 

x = 2*x; // activity analysis: x=42, dco::tape_index(x)=0 
dco::gais<double>::tape->register_variable(x); // x active, dco::tape_index(x)=1 
x = 2*x; // aliasing of x, x=84, dco::tape_index(x)=2 





Once the independent variables have been registered, the primal calculation can be executed. 
The overloaded operators will populate the adjoint stack with the preaccumulated gradients and 
update the tape indices of the variables involved in the computations. 

The aforementioned concepts are illustrated in the general purpose driver shown in Listing 2.9, 
which calculates the gradient of a multivariate function f : R” —> R, equivalent to the tangent 
driver discussed earlier. 


fanmelude “dco cpp_dev/sre/dco. hpp “ 
#include <vector> 
using namespace std; 


typedef dco::gais<double> als_mode; 
typedef dco::gals<double>:: type als_type; 


template<typename T> T f(vector<T> x); 


vector<double> calc_grad_f_ais(const vector<double>& xd){ 
ais_mode::global_tape = ais_mode::tape_t::create(); 
const int n = xd.size(); 
vector<double> grad(n) ; 
vector<als_type> x(xd.begin(),xd.end()); // copy passive values 
ais_mode::global_tape->register_variable(x.begin() ,x.end()); 


ais_type y = f(x); // augmented primal calculation 
dco::derivative(y)=1; // seeding 
ais_mode::global_tape->interpret_adjoint(); // reverse propagation 


grad = dco::derivative(x); // extract derivatives of vector x 
ais_mode::tape_t::remove(ails_mode::global_tape) ; 
return grad; 


t 


Listing 2.9: Driver calculating the full gradient of a multivariate function f : R” — R, using 
adjoint mode of AD. 
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The tape persists the interpretation and can be re-evaluated with different adjoint seeds. 
Re-evaluating a previously recorded tape allows to calculate different adjoints for the same primal 
evaluation path, without the need to execute the augmented primal calculation. Due to the 
incremental nature of the adjoint propagation, it is usually necessary to zero all elements of the 
adjoint vector prior to seeding and propagating new adjoints. 


// zero all entries of the adjoint vector 
dco::gais<double>::global_tape->zero_adjoints() ; 


The contents of the tape can be reset without destroying the tape, which is more efficient than to 
destroy and re-allocate the tape repeatedly. 





// delete tape entries and corresponding adjoint vector 
dco::gais<double>::global_tape->reset_adjoints () ; 


For the tape routines interpret_adjoint, zero_adjoint and reset, corresponding routines 
that allow to operate only on a subset of the tape, exist. ‘The current position in the tape can be 
queried with get_position. 


dco::gais<double>::position_t to 

= dco::gais<double>::global_tape->get_position() ; 
// ao es the 
dco::gails<double>::position_t from 

= dco::gais<double>::global_tape->get_position() ; 


dco::gais<double>::global_tape ->interpret_adjoint_to(to) ; 
dco::gais<double>::global_tape ->interpret_adjoint_from_to(from,to) ; 
dco::gais<double>::global_tape ->zero_adjoint_to(to) ; 
dco::gais<double>:: global_tape ->zero_adjoint_from_to(from,to) ; 
dco::gais<double>::global_tape->reset_adjoint_to(to) ; 


Listing 2.10: Operations can be restricted to a part of the tape. 


2.8.3 Higher Order Derivative Models 


For higher order derivative models, the tangent and adjoint data types, as well as the access 
routines can be recursively nested. This will be demonstrated in Section 3.6, for now we will 
focus on first order data types. 


2.9 Gradient Based Optimization 


2.9.1 Steepest Descent 


One of the most intuitive and popular choices for gradient based optimization is the method 
of steepest descent. Here we focus on our CFD optimization setting with parameters a € R"¢, 
consisting of pre-processing, processing, and post-processing. ‘The chaining of the processing 
functions requires the use of the total derivative. From a current parameter state a’, the next 
state is determined by moving the state a distance \’ in the (negative) direction of the gradient 
of J w.r.t. a: 

ait! — a! =)! - Vo 7 (a‘) . 
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2.9 Gradient Based Optimization 


As the gradient of a function points in the direction of steepest ascent, a reduction in the cost 
function can be achieved by moving the state in the opposite direction. 


The optimal step size \’ can be chosen by performing a line search along the gradient direction 
and finding the A which minimizes the cost function: 


ve min J (a’ —A-VaJ (a’)) . 


This creates an additional optimization problem, albeit only for the scalar parameter A, requiring 
additional cost function evaluations as well as derivatives d7(a@’ — A-VaZ) : dA. As A isa 
scalar, the derivatives can conveniently be calculated with tangent mode or FD. 


To avoid the cost and complexity of calculating additional derivatives, a pragmatic approach is 
to choose a somewhat arbitrary value for A, ensuring that it improves the cost functional, that 
is J(a't) < J(a’). A popular implementation is the bisection algorithm. Starting from an 
initial guess, A is iteratively divided, until an improvement in the cost function is achieved. The 
bisection approach is illustrated in Algorithm 1. 


Algorithm 1: Bisection line search algorithm. 
Input: previous parameter state a’ 
Data: gradient VqJ, initial line search stepsize Astart 
Output: new parameter state a’t! 


1A A start ) 

2 while J(a’ —\VaJ) > J(a’) do 
3 | A+ /2; 

4 end 

sp avtle a' AV : 


For ||VaJ|| 4 0, the algorithm is guaranteed to terminate, as for lim)_,9+ a reduction must 
be achieved, else —-VWaqJ would not be a descent direction, contradicting the definition of the 
gradient. No additional derivatives are required for the bisection. ‘Thus, the additional cost 
function evaluations can be performed in passive mode, as the gradient d/ / da’ is considered 
fixed during the line search. 





To ensure convergence, the step size can be chosen such that the Wolfe conditions |Wol69] are 
fulfilled, adding further complexity to the line search algorithm. 

As an example, showcasing the limitations of steepest descent, Figure 2.17 illustrates the 
convergence of the steepest descent algorithm for the Rosenbrock function |Ros60| 


f(a,y) = (1 — 2)? — 100(y — 2°) 


with a fixed step size. Finding the optimum of the Rosenbrock function is notoriously difficult for 
gradient based optimization methods, due to the long valley with only a low gradient leading to 
the optimum. Consequently the steepest descent algorithm needs several thousand iterations to 
converge from the chosen starting position to the optimum at (1,1). 
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Figure 2.17: Iteration history of the steepest descent algorithm (green trajectory) and Newton’s 
algorithm (red trajectory), from starting point P = (—1,3) to the optimum at O = (1,1) of 
the Rosenbrock function. 


2.9.2 Newton's Method 


With the additional information of curvature available from the Hessian H = V7 f(x), an improved 
iteration called Newton’s method can be constructed. 


xitl = x! = (V? f(x'))~ ; V f(x’) 


=Z 


This method allows to find (local) minima characterized by V f(x) = 0. Due to the cost and 


stability issues involved in the explicit calculation of the inverse Hessian (Vv? f (x'))~*, in practice 
an equivalent two stage procedure is used, where the update step is calculated by the solution of 
a linear system. 


V7 f(x!) 2= V(x!) + 2= 5 (VF (x), V(x) 
xi x! 7 

For certain optimization problems, Newton’s method can dramatically reduce the number of 

iterations needed, outweighing the additional complexity required to obtain the second order 

derivative information. For example, the Rosenbrock example in Figure 2.17 only needs four 

iterations of Newton’s method to converge to machine precision, compared to thousands of 

iterations with steepest descent. 

Newton’s method can be shown to converge to the solution quadratically starting from a point 
within an interval to the solution. However, convergence is not necessarily guaranteed from all 
starting points |Deul1]. It is sometimes advisable to initialize the solution with a number of 
steepest descent iterations to get into the region of convergence of Newton’s method and then 
switch to the faster converging Newton’s method. 
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For problems of high dimension, where the assembly of the Hessian is considered too costly or 
complex, Quasi-Newton methods can be used. Quasi-Newton algorithms iteratively construct 
better approximations for the inverse Hessian, retaining some of the convergence properties of 
the Newton’s method. A popular Quasi-Newton method is the BFGS algorithm which has been 
implemented in multiple variations |Bro70; Fle70; Gol70; Sha70}. 


2.10 Discrete Adjoint Residual Approach 


Many researchers in the past already implemented an approach which only requires the residuals 
of the FVM discretization systems to be differentiated. Most just call it the discrete adjoint 
approach |Mav07; RU13; NJO7; He+18; Gil+-03]. To avoid confusion with the algorithmic 
discrete adjoint, applied to the whole non-linear iteration step, we will call it the discrete adjoint 
residual approach. 

For the usual (laminar) flow state in discretized form x = (U, p), convergence of the governing 
equations implies that the residual R of the Navier-Stokes equations is (near) zero 


R (x(q@),a) = 0. 
Thus, the total derivative of the residual can be expressed as 


dR OROx OR 
da axdal’ Jos (2.16) 


The sensitivity of the states w.r.t. the parameters can be calculated by transforming (2.16) into 


Ox (*) = (2.17) 


das \ ax} Oa’ 





The desired sensitivities d7/da@ can be calculated by inserting (2.17) into the total derivative 
for J: 


(2.18) 


da Ox\dx) da’ 


dJ(x(a),@) OF | OJ Ox _ AJ AT (AR “OR 
da 0a Oxda Oda Ox 


Defining 


* Ox \ Ox 


one can eliminate the inverse operation from (2.18) by solving the linear equation system 


aR\* aq\* 
SS b= (5) 


for Ax and substituting the result into (2.18) 


ar oF (oR) 





(2.19) 


dT (x(a),a) OF — (oR) 
da ~ Oa Oa ian 


When using the adjoint mode of AD, the (matrix-vector) product (OR./da)* -Ax can be evaluated 
without explicitly calculating the Jacobian OR/0a, by exploiting that the adjoint model allows 
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to calculate the projection of the transposed Jacobian in an arbitrary direction. Therefore, Ax 
can be chosen as a seed direction. ‘The adjoint model then calculates the desired matrix-vector 
product. The explicit dependence 07 /Oq@ of the cost function on the parameters is often zero, 
or else can be calculated cheaply with one evaluation of the adjoint model. The costly operations 
are therefore the calculation of the Jacobian of the residual w.r.t. the state OR/Ox, and the 
solution of the linear equation system. 

The linear system can potentially be solved using a iterative matrix free solver (e.g. BiCG). 
Matrix free solvers, in contrast to regular solvers, only require a mean to evaluate matrix vector 
products of the matrix with arbitrary vectors. AD allows the efficient evaluation of the (transposed) 
Jacobian vector product. Using a matrix free solver the full Jacobian neither has to be evaluated, 
nor stored. However, due to the poor condition of the linear system, in practice matrix free 
solvers are usually not a feasible option. Using matrix free solvers, the problem can not be 
effectively preconditioned and will not converge well. Therefore, the full (sparse) Jacobian has to 
be calculated, either in adjoint or tangent mode, or using FD. 

Note, that some authors calculate the Jacobian with FD and still call their implementation 
discrete adjoint, in reference to the adjoint equations (e.g. [He+18]). 

To effectively calculate the sparse Jacobian OR/Ox, coloring techniques can be used, lowering 
the complexity from O(n), to O(d), where d is the maximum number of cells which influence 
the discretization of a cell, that is 








J Ox; 
=0 


Coloring techniques are presented in the following section. 
The discrete adjoint residual approach has been implemented in the discrete adjoint OpenFOAM 
framework, along with the necessary coloring heuristics. Details are presented in Section 4.5. 














2.11 Matrix Coloring 


2.11.1 Jacobian Compression 


The calculation of sparse Jacobians and Hessians can be considerably sped up by exploiting the 
sparsity and orthogonality of rows and/or columns. 


Definition 13 (Structural Orthogonality). 
Two vectors v,w € R” are structurally orthogonal if and only if at each index at least one of the 


vectors is zero, that is 
n—1 


S (vjw;)? = (0. 

i=0 
The tangent model computes the multiplication of the Jacobian J = Vf in a direction x, the 
adjoint model multiplications of the transposed Jacobian in direction x. If multiple columns of 
the Jacobians are pairwise structurally orthogonal, all non-zero entries of those columns can be 
computed with one evaluation of the tangent model, by superimposing the unit vectors which 
would be used to calculate the corresponding columns. 

Similarly all non-zero entries of structurally orthogonal rows can be computed with one 

evaluation of the adjoint model. 


08 


2.11 Matrix Coloring 


Definition 14 (Jacobian column compression). 

Let J be the Jacobian of a function f : R” > R™, with associated row and column index sets LT 
and J. Let Jo C JI be a set of columns indices for which the columns of the Jacobians are 
pairwise structurally orthogonal, that is 


Jij » dijo =0 Wed, niJ2E€ Jon F ja - 


Thenv = J: $32 Jo e:) oe Jo A&i contains all non-zero elements of the columns corresponding 
to the indices Jo. 


Definition 15 (Jacobian coloring). 

A Jacobian coloring groups rows/columns of the Jacobian into sets of structurally orthogonal 
rows/columns. We refer to those sets as colors of the Jacobian. Each row/column is assigned a 
color, and is thus included in exactly one set. 


Definition 16 (Row Seed Matrix). 

A row seed matrix is a matric Sp € {0,1}°%" with SX) S;; = 1 for alli = 0,...,¢. The 
individual rows of the seed matrix Spr can be used as seed vectors for the adjoint model of AD, to 
compute a row of the compressed Jacobian Jo = Sr- J € R&*”. 


Definition 17 (Column Seed Matrix). 

A column seed matrix is a matrix Se € {0,1}™** with = Se =, ora G =O aces Te 
individual columns of the seed matrix Se can be used as seed vectors for the tangent model of AD, 
to compute a column of the compressed Jacobian Jg = J- Se € R™*°. 

Definition 18 (Bipartite row/column graph of a matrix). 

The bipartite graph of a matrix A is defined as G = ((R,C), FE). Each row of the matrix A 
corresponds to a vertex r;, in the set R. Each column of the matriz A corresponds to a verter c; in 
the set C. An element r; of the set R is connected to an element c; in C by and edge (r;,c;) € E 
if and only if the matriz entry Aj; 1s non-zero. 


The bipartite graph of a matrix can be partially or fully colored to obtain a feasible Jacobian 
coloring [GMP05]. 
We will briefly present the previous concepts on the following example matrix A € R*™%* with 
8 non-zero entries. 
Q00. 606 (Q01 0 0 
A 0 Q11 0 Q13 
0 0 a22 4923 
Q30 0 0392 0 


A possible column coloring for the example is C = {{co,c3}, {c1, c2}}, a possible row coloring is 
R = {{ro,r2},{r1,7r3}}. The resulting seed matrices are 


1 0 1 O 
and sa=|j 1 0 if 
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Figure 2.18: Bipartite graph representation of non-zero pattern of matrix A, partial distance-two 
row coloring, partial distance-two column coloring. 


The resulting compressed Jacobians, from which all nonzero entries can be reconstructed, are: 


aoo 01 

a13 11 a00 G01 422 423 
Ac = ASc = and Ar =SRA= 

a23 22 a30 Q11 432 13 

a30 432 


The bipartite graph for the matrix, as well as the feasible partial row and column colorings, given 
above, are shown in Figure 2.18. 


Lemma 1. 
A valid partial distance two row coloring on the bipartite graph solves the row compression problem, 
allowing to directly recover the nonzero entries from the compressed Jacobian [GMPO5]. 
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3 Differentiation of Complex Iterative CFD Algorithms 


In this chapter the differentiation of complex iterative CFD algorithms is discussed. Particular 
emphasis is placed on the efficient implementation of the developed discrete adjoint methods in 
OpenFOAM. First we will focus on a black-box application of AD and then later implement 
various Improvements. 





3.1 Foundations 


3.1.1 Optimization Problem 


In the following we define the general optimization problem motivating most of this thesis. 
We consider a general set of parameters y € R?. For topology optimization, we call this 
set y = a € R”¢, for shape optimization y = GB € R”Fr. 

Using these parameters, we define an optimization problem over a calculation consisting of 
three distinct phases: 


Pre-processing P: Initialize the solver and create an initial state x° € R”™. 
Processing F: Solve the underlying ODEs (iteratively), to obtain final state(s). 
Post-Processing 7: Create a scalar output from the final state(s) by evaluating a cost function. 


The full chain y = 70 FoP needs to be differentiated in order to obtain the gradient se. 
Optimization can then be applied to +y, in order to alter the solution and improve the cost 
function output, as calculated by the post-processor. ‘To keep parameters in a feasible range, 
we assume box constraints for the individual parameters 7;. In practice these can be commonly 
replaced by a global lower and upper bound. 


minimize (7) 
_ 
subject to §;<y<u;,7=0,...,p—1. 


Depending on the region of application, different combinations of unsteadiness are possible. We 
introduce feasible definitions for the functions P,*, and 7 for varying degrees of unsteadiness 
below. Ducted flows are often laminar and steady, while external aerodynamic flows often exhibit 
varying levels of transient effects. 


Steady Data, Steady Parameters 


This is the most common case for optimization of flows. It assumes that the simulation converges 
to a steady state. We denote the final state, reached after k iterations, as x* to emphasize the 
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fixed point nature of the iteration. 


P : RP + R™ x! = P(y) 
F:R™ x IR? —.R"* x" = F(x", y) = f'(x* 1 4) O... o f?(x', +) o f(x’, 7) 
T:R™*xR? OR y= J x4) 


For transient problems, the formulation of the parameters and cost functions can incorporate 
varying levels of unsteadiness. 


Transient Data, Steady Parameters 


The scalar cost function depends on states from potentially all & time steps, but the parameter 
set is kept constant during the time iteration. Such a formulation is e.g. useful to get a mean 
value of an oscillating phenomenon, using the parameters to reduce the fluctuations. 


P : RP + R™ x” = P(y) 
F :R™ x RP > Rk (x",...,%°) = F(x®, y) pf" x 7) 0... 0 f7(x", 9) 
TJ R™** x RP OR Ce on 


Transient Data, Transient Parameters 


As before, the scalar cost function depends on states from potentially all k time steps, but the 
parameter set is allowed to change between time steps, expanding it from -y € R? to R?**. 





P:R? > R™ x” = P(y) 
FR x RPX* 5 RMX Oi, 9x") = fh (x® ty") 0...0 ft (x? 4) 
TR ™** & RPX’ RR y = JI(x",...,xt,7",...,77) 


3.1.2 Reference Cases 


In the following sections we will discuss the basic concept of AD in the context of CFD solvers. 
To illustrate the implementation of those concepts, they are introduced to the OpenFOAM CFD 
framework. Whenever we need a practical example we will refer to the following two test cases, 
which are well suited for topology optimization. 


Angled Duct 


The first test case is a 2D geometry of a 90 degree bend, depicted in Figure 3.1. The geometry is 
stated dimensionless with a characteristic length of L. 

Two pipes of diameter L and length 2L are connected to a square of edge length 3L. The flow 
enters from the lower left and leaves on the top right. A constant velocity profile is prescribed at 
the inlet, a zero gradient condition is applied at the outlet. ‘The outlet pressure is fixed to zero 
with a zero gradient condition at the inlet. Walls are modeled as no-slip. A vortex forms in the 
upper left corner. ‘The vortex is induced by the shear forces, created by the fluid which flows 
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OL 


4L 


2L 





Figure 3.1: Geometry of the angled duct case. Inflow on the lower left, outflow on the top right. 
Blocks of the structured mesh are indicated in light gray. ‘The geometry is symmetric along 
the dashed diagonal line. 


from the lower pipe into the upward facing pipe. ‘This vortex region is an obvious location for 
optimization, aiming to reduce the power loss in the system. For higher flow velocity, a second 
vortex region forms in the lower right corner. A structured mesh is created by blockMesh (see 
also Section 4.6). It can thus easily be scaled to different mesh resolutions. 

At low Reynolds numbers, the use of a turbulence model is optional. For higher Reynolds 
numbers, the k-w turbulence model is used. 

Different mesh refinement levels, starting from the coarsest possible mesh for this geometry, up 
to over 200 000 cells, are listed in Table 3.1. The mesh levels below refinement level 15 are hardly 
useful to obtain a realistic solution. However, they might be used to inspect sparsity patterns and 
the discretization. The corresponding blockMeshDict.m4 file is listed in Appendix C.10. The 
mesh resolution can be controlled by a GNU m4 parameter, creating the final blockMeshDict for 
the mesher. If not otherwise specified a refinement level of 30, resulting in nc = 11700 cells, 
is used. 





Table 3.1: Refinement levels of the angled duct case. Level 1 is the coarsest possible mesh, finer 
meshes are obtained by uniformly refining the block edges. 


Level NC Level NC 
1 13 15 2925 
2 52 30 11 700 


D 329 60 46 800 
10 1300 120. = 187200 
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Figure 3.2: Dimensions of the Pitz-Daily example. Inflow on the left, outflow on the right. 
Dimensions in mm. 
















Figure 3.3: Geometry of the Pitz-Daily test case, consisting of 13 blocks (bold black lines) 
meshed with nc = 12 250 cells (gray lines). 


Pitz-Daily Case 


The second case is the flow over a backward facing step. ‘This problem, or variations of it, have 
been extensively studied and used for verification in a variety of disciplines, in particular for 
the verification of turbulence models [Rum12|. This particular geometry was first introduced 
in [PD83] and is consequently referred to as the Pitz-Daily case. It is included as a standard 
tutorial and verification case in OpenFOAM. The geometry is shown in Figure 3.2. The flow 
enters from the left, passes the step and leaves the domain through the outlet at the end of the 
nozzle shaped exit. 





At the inflow a constant velocity of 10m/s is prescribed (Dirichlet condition) with a zero 
gradient condition on the pressure (Neumann condition). At the outflow a zero pressure, as 
well as a zero velocity gradient condition, are applied. At the walls a no-slip condition is used. 
The boundary conditions for the turbulence are calculated using the kqRWallFunction for the 
turbulent kinetic energy k, and epsilonWallFunction for the turbulence dissipation rate ¢. ‘The 
resulting Reynolds number of Re = 25400, calculated width the inlet width as the reference 
length, as well as the solution singularity expected at the step, makes the use of a turbulence 
model advisable. Else an accurate and steady solution will not be obtained on coarse meshes. 
The mesh is again obtained by blockMesh, using the parameters provided by OpenFOAM. The 
mesh is slightly graded towards the walls, to obtain better turbulence resolution. A planar view 
of the mesh for the configuration with ng = 12250 cells is shown in Figure 3.3. 











3.1.3 Power Loss Cost Function 


For both reference cases, we utilize the power loss, induced by the pressure loss inside the system, 
as the objective. Other cost functions are possible, e.g. lift and drag are defined in Chapter 5. 
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The power loss is calculated as the difference between the (total) pressure integral on the inlet 
and outlet boundaries of the flow domain, multiplied by the flow velocity: 


z=] - (p+ lu?) u-nar+ | 
[, 2 I 


inlet 
The integrals have opposite signs, as the scalar product u-n yields a positive flux for flow 
entering the domain and negative flux for flow leaving the domain (normals are defined to be 
facing inwards). 
On the discretized result the integrals become sums over the boundary patches. As the flux @ 
is defined on the cell faces, they are used instead of the cell centered velocities, which would need 
to be interpolated to the faces first. 


c= », -és(or+5 (4) J + » -és(or+5 ($4) J. 


FEFT tet FEF othe 


1 
= (>+ sllul?) u-nd. 


outlet 








Like in the integral formulation, the results of the two sums have different signs, due to the 
different sign of the fluxes @ for inlet and outlet faces. 
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3.2.1 Motivation for Application of Tool Driven AD 


Software in general and especially in computational engineering can be very complex. The 
growth of complexity in software systems is believed to be exponential in time |Leh96; Dvo09], 
corresponding to the exponentially growing capabilities of computing hardware, according to 
Moore’s Law |Sch97]. Specifically, a general purpose tool like OpenFOAM, which is not tailored 
to a very specific and narrow domain and use case, will require a very large and complex code 
base. For illustration, refer to Figure 3.4, which shows the complex linkage pattern between the 
elemental OpenFOAM libraries. If one wants to retain the scope and broad applicability of the 
primal software, then as much as possible of the primal code base has to be covered by AD. 

Table 3.2 shows the growth of the OpenFOAM code base during the last few releases. ‘The 
developers of primal OpenFOAM have recently adopted a biannual release cycle. ‘This makes an 
efficient upgrade path, which brings the adjoint version quickly up to date with the primal version, 
paramount. The code base is distributed over thousands of code files, the logic abstracted behind 
many layers of class inheritance, templatization, and function macros. ‘This, as well as the usage 
of advanced C++ language features, makes the application of current source code transformation 
tools only applicable to limited numerical kernels. In order to cover the whole package by AD, 
the application of an operator overloading tool is the only viable approach at this point. 

The number of changes required to the code base for the application of operator overloading 
using dco/c++ is summarized in Table 3.3. The changes are broken down to the sub libraries 
contained in the OpenFOAM src/ directory. As can be expected, the number of changes roughly 
correlates to the amount of code included in the specific libraries, with the OpenFOAM core library 
and the finiteVolume library receiving most changes. ‘The high number of changes in the 
Pstream library stems from the addition of the AMPI library sources and not from major changes 
in the existing code base. ‘The specifics of the code changes required are addressed in Section 3.2.3. 
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Figure 3.4: Library dependency within the OpenFOAM framework of the simpleFoam solver 
(blue node). Blue edges indicate libraries directly linked to the solver, gray edges linkages 


within the OpenFOAM framework. 


Table 3.2: Growth of OpenFOAM code base from version 3.0 to 17.06-plus. src/ contains 
the OpenFOAM libraries, applications/ contains the individual tools and solvers (e.g. 


simpleFoam). 


3.0 C++ 
C++ Header 


3.0-plus C++ 
C++ Header 


16.12-plus C++ 
C++ Header 


17.06-plus C++ 
C++ Header 
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src/ 
files LOC 
2878 439k 
3118 201k 
2995 457k 
3218 208k 
3188 498k 
3423 225k 
3281 515k 
3529 231k 


applications/ 
files LOC 
884 133k 
908 ATk 
909 141k 
949 A8k 
879 132k 
971 ATk 
895 134k 
1023 A8k 


3.2 Differentiation of Complex Simulation Software 


Table 3.3: Comparison of number of files and lines of code (LOC) between stock OpenFOAM 
and discrete adjoint OpenFOAM. 


Library Name Identical Files Changed Files Identical LOC Changed LOC 
OpenFOAM 1501 102 173596 768 
finiteVolume 1103 12 81077 300 
mesh Tools 389 7 61060 71 
lagrangian 673 22 58431 61 
dynamicMesh 219 7 98421 17 
thermophysicalModels 590 8 59950 30 
mesh 148 4 37218 5 
TurbulenceModels 236 19 28052 67 
sampling 157 5) 27274 30 
functionObjects 254 5 26490 11 
conversion 68 1 14763 1 
regionModels 146 5 13845 8 
surfMesh 95 i 9609 2 
parallel 49 4 9287 10 
fvOptions 98 2 7900 4 
fileFormats 83 3 7100 11 
fv MotionSolver 19 1 6227 3 
edgeMesh 39 0 5463 0 
rigidBodyDynamics 94 0 A773 0 
sixDoFRigidBody Motion D3 0 3684 0 
triSurface 31 2 3026 10 
OSspecific 34 0 3350 0 
combustion Models 53 4 Sool ) 
ODE 35 1 2974 3 
transportModels 44 1 2720 1 
randomProcesses 28 2 2221: 2 
dynamicFvMesh 17 i! 1802 1 
renumber 19 0 1595 0 
engine 29 1 1540 3 
topoChangerFvMesh 15 0 1532 0 
genericPatchFields 8 0 1469 0 
Pstream Z 19 1208 3259 
fvAgglomerationMethods 6 0 999 0 
rigidBody MeshMotion 4 0 604 0 
regionCoupled 2 0 ATA 0 
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3.2.2 Differentiating a Complete Steady Iteration History 


Without assuming any additional knowledge about the intrinsics of CFD and iterative solution 
methods, one can tackle the task of calculating the gradient of a cost function 7(x):R” >~ R 
w.r.t. the parameters 7 by algorithmically differentiating the whole computer program, which 
implements the calculation of x and 7. This is commonly called black-box approach |Nau12], as 
no knowledge about the inner working of the model is required, as long as the inputs, outputs, 
and parameters of the model are well defined. ‘The black-box approach assumes differentiability of 
the implementation at all locations of interest. With FD one can calculate such a gradient, even 
without having access to the source code implementing the model, by perturbing the parameters 
and observing the change of the outputs. ‘This is especially useful when differentiating through 
functions which are only available as pre-compiled libraries (e.g. due to licensing and intellectual 
property concerns). 

In the following, we assume the parameters ~y to be the momentum penalty terms a of topology 
optimization, as this will be used in the illustrative examples. ‘The same statements apply virtually 
unchanged for the general case 7 € R?. Further, we assume the case to converge to a steady 
solution, as introduced in Section 3.1.1. The final state of the problem is obtained by repeatedly 
applying functions f’(x’~!, a) : R"**"c — R” to the initial state, which models the propagation 
of physical quantities in time. 

For steady problems, the state converges to a fixed point x* at which point the variation Ox /Ot 
of the solution over time is zero, even if calculated with a transient solver. The series of functions 
f’ also converges to a function f*(x*) = limp... f*(x*). For black-box differentiation, these 
fixed point properties are not exploited, however they will become important when using reverse 
accumulation and piggybacking in later sections. 

The gradient of the cost function y = J(x") after k iterations w.r.t @ is defined by the chain 
rule as: 





k; 


aT _ > aT afi \ aft , ag 


Oxk | OrI-1 | Oa Oa’ 





(3.1) 


where f° = P(q@) is the preprocessing step which creates the initial state x°. For k > oo, this 
gradient is the desired gradient of the fixed point: 


i a gy, AF(x*) 4, dT (x") 
ee a a 
Figure 3.5 shows the DAG calculating y from x° and @ by applying iteration steps f', f?, 
and f%. This iteration will more than likely not fully converge to a fixed point, however the 
derivatives of this partially converged state can still be evaluated according to Equation (3.1). 
The same expression can be constructed from the DAG by summing up the multiplication of 
partial derivatives along all paths in the DAG which connect @ to y. The summation on the right 
of Figure 3.5 illustrates this procedure for the given three iteration example. The sum is written 
to illustrate the pathwise summation, obviously it could be calculated with less operations by 
using distributivity. 
For the converged solution x* of a steady case, the value of the final state does not depend on 
the chosen starting point x° anymore. (At least in the sense that dy ‘| Ox = 0. For non-convex 
spaces, the solution might converge to a local minimum, the location of which depends on the 
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Figure 3.5: DAG of a three step iteration. The derivative d7/da@ can be determined from the 
graph by multiplying and summing over paths from a to y. 


starting point x9. In the following we assume that a unique solution exists and that it can be 
found by solving the Navier-Stokes equations iteratively from an arbitrary starting point x°). 

The vanishing influence of x° can be derived from the following argument: If the series of 
functions f’ is contractive (it converges to a fixed point), then the norm of the individual 
Jacobians Of? 7 Ox’! is lower than one for any matrix norm: ] Of? / Ox ] <1. The norm of 
the full derivative accumulated by the chain rule is therefore zero in the limit case: 





k; 






































Jim, | LL 5x1 || S (max | il) “== 0. 
<1 


The limit still holds if not every function is contractive, but the number of non-contractive 
functions is finite. 

With the same argument it can be shown that dJ/da@ converges to a fixed point. Assuming 
convergence of the primal state, both terms 07 , Ox" and Of? / Oa in (3.1) are bounded. Let us 
denote these bounded terms as cx and cq. Then dJ//da@ is bounded by 


mae 
fia < [e035 Ha 


2=0 9=74+1 





With max; ] Of? / Ox’! ] < 1 the sum is bounded by a geometric series, which is known to 
converge to a finite limit for k > oo. 
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Figure 3.6: Iteration history of the primal and adjoint residuals for the fully converged angled 
duct case, evaluated over 1 000 iteration steps. 





The independence on the starting point implies, that any errors made in the early stages of 
the procedure do not influence the outcome of the simulation x*, as long as the iteration scheme 
is robust enough to still converge to the correct solution x* for the perturbed trajectory. ‘The 
adjoints will accumulate some errors, due to the addition of the partial derivatives along all paths 
from a@ to y, however those errors are minor, due to dy A dx’ < 1 for the early stages of iteration. 
This allows to save some iteration time, by allowing the residuals of the inner linear systems to be 
higher for iterations which are still distant from the solution, therefore needing less linear solver 
iterations. In OpenFOAM this concept is called relative tolerance. Relative tolerance specifies 
that instead of solving to a specified absolute tolerance, the equations are solved such that the 
final residual is reduced by a specified factor from the initial residual. ‘The initial residual will 
shrink as the outer iteration progresses towards the solution x”. 

The evolution of adjoint x can be used as a convergence criterion, which indicates that the 
adjoint & has converged (alternatively also the absolute change in @ can be observed directly). 
The adjoint x holds the derivative 07 r Ox’ as the adjoint propagation steps backward through 
the iteration loop from 2=k toi = 1. 

Figure 3.6 shows how d/ ; dx’ and the increments to @& shrink, as the interpretation progresses 
backwards through the iteration history. For reference, the residual of the (forward) pressure 
equation is shown as well. The change in sensitivity after 700 iterations occasionally falls below 
the output precision and is rounded to zero, breaking the logarithmic scale of the figure. ‘Thus, 
this curve is omitted after 700 iterations. ‘The results were obtained on the angled duct case 
shown in Section 3.1.2. 

In Figure 3.7 the iteration procedure is stopped with only partially converged primals. When 
stopping the iteration after only 50 iterations, the residual x° drops only by a factor of 20, 
indicating that the solution is not yet sufficiently independent of the starting value. This results 
in an offset in the adjoints compared to the fully converged case. 
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Figure 3.7: Iteration history of the normalized adjoint residual x (left) and the corresponding 
sum of sensitivities (right). Early termination of the primal iteration leads to high adjoint 
residual and wrong adjoint sensitivities. 


3.2.3 Introduction of AD into the OpenFOAM Code Base 


In order to implement AD by operator overloading, all floating point variables which lie on a path 
in the computational graph between inputs and outputs of the program, have to be replaced with 
an instance of a different data type, that allows to track the derivatives as well as the primals 
(see Section 2.7.2). For our goal of differentiating a big software package, such as OpenFOAM, it 
is hard to know in advance if a variable will influence the derivative of the desired output variables 
in any way. Furthermore, OpenFOAM, despite being a highly sophisticated C++ code, that heavily 
uses templating, is not designed for multiple scalar types to coexist at the same time. ‘Thus, we 
chose to exchange the type of all floating point values with a dco/c++ data type. OpenFOAM allows 
to choose the floating point datatype between double and float. This is enabled by the following 
central typedef in src/OpenFOAM/primitives/Scalar/doubleScalar/doubleScalar.H, which 
allows to exchange the data type for all floating point variables. 








typedef double doublesScalar; 


At this location we can insert the dco/c++ data type, for example for first order adjoint mode: 


typedef dco::gais<double>::type doublescalar; 


and for first order tangent mode: 


typedef dco::gtis<double>::type doubleScalar; 


The definitions for higher order data types are listed in Section 3.6. 
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In addition to the type change, certain common modifications to the code are required to 
circumvent some limitations of the C++ language. One occurrence is the implicit casting of floating 
point values to integer types. ‘Though considered to be bad style, this is permissible by the C++ 
standard and is used in the OpenFOAM code occasionally. There is no default implicit cast from 
the non-primitive dco/c++ data types to primitive types and dco/c++ also does not provide them 
by copy constructors or assignment operators, due to the potential of inadvertent (derivative) 
data loss. ‘Thus, the passive floating point value needs to be extracted from the dco/c++ data 
type manually first, and can subsequently be implicitly or explicitly cast to an integer. 


dco::gais<double>::type x = 42; 
ji favs ink Hx 
int i = static_cast<int>( dco::passive_value(x) ); 


The C++ implicit conversion rules allow for the following conversions in that order: 


1. Zero or one standard conversion sequence, 





2. zero or one user-defined conversion, 
3. and zero or one standard conversion sequence. 


This can become problematic, when the chain of type conversions becomes elongated by 
the additional complexity of the dco/c++ data types. For example, the conversion chain 
double -> Foam::scalarField -> Foam: :dimensionedScalarField is permissible. However, 
the chain double -> scalar -> Foam::scalarField -> Foam::dimensionedScalarField is 








not, because the user-defined conversion is already used up by the implicit conversion from the 
double literal to the (dco/c++) scalar type and is not available for the conversion from scalarField 
to dimensionedScalarField anymore. Thus, the compiler has to be explicitly told that the 
double literal is supposed to be a scalar. This problem, and its solution, is also illustrated in the 
following listing. One can construct an instance of class B from a primitive double data type 
with implicit conversion double -> A -> B, however the conversion double -> ScalarType -> 
A -> B is not possible. 


class ScalarTypef 

double val; 

ScalarType (double val) : val(val){} 
+; typedef ScalarType scalar; 


class Af A(scalar d)q} 
class B{ BCA ajdh }; 


int main(){ 
I oles B(42.0); with scalar=double 
// fails: B(42].0)Q\with scalar=ScalarType 
B(ScalarType (42.0)); // correct for both types 
} 


Listing 3.1: Example code for failing implicit type conversion. The literal 42.0 has to be passed 
as ScalarType to the constructor of B explicitly. 
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In our code base the transformation of implicit to explicit conversions was done by hand. 
There are tools which strive to identify and fix those issues by automatic source code transforma- 
tion |HUB16]. However, the result should still be checked manually, such that no inadvertent 
loss of derivative information occurs. When deriving new software with AD in mind, explicit 
type conversions should be used as much as possible, to avoid type ambiguity and overlong type 
conversion chains. 


Non-Differentiable Functions within OpenFOAM 


The only function encountered within the OpenFOAM code base which triggers a floating point 
exception, due to evaluation at a non differentiable point, is the square root function at location 
zero. Evaluating the square root function at location zero leads to a division by zero when 


evaluating the derivative. 
Oy 1 
=f => == 4 
Ue Ox 2/x 
This case is not commonly triggered but e.g. occurs if and only if calculating the Lo norm of a 
vector of size zero. ‘To sidestep this problem, we replace the derivative of the square root function 
by a version which adds a small « > 0 to the independent when the non differentiable point of 


the square root is hit. 





if 
an ae a | r>0 
¥ 2./x+e t=0 


Thus, at x = 0 the derivative evaluates to a very high number, mimicking the behavior of FD 
and approximating the right-sided limit of the square root function: 


1 
mn pe — = 0O 
xr—0T ZA) 
Similar approaches are also taken by the primal OpenFOAM implementation to sidestep issues in 
the evaluation of turbulence functions. We have not observed any influence of the final sensitivities 
on the choice of €, leading us to believe that either the adjoints get propagated only into branches 


of the DAG which are not actually connected to the parameters, or that at least one of the partial 
derivatives on the path, connecting the parameters to the output, containing the calculation of 


V0, is zero. 


3.2.4 Black-Box Differentiation of simpleFoam Solver 





By introducing AD to the computational kernels of OpenFOAM, adjoint solvers can be developed 
on a high abstraction level. No particular insight into the low level implementation is required. 





As an example the implementation of adjointSimpleFoam will be presented. This solver is 
based on the regular OpenFOAM steady, incompressible, simpleFoam solver. It enables the 
calculations of gradients w.r.t. the design parameters a@ required for topology optimization. The 
simpleFoam solver implements the SIMPLE algorithm, as presented in Section 2.3. Further it also 
implements the faster converging SIMPLEC |VR84] algorithm, which can be enabled optionally. 
In the following a brief overview over the passive solver is given, so that the changes required for 
the adjoint versions become obvious. 
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The simpleFoam solver implements the iterations of the SIMPLE algorithm using a while loop, 
which will run until the maximum allowed iterations have been performed or previously specified 
convergence criteria have been met. Inside the loop body, the equation systems linearizing the 
Navier-Stokes equations are assembled and solved. ‘The momentum equations are implemented 
in the external file UEgn.H, the pressure correction equations in pEqn.H. ‘Those files are inlined 
by the C-preprocessor into the loop body at compile time (include directives inside function 
declarations are rarely used in C/C++ code, but commonly occur within the OpenFOAM code). 


int main(){ 
#include "createMesh.H" 
#include "createFields.H" 


while (simple.loop()){ 
// --- Pressure-velocity SIMPLE corrector 
#include "UEqn.H" 
#include "pEqn.H" 


laminarTransport.correct() ; 
turbulence ->correct(); 


runTime.write(); 


Listing 3.2: Implementation of the main iteration loop of simpleFoam. 


The implementation of the momentum equations (Listing 3.3) showcases the high abstraction 
level and object oriented design of OpenFOAM. ‘The differential operators occurring in the 
Navier-Stokes equations (compare to Section 2.1) are directly visible in the assembly of the 
system matrix UEqn. The divergence operator V - (@u) is discretized by fvm::div(phi, U), the 
Laplacian vV7u by fvm::laplacian(nu, U). 

The right hand side consisting of the pressure gradient Vp is build by -fvc::grad(p). Note, 
that this is a simplified implementation for laminar flows. The actual implementation includes 
additional terms to model turbulence, which we omit here for clarity. 


fvVectorMatrix UEqn 
( 
fvm::div(phi, U) 

- fvm::laplacian(nu, U) 

= five:: ¢rad(p) 
); 
UEqgn ().relax ()¥ 
fvOptions.constrain(UEqn () ); 
UEqn.solve() ; 
fvOptions.correct (U) ; 


Listing 3.3: Implementation of the momentum equations in UEqn.H. 


The pressure correction equations, as well as the momentum correction, are implemented in pEgn .H, 
shown in Listing 3.4. The notation is not as intuitive as for the momentum equations, however 
the general structure of the pressure correction equation is still visible in Line 11 of the listing. 
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volScalarField rAU(1.0/UEqn().A()); 
volVectorField HbyA(C"HbyA", U); 
HbyA = rAU*UEqn().H(); 


surfaceScalarField phiHbyA("phiHbyA", fvc::interpolate(HbyA) & mesh.Sf()); 
MRF .makeRelative(phiHbyA) ; 
adjustPhi(phiHbyA, U, p); 


fvScalarMatrix pEqn 


( 
fvm::laplacian(rAU(), p) == fvc::div(phiHbyA) 
ee 


pEgqn.setReference(pRefCell, pRefValue) ; 
pEqn.solve(); 


phi = phiHbyA - pEqn.flux(); 


pe relax ©), 

// Momentum corrector 

U = HbyA - rAtU()*fvc::grad(p) ; 
U.correctBoundaryConditions () ; 
fvOptions.correct (VU); 


Listing 3.4: Implementation of the pressure correction equation in pEqn.H. 


The penalty term for the topology design parameters is introduced to the solver by adding 
the component-wise product of a with the velocities U as a source to the momentum equa- 
tions. This is implemented by modifying the entries on the diagonal of the system matrix 
with fvm::Sp(alpha, U) (Listing 3.5). The function fvm::Sp accepts an implicit source term 
with positive contribution; thus the introduction of @ is not lowering the diagonal dominance of the 
system matrix. Alternatively, the resistance term could also be implemented by adding the term to 
the right hand side of the equation system as an explicit source term with -fvc::Su(alpha, U). 


fvVectorMatrix UEqn 
( 
fvm::div(phi, U) 
- fvm::laplacian(nu, U) 
+ fym::Sp (alpha , .U) 


- fvce::grad(p) 
Ne 


Listing 3.5: Implementation of the source term in adjointSimpleFoam. 


For the black-box differentiation through the whole SIMPLE iteration history, no additional 
changes to the main iteration loop are required. Next, the steps required to seed and obtain 
the adjoints are added. Before entering the main loop, the tape data structure of dco/c++ is 
initialized. ‘The parameters @ are registered as inputs, allowing dco/c++ to optimize the tape 
by applying varied analysis |HNPO05|. The tape is then recorded while executing the iteration 
loop. After the loop finishes, the value of the objective 7 is calculated. The calculation of the 
objective function is implemented in a separate library which allows to calculate basic objective 
functions like power loss, drag, and lift. The adjoint of the objective is then seeded with one (the 
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1D unit vector, for parallel processing only on the root node) and the adjoint reverse propagation 
process is started. After the propagation has finished, the desired gradient of the objective w.r.t. 
the parameters @ is available in the adjoints of alpha and can be extracted and written to the 
sensitivity field sens. 


int main(){ 
#include "createMesh.H" 
#include "createFields.H" 


dco::ais::global_tape = dco::ais::tape_t::create(); 
dco::ais::global_tape->register_variable(alpha.begin() ,alpha.end()); 


while (simple.loop()){ 
// Pressure-velocity SIMPLE corrector 
#include "UEqn.H" 
#include "pEqn.H" 


laminarTransport.correct(); 
turbulence ->correct (); 


runTime.write(); 


} 
scalar J = CostFunction(mesh).eval(); 


if (Pstream::master ()) 
dco::derivative(J)=1; 
dco::ais::global_tape->interpret_adjoint () ; 


forAll (alpha ,i) 
sens[i] = dco::derivative(alphalLli]); 
sens.write() 


Listing 3.6: Main iteration loop of adjointSimpleFoam 


Note the usage of the forAll macro in the above code, defined by OpenFOAM as a helper to 
iterate through a generic list-like data structure. It only simplifies the creation of the loop counter 
ranging from zero to the size of the list, it does not create an iterator on the supplied list (for this 
the forAllIters macro exists, however this macro is rarely used). The straightforward definition 
of the macro is given below. 





//- Loop across all elements in a list 
#define forAll@ist, iva \ 
for (Foam::label i=0; i<(list).size(); ++i) 


Listing 3.7: Definition of the forAll macro in stdFoam.H 


The use of this macro is encouraged by the OpenFOAM coding style guide |SG18], and therefore 
it will appear in some of the coming listings. 

The above implementation of a black-box solver is the simplest conceivable implementation 
of discrete adjoints using operator overloading, without the exploitation of further knowledge 
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about the problem. Such optimizations will be discussed in future sections. Building on the 
calculation of sensitivities, an additional optimization loop can be placed on the outside of this 
code to optimize the parameters e.g. using steepest descent. 


3.3. Checkpointing 


The black-box differentiation of complex programs poses challenges, if the amount of RAM required 
to store the tape data structure outgrows the available physical memory. For iterative problems, 
an effective approach to lower the memory footprint is to incorporate recomputation, trading 
off lower memory usage against increased run time. A systematic approach to recomputation 
techniques is known as checkpointing [|GW08] and is introduced in the following. 


3.3.1 Introduction to Checkpointing 


Checkpointing is a technique commonly used to lower the memory footprint needed to adjoin 
complex programs. It uses the deterministic nature of computer programs, which allows to restore 
the exact state of a program from the state at some earlier (or initial) point in the execution 
history. 

Checkpointing involves a memory vs. run time trade off. The storage of all intermediate values 
needed for the reversal process is replaced by storing only selectively, and recomputing the missing 
values when they are needed. To avoid having to completely restart the program to generate 
those missing values, intermediate states of the program (checkpoints) are stored (either in RAM 
or on disk), which allow to restore the state of the program at an intermediate step and resume it 
from there. For the common occurrence of an iterative computation, embedded inside a main 





loop, it is practical to only record a small number of loop steps, adjoin them, and then resume 
the program from an earlier state in order to record the missing loop steps. ‘This process can be 





repeated recursively, until all loop steps have been adjoined. 

Figures 3.8 and 3.9 illustrate the basic checkpointing procedure for four iterations. In the first 
figure only one checkpoint is placed before the first iteration, saving the initial state x°. This 
allows to recreate state x’ of the simulation at any iteration i, by recalculating the state from 
the initial state x°. A single iteration step is adjoined at a time. Thus, only the partials of one 
step need to be stored at a time, instead of four. As the last iteration step is adjoined first, the 
previous (third) iteration step can not be immediately adjoined afterwards. The state x7, needed 
to execute the augmented primal iteration f?, is not available in memory. State x? must therefore 





be recomputed by restoring the only available state x°, and executing (in passive mode) iterations 
f' and f*. Only then can iteration f? be executed in augmented forward mode. After the 
required intermediate values of this iteration step are available in memory, the adjoints obtained 
from adjoining the first iteration step (f*) can be fed back as an input to the calculation of 
the adjoints for iteration step f°. In the figure, this feedback of adjoints is indicated by right 
facing arrows The procedure is repeated for iteration steps two and one, requiring additional 
recomputations of the states x” and x!. After all iteration steps have been adjoined, the data 
flow reversal can be finished by adjoining the pre-processor. ‘The approach to only checkpoint the 
minimal amount of information necessary and recompute all other information is called recompute 
all approach. 

The amount of passive recomputation can be minimized, by not only checkpointing the initial 
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Figure 3.8: Reversal of iteration history with only one checkpoint for state x°. All other states 


Figure 3.9: Reversal of iteration history with checkpoints for states x 
to reverse all iterations without any extra recomputation (checkpoint all approach). 
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state x9, but also all other states. This allows to immediately start the necessary forward 
evaluations of the iteration steps, without performing passive recomputations to advance the 
iteration state first. For the example in Figure 3.9, those states are x°,x!, and x*. Checkpoints 
for x° and x* are not required, as they would never be restored (the post processor is adjoined in 
conjunction with the last iteration step immediately after executing the three passive forward 
evaluations). The adjoints of state x° and x* are available after the first interpretation step and 
thus a checkpoint for x? is not required. The approach to store all possible checkpoints is called 
checkpoint all approach. 





For the general case of k iteration steps, the cost of adjoining the entire iteration history using 
the recompute all and checkpoint all schemes are as follows. Adjoining one iteration step at a 
time the checkpoint all approach requires the augmented forward and reverse evaluation of k 
iteration steps, as well as the (passive) forward computation of k — 1 iteration steps to reach the 
first active step. ‘he recompute all approach needs the same k active evaluations. In addition 
it needs 
k(k — 1) 


(k—-1)+(k-2)+...41= 5 


passive iterations to recover the state from the initial state for every reverse evaluation step. 
Therefore ((& —1)-(k—2))/2 additional passive iterations are needed, compared to the checkpoint 
all approach. It is clear that the behavior of recompute all with a run time factor of O(k) 
compared to the black-box evaluation is undesirable. Checkpoint all has an attractive run time 
factor of O(1), compared to black-box (assuming passive and active steps are of same run time 
cost, the factor is lower than two. In practice it will be lower), however the number of checkpoints 
which can be stored is limited by the available virtual or physical storage space. Thus, in practice 
a checkpointing scheme is chosen, that aims to minimize the calculation time, while limiting the 
number of checkpoints to still fulfill the memory constraints. 

The question of how to optimally place a fixed number of checkpoints, in an evolution of arbitrary 
function calls with known cost, is a combinatorial NP-hard problem |Nau08]. A special case is 
the application to iterative methods, where checkpoints can be placed between each iteration, 
each iteration is assumed to have equal cost in terms of run time and memory (or at least the 
cost is quantifiable a priori), and where the number of iterations performed is known beforehand. 
For those assumptions, provably optimal spacings can be given without solving an optimization 
problem first. Such a provably optimal spacing is given by the revolve-algorithm |GW00; GW0O8]. 

For the reversal of the SIMPLE algorithm in discrete adjoint OpenFOAM, we first considered 
equidistant checkpointing. Intermediate steps of the problem are stored at a fixed distance 
and the checkpoint locations remain constant for the whole program execution. ‘This simplifies 
the implementation significantly, but ignores the possibility to reuse checkpoints once they 
are no longer needed, because all iteration states reachable from the checkpoint have already 
been adjoined. 





The discrete adjoint OpenFOAM package also implements binomial checkpointing in the form 
of revolve algorithm. The revolve algorithm places checkpoints with a logarithmic spacing, making 
sure that checkpoints are spaced more densely next to the current interpretation step, minimizing 
the number of recomputation steps. Checkpoint locations are reused once they are not needed at 
their former position anymore. 

Figure 3.10 shows the application of revolve algorithm to 100 iteration steps of the SIMPLE 
algorithm for different numbers of checkpoints. For example, for three checkpoints, revolve places 
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Figure 3.10: Calculation and reversal of 100 iteration steps with 100 (checkpoint all), five, and 
three revolve checkpoints. One iteration step is recorded at a time. Horizontal lines indicate 
the evolution of the checkpoint positions during the reversal procedure. 


checkpoints at iteration steps (0,65,92). The upper checkpoint is consecutively lowered until 
iteration step 65 has been reached and the center checkpoint can be replaced at a lower location. 
The checkpoints at iteration 0 is restored and is used to place the new checkpoints at (0, 38, 58). 
This process is repeated until all iteration steps have been recorded and adjoined. 

All checkpointing by default is done on the outer level of the iteration loop, therefore at least 
one complete iteration step has to fit into RAM. With some implementation effort, this can be 
transformed to capture checkpoints in between different PDEs (e.g. for the SIMPLE algorithm to 
place checkpoints between the momentum, mass conservation and turbulence equations). Further 
breaking it down to inside the PDE solver level is more challenging, due to the strict scoping 
of C++. The automatic placement of checkpoints at arbitrary positions in the program is currently 
a development target for dco/c++. Theoretical groundwork has already been laid in |Lot16]. In 
practice we have observed that problems which would require such high amounts of RAM profit 
immensely from being distributed to different MPI nodes (see Chapter 5), making more granular 
checkpoints rarely necessary. 

For rising number of checkpoints, the difference between equidistant and binomial checkpointing 
vanishes. In the limit case, where every iteration step can be checkpointed, binomial checkpoint- 
ing naturally cannot improve over an equidistant scheme anymore. ‘The number of required 
recomputations for equidistant and revolve checkpointing over the number of placed checkpoints 
is illustrated in Figure 3.11. 








3.3.2 Implementation of Checkpointing in Discrete Adjoint OpenFOAM 


One requirement for the application of checkpointing is the separability of the main iteration 
loop, that is the solver has to be implemented such that the iteration procedure can be run for a 
specified number of iterations from an arbitrary state. 

The stock OpenFOAM solvers are not structured to easily achieve this. All iterations are placed 
in a main loop and all flow fields are scoped locally to the main routine of the solver. Therefore a 


80 


3.3 Checkpointing 


le+06 
Revolve 
= Equidistant 
= 100000 Ratio (y2 Axis) 4 
md oO 
= 10000 = 
: : 
= 
z 1000 : 
« 
oY 
100 





i! 10 100 1000 


Number of checkpoints ngp 


Figure 3.11: Number of required recomputations m for checkpointing 1000 iteration steps for 
increasing number of checkpoints ncp. Run time ratio between equidistant and revolve in 





green on second y-axis. 


wrapper around the adjointSimpleFoam solver was created, moving the flow fields into a class 
and separating the individual loop iterations into a member function. ‘This function can be called 
to advance the simulation by a single iteration step. Additionally this class implements abstract 
methods from a CheckController class, which is designed to make the checkpointing interface 
applicable to a variety of solvers by defining a common set of routines to be implemented by every 
checkpointed solver. The separation of iteration steps is also useful for interfacing with external 
optimizer packages, which require the repeated evaluation of the primal simulation or gradients. 

The checkpointing functionality is implemented in the discrete adjoint OpenFOAM framework 
by an abstract interface, from which the different checkpointing strategies are derived. ‘The 
following functionality is provided by the checkpointing interface: 











Store checkpoint: Store the primal values of the required flow fields x into a temporary 
buffer B; € R”™. 


Restore checkpoint: Overwrite the primal values of the flow fields x with the content of 
buffer B; € R”™. 


Register variables: Register the variables of the state x in the tape and remember the assigned 
tape indices in tT € N”™. 


Store adjoints: Extract the adjoints from the tapes adjoint vector (from the locations stored 
during the register variables step) and store them in a temporary buffer T € R™. 


Restore adjoints: Inject the stored adjoints from the buffer T back into the tape at the locations 
currently occupied by x. 
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Figure 3.12: Class diagram of the checkpointing interface. 


Figure 3.12 shows the class structure of the checkpointing interface, developed for discrete 
adjoint OpenFOAM. It allows to checkpoint instances of geometricField, which is a base class of 
volScalarField (e.g. used for pressure), volVectorField (e.g. for velocity), surfaceScalarField 
(e.g. face fluxes), and surfaceVectorField (e.g. surface normal vectors). In addition also 
generic scalar data can be checkpointed. Currently the checkpointing strategies equidistant 
(CheckEquidistant) and revolve (CheckRevolve) are implemented as specializations of the ab- 
stract CheckMethod base class. For verification, a dummy class (CheckNone) which implements 
black-box differentiation is implemented as well. 

The memory size occupied by ncp checkpoints is 


Mop = (ncp + 1): nx - sizeof (double) + nx - sizeof (Foam: :label) , (3.2) 
—— ee 
nop checkpoints + one adjoint buffer tape index store 


where Foam: :label is a typedef for the data type used for integer data. Assuming that 64 bit 
double and long types are used, the memory requirement becomes 


Mcp = (nop + 2) + nx - 8 bytes. 


The size of the tape index storage can be reduced from O(nx) to O(1) by using the fact that all 
variables of a field are registered consecutively, and thus the tape indices of a field are adjacent 
and deterministic. 

Figure 3.13 shows the run time of the adjointSimpleCheckpointingFoam solver for adjoining 
all iteration steps of the SIMPLE algorithm on the angled duct example. Observed are two, five, 
and 100 checkpoints and iteration ranges from [0,1] to [0,100]. The curves for 100 checkpoints, 
which correspond to a checkpoint all approach, show that the run time does not increase linearly. 
This is due to the decreasing cost of iteration steps near to the converged solution, as the embedded 
linear solvers need less iterations to converge to their tolerance limit. 

In OpenFOAM, the ratio between tape cost and checkpoint cost is usually very high, making it 
feasible to store a high number of checkpoints. ‘This is demonstrated by ‘Table 3.4 and Figure 3.14. 
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Figure 3.13: Run time over number of calculated iterations adjointSimpleCheckpointingFoam 
for varying density of checkpoints. Dashed lines indicate symbolically differentiated linear 


solvers. 


Table 3.4: Checkpoint size and average tape size with standard deviation o for one SIMPLE 


iteration step. 


Case Checkpoint size Tape size SDLS Tape size black-box 
(MB) Avg. (MB) o (MB) Avg. (MB) o (MB) 
Pitz-Daily L774 442.55 0.57 2282.19 167.127 
Angled duct (Lvl. 6) 1.11 232.08 0.19 990.05 134.39 
Angled duct (Lvl. 12) 4.36 923.51 0.81 3811.86 546.49 
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Figure 3.14: Average tape size (left y-axis) and checkpoint size (right y-axis, blue curve) for the 
angled duct case of different mesh resolutions. 


The figure shows both the checkpoint and average tape size for 20 SIMPLE steps of the angled 
duct example, for varying mesh densities. ‘The coarsest possible configuration for this case consists 
of 325 cells. From this configuration, 20 different levels of mesh refinement are created, ranging 
up to 130000 cells. 

The checkpoint size grows linearly with nc, as predicted by Equation (3.2). With symbolically 
differentiated linear solvers (SDLS, see Section 3.4), the tape size also grows linearly. The tape 
size largely remains constant over all iterations of the SIMPLE algorithm. Using black-box 
differentiation of the linear solvers, the tape size becomes less predictable, due to the varying 
number of inner linear solver iterations. Thus, in the figure we also give error bars and the 
standard deviation of the tape sizes. On average, the tape size still grows roughly linearly with no, 
albeit with a much bigger factor. 

The table includes the angled duct case for refinement levels 30 (11700 cells) and 60 (46 800 
cells). For both configurations, the ratio between tape size (utilizing SDLS) for one SIMPLE 
iteration step and one checkpoint is roughly 210. For the Pitz-Daily case with 12 225 cells, the 
ratio is even higher, with roughly 250. This is due to the additional complexity introduced by 
the differentiation of the k-e turbulence model. 

Thus, binomial checkpointing only reaches its full potential for cases with many iteration steps, 
that can not be covered well by equidistant checkpoints. Such cases are e.g. generated by transient 
flow simulations with high Reynolds numbers, which require a very small time step At to converge 


reliably. 
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3.3.3 Verification of the Checkpointing Implementation 


One requisite for a successful application of checkpointing is that all variables belonging to the 
states of the iteration loop are correctly identified. All variables which 


e are overwritten inside of the iteration, 
e are scoped outside of the iteration, 
e and have an influence on the final outcome of the calculation 


need to be included in the checkpoint. ‘The obvious first step to check the correctness of 
the checkpointing implementation is to compare the results to results obtained by black-box 
differentiation. However, if the results are incorrect, further insight into the process is needed, to 
determine which part of the calculation is incorrectly handled. 

With only minor modifications to the solver and AD tool, it can be verified if all required state 

variables have been identified or if any are missing. Every iteration step has to be self contained, 
i.e. the operations inside of an iteration step are only allowed to depend on the primal values of 
the current state, variables which are local to the current iteration, or globally defined variables 
that are never overwritten (e.g. the parameters a). Any dependency to previous iterations has 
to be localized, that is added to the state, by the checkpointing routines. ‘This makes sure that 
when resuming from a checkpoint, all required values are available (in scope) and have the right 
numeric value. 
The AD tool can be adapted to identify edges in the tape, obtained by the black-box solver, which 
point to regions which will become illegal when checkpointing is applied and the order of loop 
execution becomes non-consecutive. A set of legal and illegal edges in the tape is illustrated in 
Figure 3.15. This technique was used to identify an issue in the checkpointing of shape adjoints. 
This will be discussed in Section 4.4. 





Adjoint Vector 

















Figure 3.15: Legal (green) and illegal edges (red) for checkpointing. Edges are allowed to point 
to the global optimization parameters, to entries local to the iteration, and to entries of the 
directly preceding state. All other edges are illegal. 
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3.4 Symbolic Differentiation of Embedded Linear Solvers 


Typically, the run time of PDE solvers is heavily dominated by the execution of linear equation 
solvers (compare to Chapter 5), which repeatedly solve the linearized and discretized differential 
equations. While differentiating all linear solvers with AD technically works, it is in practice 
undesirable, as the black-box differentiation of certain classes of linear solvers exhibit numerical 
instabilities |Chr18]. Furthermore, it is very costly to fully differentiate those solvers, especially 
in terms of memory required for the storage of the tape. For iterative solvers, the cost is strongly 
correlated with the condition number of the linear system. The cost grows linearly with the 
number of iterations required by the solver, which in turn grows with the condition number. 

We will now outline how the adjoints of linear systems can be obtained, without fully dif- 
ferentiating the solution process by AD. When calculating the solution x to the linear system 
Ax = b without applying AD techniques, the adjoints A and b are not propagated automatically. 
However, using the known adjoints of the solution x, the adjoints of A and b can be calculated 
symbolically |Gil08; Nau+15]. 


3.4.1 Symbolic Adjoint Relations for Linear Systems 


Theorem 4 (Calculation of adjoints of RHS vector b). 
For a regular matrix A € R"*”, vectors z € R",b € R”, and x = A-'D, the adjoints of b are 
given by b= ATE. 


Proof. See |Gil08; Nau+15]. a 


Instead of explicitly calculating the transposed inverse A~’ of A, the adjoints of the right hand 
side can be calculated at cost O(n?) (assuming dense matrices) by solving the equivalent linear 
equation system 

AP ob x. (3.3) 


This is particularly beneficial for sparse systems, as the inverse of a sparse matrix is in general 
not sparse. For the sub-case of symmetric matrices and direct linear system solvers, an existing 
(LU,QR or Cholesky) factorization of A from the primal solve Ax = b might be reused, at the 
cost of storing the possibly more dense factorization instead of A. This lowers the complexity 
from O(n?) to O(n?) for the forward and backward substitution of the factorization. 


Theorem 5 (Calculation of matrix adjoints A). 
For a regular matric A € R°*", vectors 2 € R",b € R” and « = A~'D, the adjoints of the 
individual matrix entries A are given by the outer product A= —b@ 2a’. 


Proof. See |Gil08; Nau+15]. OC 


The adjoints A and b can thus be calculated by solving the additional equation system (3.3) and 
evaluating an outer product. We assume that the memory locations of A and b are not aliased 
and thus A and b are truly independent. 

To summarize, the discrete AD differentiation of the linear system solvers x = S(A,b) is 
replaced by the following steps: 
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During primal solution: 
A-x=b ~> x:=S(A,b) ; 


During reverse propagation of adjoints: 


A’ -b=xX > b:=S(A‘,x) (3.4) 

A:=—-be@x’. (3.5) 

In the context of CFD, the linear solvers are embedded into a non-linear (iterative) calculation. 
The matrix A is usually stored in a sparse representation (e.g. compressed row storage or coordinate 


format). The outer vector product in Equation (3.5), forming A, only needs to be applied to the 
adjoints corresponding to non-zero entries. While the outer product produces a dense matrix 





of adjoints, structurally zero matrix entries will by definition never influence any results, their 
adjoints are thus deemed irrelevant. 

Note, that for the symbolic adjoint relation to hold, it is required that x is actually the solution 
to A:x =b, that is x = A~!b. When using iterative linear solvers, stronger accuracy limits may 
be required during the primal evaluation, in order to achieve the desired accuracy in the symbolic 
adjoint. Furthermore, the concept of relative tolerances (where one does not prescribe an absolute 
residual threshold, but a reduction of the residual in relation to the initial state by a certain 
factor) should not be applied when differentiating the linear system symbolically. It has been 
shown, that the concept of relative tolerance can still be applied to symbolically differentiated 
iterative linear solver schemes, when certain corrections are applied |AHM16]. 

A case study for the propagation of errors, due to the residual of the primal and adjoint linear 
systems is shown in Figure 3.16. Observed are 150 SIMPLE iteration steps of the angled duct case, 
introduced in Section 3.1.2. Linear solvers are GAMG for the pressure equation and Gauss-Seidel 
for the momentum equations. The linear systems are solved with a solver tolerance of e¢ for the 








primal solvers (for pressure and velocity) and e, for the linear solvers in the reverse propagation 
phase. The errors are obtained by comparing to a reference solution, obtained by ef = €, = 10. 
which is at the limit of machine precision. ‘The obtained results are not completely smooth, as the 
tolerances €f and €, are only the upper limit for the linear solver. The actual tolerances achieved 
by the discrete number of iteration steps of the linear solver might be lower, and do not scale 
linearly with the prescribed tolerance e. 

dco/c++ allows to stop the recording of the adjoint stack (switch to passive mode) and restart 
the recording at a later time (switch to active mode). The resulting gap in the tape has to be filled 
when the adjoints are propagated from the outputs to the inputs, during the adjoint propagating 
phase. For this purpose, dco/c++ allows to create functions to be called at a specific point in the 
adjoint propagating process, using a callback interface. Figure 3.17 shows a conceptual overview 
of the solution process of one velocity-pressure correction step with the corresponding calls to 
linear solvers (inside the gray boxes) and how adjoint information is propagated. The information 
missing from the tape due to the gaps must be supplied by evaluating Equations (3.4) —(3.5) 
inside the adjoint callback functions. 

As the tape is switched off during the linear solver calls, passive versions (using regular floating 
point data types) of the linear solvers can be called, increasing performance and reducing memory 
overhead. However, for this the data of the matrix and vectors need to be copied into passive 
containers, introducing some additional code and data duplication. 

The implementation of the symbolically adjoined linear solvers in the context of a sparse 
iterative CFD solver will be discussed in the following section. 
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mean error 





10~°© 107 





Figure 3.16: Mean error between reference result and result obtained by forward linear solver 
tolerance ef and reverse solver tolerance ¢,. Differentiated are 150 SIMPLE iteration steps of 
the angled duct case, starting from a solution of the potential equations. 


3.4.2 Implementing SDLS in OpenFOAM 


In OpenFOAM, the assembly and solution of linear equation systems is implemented in the 
finiteVolume library, using general linear equation system solvers implemented in the OpenFOAM 
library. The solvers for scalar and vector fields are implemented in 


e fvMatrices/fvScalarMatrix/fvScalarMatrix.C and 
e fvMatrices/fvMatrix/fvMatrixSolve.C 


respectively. For vector fields, the matrix (FVM discretization) coefficients are scalars and 
identical for all dimensions. The coefficients differ only for the interfaces, which are not part of 
the LDU coefficients but are stored separately. Interfaces represent the boundary conditions and 
internal processor faces, if solving a parallel case. The equation systems are solved in a segregated 





fashion, i.e. the vector components are decoupled and solved for independently. 

To solve a vector equation (e.g. from the discretized momentum equation), three (for the general 
3D case) scalar equations with the same matrix coefficients but different interfaces and right hand 
sides are solved. The coupling between the equations is only introduced by the outer non-linear 
iteration scheme and the right hand side contributions. ‘To compute the symbolic adjoints of the 
linear solvers, it is therefore sufficient to focus on the solution of scalar equations and let the AD 
tool automatically differentiate the assembly of the scalar subproblem from the vector problem. 

The matrix classes in the finiteVolume library pass a lduMatrix (see Section 2.6), which is 
assembled from the chosen discretization schemes and boundary conditions, as well as a right 
hand side vector and an initial guess (usually the result of the previous non-linear iteration) to 
the linear equation system solvers defined in the OpenFOAM library. 

Currently, the following iterative linear equation system solvers for lduMatrices are imple- 
mented in OpenFOAM: 
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(a) Augmented forward creating gaps (b) Adjoint reverse propagation filling gaps 





Figure 3.17: Outline of the procedure for filling the gaps in the tape created during the 
augmented primal section by executing passive versions of the linear solvers only. New outer 
iteration values U‘t! and p**! are generated from U’ and p’. The shaded parts indicate the 
scope of the linear solvers. The upward facing dashed lines indicate the part of the solution 
process that is treated symbolically. Consequently this part is not stored in the tape and has 
to be supplied in an adjoint callback function in the (reverse) adjoint propagation step. 








GAMG: Geometric Algebraic Multigrid solver with multiple choices for the smoother |Bra77], 
PBiCG: Preconditioned Biconjugate gradient method |Fle76], 

PBiCGStab: Preconditioned Biconjugate gradient stabilized method |Van92], 

PCG: Preconditioned Conjugate Gradient method (for symmetric matrices) |HS52], 


smoothSolver: Simple preconditioned Gauss-Seidel solver. 


All solvers are coupled to the 1duMatrix class with the same interface, and thus for the calculation 
of symbolic adjoints any of the above mentioned solvers can be used for the primal and adjoint 
callback linear systems. When using symbolic adjoints for the linear solvers, there is no particular 
need to use the same solver for the primal and adjoint linear systems. If e.g. the smoothSolver 
is used for the primal, to obtain a better stability of the primal convergence, it might be possible 
to use a solver with better convergence properties like GAMG for the adjoint. 
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We will highlight the major implementation points of the solver for scalar matrices implemented 
as fvScalarMatrix: :solveSegregated() in fvMatrices/fvScalarMatrix/fvScalarMatrix.C. 
The code is simplified, in that it always assumes an asymmetric matrix. For symmetric matrices, 





the matrix transposal is not necessary, and not all matrices elements need to be incremented. 
However, the symmetric part of the matrix has to be incremented as well, making the code more 
complex. Furthermore, the code always assumes that the tape is switched on at the entry to the 
function, and thus symbolic differentiation is required. ‘The full code, without those assumptions 
and the ability to switch off symbolic differentiation on demand, is included in Appendix B. 

In Listing 3.8 the code for the assembly and solution of a scalar lduMatrix is shown. First the 
solution vector psi is initialized with the field of the last iteration psi_ as an initial guess. ‘Then 
the diagonal of the matrix is modified with coefficients of the boundary conditions (Line 5). Note 
that the matrix coefficients are stored in the fvMatrix object, and are accessible by dereferencing 
the this pointer. In order to not alter the matrix permanently, the diagonal is stored (Line 4) 
and restored at the end of the function (Line 21). The right hand side vector totalSource 
is assembled from the member field source_ and the influence of boundary conditions (Lines 
7,8). After those preparations, the lduMatrix solver object can be assembled from the matrix 
coefficients stored in *this (Line 13), the boundary and internal coefficients (Lines 14,15), 
the scalar interfaces of the solution vector psi, and the solverControls object. ‘The newly 
constructed solver object is then used to solve the linear equation system with the right hand 
side totalSource for the unknowns psi (Line 18). Finally, the solution vector psi is corrected 
to comply with boundary conditions (Line 21). 


Foam::solverPerformance Foam::fvMatrix<Foam::scalar>::solveSegregated (const 
dictionary& solverControls){ 
auto& psi = const_cast<GeometricField<scalar, fvPatchField, volMesh>&>(psi_); 


scalarField saveDiag(diag()); 
addBoundaryDiag(diag(), 0); 


scalarField totalSource(source_); 
addBoundarySource(totalSource, false); 


// Solver call 

solverPerformance solverPerf = lduMatrix:: solver: :New( 
psi.name(), 
*this, 
boundaryCoeffs_, 
internalCoeffs_, 
psi_.boundaryField().scalarInterfaces(), 
solverControls 

)->solve(psi.primitiveFieldRef(), totalSource) ; 


diag() = saveDiag; 
psi.correctBoundaryConditions() ; 


Listing 3.8: fvMatrix solve routine, can be differentiated by AD without change. 


This code is taken straight from the OpenFOAM code base and can be used without any changes 
to compute the black-box derivatives of the linear solvers. 
In Listing 3.9 the modifications that were made to incorporate the symbolic differentiation of 


90 


3.4 Symbolic Differentiation of Embedded Linear Solvers 


the linear equation systems are shown. 

The gap in the tape is opened by switching off the tape in Line 10 and closed by switching it 
back on in Line 43. This gap only wraps the creation and solution of the 1duMatrix solver object. 
However, it does not include the code which modifies the matrix and source terms according to 
the boundary conditions. This initialization code is still handled by AD. The modified solver 
creates a callback object, which is inserted into the tape in Line 42. It will be called during the 
reverse propagation phase, when the gap in the tape is encountered. 

A dco/c++ callback object allows to store 


Callback function: Function that will be called when the interpretation of the tape reaches the 
gap. Contents of the callback object are passed as argument to this function. 


Output variables: Variables, the adjoints of which will be passed as an input to the callback 
function. 


Input variables: Variables, the adjoints of which will be calculated by the callback function. 


Data variables: Variables, the values of which are copied and made available to the callback 
function as auxiliary input variables. 


For the symbolic differentiation of the linear solver, we store the following data (Lines 34-37): 


e A string, which holds the field name (e.g. "p" for pressure), in order to look up the solver 
settings for the linear solver in the reverse section. 





e An int, which holds a direction, (0,1,2) indicating for which dimension should be solved. 
A scalar field is indicated by direction —1. 


e A fvMatrix, which holds a copy of matrix (*this), used to solve the primal equations. 


e A scalarField, which holds a copy of the solution psi, computed by the primal linear 
equation system solver. 


The adjoints of the solver output psi are required as inputs to the callback function, and are 
thus registered as output variables in Line 40. 

By passing the input adjoints and the data variables to the callback object, the desired adjoints 
can be computed during the execution of the callback function. ‘The algorithms to compute the 
symbolic derivatives are shown in Listings 3.10 and 3.11. The first listing shows the retrieval of 
the output adjoints and auxiliary variables, the assembly of the adjoint system and its solution. 
The data variables, in particular x and the matrix coefficients corresponding to A are restored in 
Lines 3-8. The incoming adjoints x of the solver outputs are read in Lines 11,12. The internal 
and boundary coefficients of the matrix are set in Lines 20,21. ‘These coefficients differ for the 
different components of a vector field. ‘To extract the correct coefficients from the callback 
object, here the stored direction is needed. For a scalar field, component (i) always returns a 
reference to the scalar field, no matter which dimension is passed. Therefore, those lines both 
work for Type=scalar and Type=vector. 

Next, the matrix A is transposed (here always assumed to be necessary). For the OpenFOAM 
sparse matrix format, this can be done by swapping the upper and lower coefficient vectors, 
as well as the boundary and interior coefficient vectors (Lines 24-31). As the matrix A stored 
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1 solverPerformance fvMatrix<scalar>::solveSegregated(const dictionary& 
solverControls){ 

GeometricField<scalar, fvPatchField, volMesh>& psi = 
const_cast<GeometricField<scalar, fvPatchField, volMesh>&>(psi_); 


scalarField saveDiag(diag()); 

addBoundaryDiag(diag(), 0); 

scalarField totalSource(source_) ; 
addBoundarySource(totalSource, false); // assemble RHS vector 


Oo ana a FF WwW WD 


10 ADmode::global_tape->switch_to_passive(); // Gap in tape starts here 

11 

12 ADmode::external_adjoint_object_t* D = ADmode::global_tape-> 
create_callback_object() ; 


13 

14 forAll(totalSource, i) 

15 D=>repister input (LotalsSource il); 

My ene (likin oS OS ln oS tygjocas O) geese () 3 Berar), 

17 D->register_input (this->upper() Li]); 

18 for(int i = 0° i < this->lower (©). size() = i++) 

19 D->register_input (this->lower() Li]); 

20 for(int i = 0; i < this->diag().size(); i++) 

Bi D->register_input (this->diag() li]); 

22 

23 // solve (*this) * psi = totalSource 

24 solverPerformance solverPerf = lduMatrix::solver::New 

25 ( 

26 psi.name(), 

27 *this, 

28 boundaryCoeffs_, 

29 internalCoeffs_, 

30 psi_.boundaryField().scalarInterfaces(), 

31 solverControls 

32 )->solve(psi.primitiveFieldRef(), totalSource) ; 

33 

34 D->write_data(psi.name()); 

35 D->write_data(Foam::direction(-1)); // dummy direction for scalar fields 

36 D->write_data(*this); // copy of lduMatrix representation 

37 D->write_data(psi); // copy of solution to (*this) * psi = totalSource 

38 

39 forAll(psi.primitiveField() ,i) 

40 psi.primitiveFieldRef () Li] = D->register_output (dco::passive_value((psi. 
primitiveFieldRef()[il))); 

41 


42 ADmode:: global_tape->insert_callback<ADmode::external_adjoint_object_t >( 
symbolic::fill1SolverGap<Foam::scalar>,D); 

43 ADmode::global_tape->switch_to_active(); // Gap in tape ends here 

4A 


45 diag() = saveDiag; 
46 psi.correctBoundaryConditions() ; 
47\ } 


Listing 3.9: fvMatrix solve with creation of solver gap and checkpoints of the necessary data. 
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in the callback object is read only (and might be reused later), a copy is constructed for the 
transposed matrix. 

The solver controls are read in from the system/fvSolution file, specifying the class of linear 
solver and required solution tolerance (Lines 33, 35). The adjoint equation linear systems 


A'b=-xX 


is then assembled and solved in Lines 37-48. The result is available in a1_b after the solver 
has finished. 

The second Listing 3.11 takes the adjoints computed from the adjoint linear equation system 
and writes them to the corresponding places in the adjoint vector of the tape. ‘The adjoints 
of b can be directly written to the tape via the registered input adjoints. For the adjoints of A, 
the outer product is calculated as aj; = b; - x; and written to the corresponding locations in 
the LDU addressing (obtained in Lines 8,9). For the diagonal part of the matrix, the indices 
directly corresponds to the index of the cells, therefore no addressing is required (Lines 23-25). 
For the off-diagonal entries, the indices have to be retrieved from the LDU addressing first. The 
incrementation of the off-diagonal coefficients is implemented in Lines 12-20. 

It is important, that the input adjoints are incremented in exactly the order in which they were 
registered with the callback object, as else wrong elements of the adjoint vector are incremented. 

Here we omitted the treatment of parallel boundaries and symmetry. The former will be 
introduced in a later section, the latter is detailed in Appendix B. 





93 


3 Differentiation of Complex Iterative CFD Algorithms 


1} template<class Type> 
2 inline void fillSolverGap(typename Foam::ADmode::external_adjoint_object_t *D){ 


18 


94 


const Foam::word& fieldName D->read_data<Foam::word>(); 
const Foam::direction& cmpt = D->read_data<Foam::direction>() ; 


const Foam::fvMatrix<Type>& A = D->read_data<Foam::fvMatrix<Type> >(); 


const Foam::volScalarField& x_ref = D->read_data<Foam::volScalarField>(); 


const Foam::scalarField& x = x_ref.primitiveField() ; 


Foam::scalarField ai_x(x); 
iPod iCal a ae) 


ai_xli] = D->get_output_adjoint(); // read incoming adjoints from tape 


Foam::fvMatrix<Type> A_T(A); // will hold transpose of A 


// component() will return scalarField for Type=scalar 


Foam::FieldField<Foam::Field,Foam::scalar> bcmpts = A_T.boundaryCoeffs(). 


component (cmpt) () ; 


Foam::FieldField<Foam::Field,Foam::scalar> icmpts = A_T.internalCoeffs(). 


component (cmpt) () ; 


// transpose matrix by swapping coefficient arrays 
A_T.lower() = A.upper(); 

A_T.upper () A.lower (); 

icmpts = A_T.boundaryCoeffs().component (cmpt) (); 
bempts = A_T.internalCoeffs().component (cmpt) (); 


// Lookup solver controls 
Foam::word reverseFieldName = fieldName + Foam::word("Reverse") ; 
Foam::volScalarField ai_b(reverseFieldName ,x_ref); 


const dictionary& reverseSolverControls = ail_b.mesh().solverDict(L...]); 


VSO vicar ae Geo ee — at lee 


Foam::lduMatrix::solver solver, = Foam::lduMatrix::solver::New 
( 

fieldNameCmpt + Foam::word("Reverse"), 

VEE 

bempts, 

icmpts, 


ai_b.boundaryField().scalarInterfaces(), 
reversesSolverControls 


ee 


J// solve for Wr 


solverPerformance solverPerf = solver->solve(ai_b.primitiveFieldRef (), 


[...] // copti¥wmedgin next listing 


Listing 3.10: Adjoint callback routine: Solution of the adjoint system. 


ado): 
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[Si 7 “continual ionwor spire vious ic dig 

// increment input adjoint for b 

for(int i) — O07 ie Goal be size ©) oni t+ 
D->increment_input_adjoint( dco::value(ai_bLli]) ); 


t 


// Addressing for upper and lower half of matrix 
const labelUList& uAddr = A_T.1lduAddr().upperAddr () ; 
const labelUList& lAddr A_T.1lduAddr().lowerAddr () ; 


// increment adjoints for A (upper part) 

double inc = Q; 

for(int i = 0; i < this->upper().size() @ igr){ 
D->increment_input_adjoint( dco::value( -ait_b[lAddr[iJ]*x[uAddr[i]] ); 

Ir 


// increment adjoints for A (lower part) 

for(int i = 0: i < this->lower( .aize@): i++) { 
D->increment_input_adjoint( dco::value( -at_b[uAddr[iJ]*x[lAddr[i]] ); 

i 


// increment adjoints for A (diag part) 
for(int i = 0; i < nd; i+t+)f{ 

D->increment_input_adjoint( dco::value(-ail_bli]*xLli]) ); 
i; 


// treatment for parallel boundaries will go here, see later Sections 


Coed 


Listing 3.11: Adjoint callback routine: Incrementation of input adjoints. 
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3.5 Adjoints of Parallel Communication 


In this section, we will investigate how to efficiently adjoin iterative MPI parallel programs, 
particularly in the presence of linear solvers. First the general concepts of MPI are introduced, 
then the calculation of adjoints via AMPI is discussed. Building on those foundations the 
calculation of adjoints in the CFD context is explored. Lastly, it is shown how the SDLS approach 
presented in Section 3.4 can be adapted for (A)MPI parallel calculation. 





3.5.1 Message Passing Interface 


The Message Passing Interface (MPI) |Mes94] is the de-facto standard for implementing parallel 
C, C++ and Fortran codes on distributed memory machines. With MPI, each compute node runs 
one or multiple processes of the same executable, each calculating a subproblem of the serial 
problem (e.g. for CFD calculating the flow field on a subdomain of the original domain). All 
communication with other processes is wrapped into messages, which are passed from one process 
to another by the MPI libraries. A message is essentially a chunk of memory contiguous bytes 
of arbitrary length, that is accompanied by some meta data (e.g. tags). One process can not 
directly access the memory of another process, even if both share the same virtual memory 
space. All data which is not available in the local memory space has to be distributed using 
messages. Communication in MPI is either point-to-point or collective. Point-to-point messages 
originate from one process and are delivered to exactly one other process. An example for this is 
the MPI_Send and corresponding MPI_Recv construct for sending and receiving messages. ‘The 
signatures for the MPI_Send and MPI_Recv routines are defined in the MPI standard as 





int MPI_Send(const void *buf, int count, MPI_Datatype datatype, int dest, int 
tag, MPI_Comm comm) ; 


and 


int MPI_Recv(void *buf, int count, MPI_Datatype datatype, int source, int tag, 
MPI_Comm comm, MPI_Status *status) 


respectively. ‘The MPI_Send command sends a message of length count and type datatype to 
the process specified by dest. The call to MPI_Send will block until the receiving process has 
called MPI_Recv with the process ID of the sending process as its source argument. On the other 
process MPI_Recv will block until the sending process has called MPI_Send. ‘Thus, blocking MPI 
communication inherently leads to a synchronization of the involved processes. However, it can 
also lead to deadlocks, if the MPI_Send and MPI_Recv calls are not correctly paired. 

Collective communication is used when a message is required to reach multiple processes at once, 
or when data is to be reduced. Collective communications can be grouped into different cases: 


One to all communication: Data is sent from one process to all others. E.g. MPI_Bcast. 


All to one: Data is sent from all processes to one root process, potentially involving a reduction 
operation. E.g. MPI_Reduce. 


All to all: Data is sent from all processes to all other processes, potentially involving a reduction 
operation. E.g. MPI_Allreduce. 
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Figure 3.18: I[llustration of an AMPI communication pattern. A primal MPI_Send generates an 
adjoint MPI_Send in opposite direction. 


Processor 1 
eae aaeeas Interpretation 


The specific communication patterns, used to implement the collective communications, are 
not specified by the standard and are implementation dependent (popular MPI implementations 
are OpenMPI, IntelMPI, and MpiCH). 


3.5.2 Adjoint MPI 


Adjoint MPI (AMPI) is a library developed at STCE [SN12], in cooperation with Argonne National 
Labs and the Institute for Research in Computer Science and Automation (Inria) [Utk+09]. 
It introduces the concept of adjoints to MPI. The challenge for adjoints in the context of 
parallel communication is the split between (augmented) primal evaluation and reverse adjoint 
propagation. Adjoint information needs to be propagated back from the outputs of the program 
to the inputs. For data that is received in the augmented primal evaluation, using regular MPI 
calls, the corresponding adjoints are not incremented on the remote process during the adjoint 
reverse propagation. ‘Therefore, the adjoints on the remote process are incomplete. 

In order to correctly evaluate the adjoints in presence of MPI communication, the MPI calls in 
the augmented primal section have to be accompanied by corresponding MPI calls in the adjoint 
reverse section, which distribute the adjoints back to the relevant processes. A basic case for this 
is illustrated in Figure 3.18. Data is sent from one process (Po) to another process (P,) with a 
pair of blocking MPI_Send and MPI_Recv calls. In the reverse section the direction of the calls is 
switched, transferring adjoint information from P; to Pp. 

The AMPI library provides a set of wrapper functions for the regular MPI calls. In the 
augmented primal section, AMPI keeps track of the MPI calls and then passes the calls on to 
the MPI library implementation. To track the MPI calls, AMPI keeps a list of all executed calls 
including the required meta data, such as message source and destinations, tags, and message 
length. In order to execute the required MPI calls during the reverse interpretation phase, AMPI 
inserts callback objects (using the same mechanisms introduced in Section 3.4) into the dco/c++ 
tape, which are called when adjoint data needs to be distributed using MPI during the adjoint 
reverse propagation. 

For collective communications, the adjoint communication patterns are more complex. The 











adjoint of a variable x needs to be incremented for every statement which uses it nonlinear in 2. 
Thus, if the local variable x is sent to multiple processes (e.g. using MPI_Broadcast) the adjoint x 
needs to (potentially) be incremented by adjoints sent back from all those processes. Hence, 
a primal call to MPI_Broadcast is augmented by a call to MPI_Reduce with reduce operation 
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L+ = Yi + Y2 + ¥3 


Figure 3.19: Primal (black) and adjoint (blue) communication for MPI_Bcast. The MPI_Bcast 
call in the primal section, distributing value x from root Po to all other processes, is transformed 
into a MPI_Reduce call in the reverse section, summing up the partial adjoints on the root Po. 





MPI_SUM in the reverse interpretation, which sums up all partial adjoints of x from the involved 
processes, and then increments the adjoint of the root process by this sum. ‘This communication 
pattern is illustrated in Figure 3.19 for four processes and root process Po. 

Table 3.5 lists the primal MPI calls and the corresponding calls during reverse propagation for 
some common MPI calls. It has been shown that all common one-sided MPI operations can be 
adjoined with a constant overhead, compared to the primal MPI call, with the exception of the 
MPI_Reduce operation with MPI_PROD as the reduction operator |Sch14|. The number of adjoint 
increments required for a product reduction scales linearly with the number of involved processes, 
due to the product rule. 

AMPI can be interfaced with different AD tools, requiring that these tools provide a set of 
interface functions. An interface for dco/c++ is included in the open-source AMPI release. 





3.5.3 Combining AMPI with OpenFOAM 


OpenFOAM implements distributed messaging in a hierarchical fashion, by wrapping low level com- 
munication routines in a separate layer. This is organized as follows. On the high level, information 
is stored in data structures specific to the CFD domain. For example, GeometricField<scalar, 
fvPatchField, volMesh> stores a field of scalar typed values. It includes references to the 
boundary conditions and the underlying volume mesh. For convenience, the definition is abbre- 
viated to volScalarField via a typedef. The communication of (field) data between different 
processes is abstracted into a library called Pstream, which allows the implementation of different 
parallelization strategies. However, at the moment only MPI communication is implemented in 
the regular OpenFOAM release. 





Table 3.5: MPI routines used in OpenFOAM, their AMPI versions, and communication patterns 
specific to the augmented primal and adjoint sections of the adjoint code. 





Primal MPI call AMPI call MPI call in reverse section 


MPI_Bsend AMPI_Bsend MPI_Recv 
MPI_Recv AMPI_Recv MPI_Bsend 
MPI_Allreduce AMPI_Allreduce MPI_Allreduce 
MPI_Bcast AMPI_Bcast MPI_Reduce 
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class AmpiTypeHelper{ 
private: 
const std::type_info& type; 
Foam::string caller; 
AmpiTypeHelper(const std::type_infok type, Foam::string caller) 
type(type), caller(caller){} 
AmpiTypeHelper () ; 
public: 
template<typename T> 
static const AmpiTypeHelper create(Foam::string caller=""){ 
const AmpiTypeHelper t(typeid(T), caller); 
return Cc; 
} 
bool operator ==(const AmpiTypeHelper &b){ 
return (b.type == this->type) ; 
Jr 
bool is_active_type() const { 
return type == typeid(dco::gais<double>::type) 
|| type == typeid(Foam::Vector<dco::gais<double>::type >) 
|| type == typeid(Foam::Tensor<dco::gais<double>::type >); 
Jr 
}; 


Listing 3.12: Type helper, used to determine the type of data passed to the low level 
communication routines. For debug purposes, the caller function can also be passed along. 


To simplify the low level communication routines, the OpenFOAM development team decided to 
not pass down type information from the high level data structures to the low level communication 
routines. All message data is casted to the char data type (which occupies one byte in C) before 
being passed to the low level routines (more specifically just the data pointer passed to the low level 
is cast to char*). This allows the use of the MPI_BYTE data type inside the MPI send and receive 
routines for all types of data, avoiding specializations of the low level communication routines for 
different data types. As OpenFOAM knows which data to expect on the high level, the type of 
the received data is not needed. However AMPI, which wraps around the MPI routines on the 
low level, needs to be able to separate floating point data from passive data (e.g. integers, strings). 
For passive data, the primal communication routines can be kept unchanged, as no derivative 
information is associated with them. For (active) floating point data, the communication routines 
have to be changed to accommodate the reverse propagation of adjoints. 








To incorporate the type information into OpenFOAM, instead of rewriting the whole communi- 
cation layer, a type helper is introduced. It is inserted into all calls to the low level MPI read and 
write routines. The code for the AmpiTypeHelper is shown in Listing 3.12. The type helper is 
constructed from an arbitrary variable and stores its std: :type_info information. This allows 
to boolean compare the type of the variable to another type. It offers a routine which returns 
true if the corresponding type is or contains a dco/c++ active type. With that information the low 
level communication routines can be augmented to decide whether to issue a MPI or AMPI call. 
An AMPI call only needs to be issued if active types are involved, and the augmented primal is 
executed (i.e. the tape is active) while the call to MPI is executed. 


The procedure outlining the (adjoint) communication flow for standard OpenFOAM and 
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Figure 3.20: Unaugmented data flow from high level classes to communication layer. 
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Figure 3.21: Type augmented data flow from high level classes to communication layer. ‘Type 
information allows AMPI to determine whether adjoint communication patterns need to be 
applied. 
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discrete adjoint OpenFOAM using AMPI is illustrated in Figures 3.20 and 3.21. 


To give an understanding of the amount of (adjoint) communication during a typical solver 
execution, Table 3.6 shows the total number of (A)MPI calls for 10 iteration steps of the 
adjointSimpleFoam solver on the Pitz-Daily case. ‘The domain is decomposed onto 2, 4, and 8 
processors. Listed are MPI calls, that is calls to the standard MPI routines without involvement 
of AMPI, AMPI calls, that is calls to MPI routines from inside the AMPI wrapper routines 
during the augmented primal evaluation, and AMPI_ MPI calls, that is calls to MPI routines 
issued by AMPI during the reverse propagation sweep. 





OpenFOAM provides its own implementations for collective operations (instead of relying on 
MPI_Gather and MPI_Scatter), which are implemented using the basic MPI_Send and MPI_Recv 
routines. ‘Therefore, the only MPI operations present in significant number are blocking MPI_Bsend 
and MPI_Recv pairs, and MPI_Allreduce (with the MPI_SUM reduction operator). MPI_Allreduce 
is mainly used to calculate the residual of the linear solver iterations, while the send and receive 
calls are used to distribute data needed for the matrix vector products. 

From the table it is obvious, that if the differentiation of the linear solvers is performed 
symbolically, only very few AMPI calls remain (only around 2% of all calls to MPI need to be 
passed through AMPI). For SDLS, the communication of data needed for the matrix vector 
products in the iterative linear solvers move from AMPI calls to MPI calls. This significantly 
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Table 3.6: Calls to different (A)MPI routines for 10 steps of adjointSimpleFoam on the 2D 
Pitz-Daily case. Tabulated are decompositions to 2,4 and 8 processors, with and without 
SDLS. 


no SDLS SDLS 

) A 8 2 A 8 
MPI_Bsend 0 0 0 A5738 145578 517770 
MPI_Recv 0 0 0 A5738 145578 517770 
MPI_Allreduce 10 20 AO A0414 108592 333896 
AMPI_Bsend 23758 74946 265298 640 1920 7040 
AMPI_Recv 23758 74946 265298 640 1920 7040 
AMPI_Allreduce 20718 55116 168112 166 Bo2 664 
AMPI_Waitall 20 36 56 20 36 56 
AMPI_MPI_Bsend 23758 74946 265298 640 1920 7040 
AMPI_MPI_Recv 23758 74946 265298 640 1920 7040 
AMPI_MPI_Allreduce 20716 55112 168104 164 328 656 
Total MPI calls 352 1010 2358 1382232 400738 1371754 
Total AMPI calls 136506 410120 1397716 2930 8448 29788 
Total AMPI MPI calls 68232 205004 698700 1444 4168 14736 
Total sum 205090 616134 2098774 136606 413354 1416278 





reduces the number of calls to MPI routines during the reverse propagation phase and also lowers 
the number of adjoint callback objects in the dco/c++ tape, reducing run time and memory. 

The inclusion of SDLS comes at the cost of additional linear solver calls during the reverse 
propagation phase, which in turn issue additional MPI calls. Therefore, the number of MPI_Send 
and MPI_Recv calls for the SDLS case is higher than the corresponding AMPI_Send and AMPI_Rev 
calls for the non-SDLS case. However, for the studied case the increase is lower than a factor of 
two, meaning that the reverse propagation issues less MPI calls than the AD implementation 
does during reverse propagation. As the number of iterations during primal and reverse linear 
equation solves is not necessarily equal, the opposite could also be true for a different case. 

The additional steps required to incorporate SDLS in an AMPI setting are discussed in detail 
in Section 3.5.5. 








3.5.4 Tangent MPI Communication 


The tangent data type of dco/c++ can be used with MPI with only minor modifications. The 
memory layout of the (scalar or vector) tangent type always bundles together a value with its 
corresponding tangent(s). This contiguous memory layout allows to reuse the existing MPI send 
and receive routines with correspondingly increased message size. ‘The message size doubles for 
tangent scalar mode and increases by factor (1 + n,) for tangent vector mode. 

When using the built in MPI reductions of MPI_Allreduce, for some reductions additional 
logic is required. The reductions assume that each entry of the sent data stream contains an 
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Figure 3.22: Reduction of tangent data using MPI_Sum. The MPI reduction treats the two 
tangent types as four floating point values and sums them up individually. 





element for which associativity and commutativity with other elements holds. For a vector of 
tangent data types, the primal values and tangents are stored interleaved, however MPI interprets 
the vector as a (bigger) vector of primals. 

Inspection of the OpenFOAM source code reveals, that only two of the built-in reductions 
are used, namely MPI_SUM for summation and MPI_MIN for finding the minimum value across a 
vector. For MPI_SUM no special actions have to be taken, because the tangent of a sum of two 
tangent types is the sum of its tangents. The built-in sum reduction will thus calculate the correct 
primals and tangents by default. Note, that this only holds, because the MPI_SUM reduction sums 
up vectors element-wise without changing their dimension, and thus primals and tangents are 
summed up without mixing. This reduction is illustrated in Figure 3.22. 

For all remaining reduce operations (of which only MPI_MIN is currently used in the OpenFOAM 
code base), we enforce the use of a manual reduction implementation, which separates the 
communication of data and the reduction operation. As this manual reduction calls functions 
that are covered by the AD tool, instead of the MPI intrinsic reduce operations, the correct 
tangents will be calculated. 


3.5.5 Parallel Symbolic Differentiation of Linear Solvers with AMPI 


The symbolic differentiation of (sparse) linear solvers becomes more complex when parallel 
communication is involved. For distributed CFD calculations, commonly a ghost cell approach is 
used, where values of remote domains are cached locally and updated only on demand to reduce 
communication complexity. In a black-box differentiation setting, the update of the boundaries 
will be passed through, and handled by, AMPI. However, if such information is passed during a 
passive section, e.g. during the symbolically differentiated linear solver calls, the flow of adjoints 
is interrupted. The corresponding adjoint information has to be supplied in a callback function, 
that is called during the reverse interpretation sweep, as previously discussed in Section 3.4. 





In order to correctly adjoin the boundary communication, it is important to first understand how 
processor boundaries are treated. ‘The geometric domain decomposition utilized in OpenFOAM 
leads to both decomposed matrix and vector entries. Each processor holds a subset of the solution 
vector and a subset of coefficients of the discretization matrices. The entries of the solution vector 
correspond to values defined on cells/faces located in the decomposed domain. 





The global coefficient matrix is decomposed row wise to the individual processors. The matrix 


102 


3.5 Adjoints of Parallel Communication 


const label nCells = diag().size(); 

const label nFaces = upper().size(); 

// diagonal (cell) coefficients 

for (label cell=0; cell<nCells; cellt+t+){ 
ApsiPtr[cell] = diagPtr[cell]*psiPtr[cell]; 

i 

// off diagonal (face) coefficients 

for (label face=0; face<nFaces; face++){ 
ApsiPtr[luPtr[face]] += lowerPtr[face]*psiPtr[1Ptr[lface]]; 
ApsiPtr[1lPtr[face]] += upperPtr[face]*psiPtr[uPtr[face]]; 

Ir 


Listing 3.13: Calculation of the local matrix vector product between the LDU coefficients and 
vector psi. 


coefficients stored in the local LDU matrix description correspond to a block diagonal sub-matrix 
in the global discretization matrix and can be multiplied with the contents of the solution vector 
without inter-processor communication. Matrix entries which lie outside of the diagonal block 
require multiplication with vector entries that are not located on the same processor. The 
corresponding vector entries need to be communicated to the local processor, in order to be 
added to the matrix vector product. Those matrix entries are stored separately from the LDU 
description and are called matrix interfaces. ‘The corresponding entries of the solution vector are 
called interface values. 

OpenFOAM separates the matrix vector product into two parts. In the first part the product 
between all local matrix coefficients and local vector entries is calculated. This part involves no 
MPI communication and therefore there is no need to supply additional AMPI information. 





The second part involves the multiplication of the matrix interface coefficients with the interface 
values. For each row of the coefficient matrix, the values of the interface coefficients need to be 
brought in from remote processes to the local process, calculating the part of the matrix vector 
product corresponding to the row. 

To hide communication latency, the implementation separates the parallel matrix vector 
multiplication into three stages: 


Preparation of matrix interfaces: Send matrix interface coefficients B to the corresponding 
processors. For non-blocking communication, this allows to start with next step before 





interfaces are received. 


Multiplication of local coefficients: Calculate local matrix vector product according to List- 
ing 3.13. Indirect memory access to both the result and the entries of the multiplicant 
vector, due to the LDU addressing. (Caching friendly if the mesh is numbered efficiently. ) 


Multiplication of interface coefficients: At a later stage add missing product terms to the result 
according to Listing 3.14. Entries of the interfaces are already matched correctly, therefore 
indirect memory lookup only occurs for the result vector. 
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const labelUList& faceCells = this->interface().faceCells(); 


forAll(faceCells, elemI){ 
result[faceCells[elemI]] += coeffslLelemI]*vals[elemI]; 
} 


Listing 3.14: Calculation of the matrix vector product between processor interface coefficients 
coeffs with entries of the remote vector psi, stored in vals. Entries of coeffs and vals are 
already correctly aligned, therefore no indirect access on the right hand side. Implementation 
in lduInterfaceFieldTemplates.C 
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Figure 3.23: Structured 4 x 4 mesh, decomposed onto four processors Po—P3. Cells are numbered 
such that distance between local cells is minimal. To compress notation, cells are numbered to 
a hexadecimal base. 





An example for the mesh decomposition, the LDU decomposition and the global and local 
coefficient matrices is given in Figure 3.24. illustrated is a structured 4 x 4 grid, depicted in 
Figure 3.23, decomposed onto four processors. The global cells are numbered such that the 
maximum (index) distance between two cell indices on the individual processors is minimal. To 





obtain a more compact notation in the global coefficient matrix, we number the 16 cells in a 
hexadecimal base from cp to cf. There are processor boundaries between processor pairs (Po, P1), 
(Po, Po), (P1, P3), and (P2, P3). The vectors l,d,u in the figure list the processor local entries. 
The index vectors L and U give the addressing for J and uw. 





For each processor boundary, two entries of the solution vector x have to be copied from the 
remote process. For example, v?! = [21,73] is copied from P®° to P! and v'? = [x4, 26] in the 
opposite direction. 

For convenience, the numbering in the figure corresponds to global cell numbers. In the 
implementation each processor numbers its local cells from zero. Coefficients outside of the diagonal 
blocks (that is coefficients of faces on a processor boundary), require additional communication. 
Entries of the distributed vector must be brought to the processor, computing the corresponding 
rows of the matrix vector product. For example, uw i4 and u36 on the processor boundary (PO, P1) 
require the computation of the product between coefficients associated with processor Pp and the 


parts of the solution vector located on Py, i.e. x4 and x6. 
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Figure 3.24: Block diagonal matrix, discretizing the mesh from Figure 3.23. Colored coefficients 
on the main blocks are local to the processors and can be multiplied with the corresponding 
entries of the vector x without further communication. Entries in off-diagonal blocks need to 
be communicated to corresponding processors, to correctly evaluate the matrix vector product. 


The SDLS implementation presented in Section 3.4 calculates the adjoints of the local LDU 
coefhicients and vector entries only. In order to correctly capture the adjoints of the parallel 
matrix vector product, we need to also symbolically calculate the adjoints of the matrix interface 
coefficients and the interface values. ‘The calculation of the adjoints is outlined in Algorithm 2. 
Lines 1—20 calculate the adjoints of the local coefficients as outlined in Section 3.4. For symmetric 
matrices, the adjoints of the off-diagonal coefficients need to also be incremented by the adjoints 
of the omitted opposite part (Lines 10 and 18). 

Lines 21-28 list the calculation of the outer product —b- x? for the interface coefficients. In 
order to calculate the adjoints, the interface values of the remote processors need to be brought 
into the local scope (Line 23). Using the remote value of x and the local value of b, the individual 
values of the outer product can be formed. Lines 21—27 directly correspond to Lines 37—55 of the 
implementation shown in Appendix B.3. 
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Algorithm 2: Calculation of the entries of A, b including processor boundary coefficients. 
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Input: primal solution x, incoming adjoints x 


Data: matrix coefficients: (l,d,u), du Addressing (L,U), boundary coefficients B 


Output: adjoints b, (1, d, u) ,B 
b+ b+ solve(A’ , x); 
forall diagonal entries d; at index i in d do 
| d; — d; — bj - x3; 
end 
forall lower entries |; at index i in l do 
9 <= Uy; 
ke L;; 
Ll aeae 
if A symmetric with no upper part then 
| l, 1, — by 7X5; 
end 
end 
forall upper entries u; at index i in u do 
je Li; 
ke U;; 
Uj — U; — d; + LE: 
if A symmetric with no lower part then 
| Ui — Ui — by 25; 
end 
end 
forall processor boundary fields p; with index 7 do 
update boundary cells of x and b on patch p,; 
x* < boundary cell values of x from neighboring processor; 
b’ < boundary cell values of b from this processor: 
forall faces f; with index i on p,; do 
| Bi, [> Bi, — b* : XS 
end 
end 
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3.6 Higher Order Differentiation 


Obtaining derivatives of order higher than one might be desired for a multitude of reasons. 
Second order derivatives can e.g. be used to implement a Newton optimization scheme (see 
Section 2.9.2). Further an additional optimization step, operating on the optimized parameters, 
might be desired. This motivates the use of higher derivatives. As already outlined in Section 2.7.2, 
higher derivatives can be obtained by repeatedly applying the first order model of AD. 

The AD tool dco/c++ allows the nesting of first order data types to obtain derivatives of 
arbitrary order. Nesting a tangent data type inside an adjoint type yields the tangent over 





adjoint model a 


typedef dco::gais<dco::gtis<double>::type>::type doubleScalar; 


Analogously a tangent data type nested inside another tangent data type yields the tangent over 
tangent model x2), 


typedef dco::gtis<dco::gtis<double>::type>::type doubleScalar; 


The nesting of data types to obtain second order derivatives, as well as the access routines of 
dco/c++ required to extract the individual derivative components, are shown in Figure 3.25. 

If implemented naively, the calculation of a Hessian H € R”*” requires n evaluations of the 
tangent over adjoint model of AD and consequently n full evaluations of the flow equations. For 
cases like topology optimization, the influence of a parameter to the final objective propagates 
iteratively through the whole flow domain in a non-linear fashion. This propagation leads to a 
dense Hessian. Thus, coloring techniques |GMP05| which can be used to reduce the number of 
evaluations of the AD model required for sparse matrices are not applicable in this case. This 
makes the computation, and also the storage, of Hessians expensive, such that approximative 
Quasi-Newton methods like BFGS are a better fit to speed up optimization convergence. 

However, for lower dimensional parameter spaces (see e.g. parametric optimization, Section 4.6), 
problems which are known to exhibit sparsity, or applications where higher order derivatives 
are needed, higher order adjoints are the superior choice over FD. An example driver for the 
evaluation of the full Hessian using scalar tangent over tangent mode is given in Listing 3.15. 

Improving on the scalar tangent over tangent model, a whole block of the Hessian can be 
extracted at once using the tangent vector mode (see Section 2.8). For a vector size d, this 
mode allows to extract a sub block S € R?*4 of the full Hessian H with one evaluation of the 
augmented primal function. Listing 3.16 shows the seeding procedure to retrieve the Hessian of 
an arbitrary function f : R” — R under the assumption that the vector size d is identical to n. 
Because the memory consumption of two nested vector types corresponds to O(d7) this is not 
feasible for even modestly sized problems. Hence in Listing 3.17 we choose d < n and extract the 
smaller Hessian sub blocks S one by one. The calculation of the Hessian in blocks also allows to 
exploit the symmetry of the Hessian. 

Using the tangent over adjoint mode of AD, the computational complexity of calculating the 
whole Hessian is lowered from O(n*) to O(n), assuming that m is one or a constant with m <n. 
The tape has to be re-recorded for each tangent direction, therefore each tape is only interpreted 
once. Each evaluation of the tangent over adjoint model yields one row/column of the Hessian. 
The seeding of the adjoint and tangent directions for a general function f is shown in Listing 3.18. 

Both tangent over adjoint and tangent over tangent models are implemented in discrete adjoint 
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Figure 3.25: Interface calls allowing to access specific components of the higher order models. 
dco: :derivative() and dco::value() abbreviated as der() and val(). 


1} typedef dco::gtis<dco::gtis<double>::type>:: type ADtype; 
2 extern ADtype f(ADtype* x,int n); 


3 


PM) Sigel fo) a5 sy gale (Cala iat 


ADtype* x, double** H){ 


5 // seed all directions, evaluate f n*n times 
6 for(int i=0; i<n; it+){ 

7 dco::value(dco::derivative(x[i])) = 1.0; 

8 forint ei. jte t 

9 dco::derivative(dco::value(x[j])) = 1.0; 
10 ADtype y = f(x,n); 

11 HLillj] = HLj] Lil = dco::derivative(dco::derivative(y)); 
12 dco::derivative(dco::value(x[j])) = 0.0; 
13 } 

14 dco::value(dco::derivative(x[i])) = 0.0; 
15 } 

16| } 


Listing 3.15: Dense seeding for obtaining a full Hessian of function f : R” — R in scalar tangent 


over tangent mode. 
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1} typedef dco::gtiv<dco::gtiv<double ,d>::type,d>::type ADtype; 


2 extern ADtype f(ADtype* x,int n); 


4 void t2v_tiv(int n, ADtype* x, double** H){ 

5 // seed all directions, evaluate f once 

6 for(int i=0O; i<n; it+){ 

7 dco::value(dco::derivative(x[i])[iJ) = 1.0; 

8 ngfone (Csitore 9 OR a) Sie qj arar)) 

9 dco::derivative(dco::value(x[j])) Lj] = 1.0; 


10 es 

‘tat ADitty peasy et (Oe, a) 

12 for€int i=0; i<n; itt) 

13 form Gnte | —-O. jon jdt) 

14 HLil lj] = dco:: derivative (dco: :derivative(y) [ jap [ mh; 
15| } 


Listing 3.16: Seeding of vector tangent over vector tangent mode, under the assumption that 


the vector size d of gtlv::type equals problem size n. 


typedef dco::gtiv<dco::gtiv<double ,d>::type,d>::type ADtype; 


extern ADtype f(ADtype* x,int n); 


i 
2 

3 

4 void t2v_tiv(ADtype* x, int n, int d, double** H){ 
5 for(int i=0;i<n;it=d){ 

6 for(int j=0;j<=i;jt=-d){ // WseReymmetry 

7 for€int k=i; k<std::min(itdjn) »wokt++){ 

8 
9 


dco::value(dco::derivative(x[k]) [k/4d]) = 1.0; 
for(int 1l=j; l<std:: Mim jn); 1++)4 
10 dco:: derivative (dco::value(x[1])) [14d] = 1.0; 
11 } 
12 } 
13 ADtype y = f(x,n); 
14 for(int k=i; k<std::min(itd,n); k++){ 
15 dco::value(dco::derivative(x[k]) [k/4d]) = 0.0; 
16 for (int®l=—R 1<“etd::min(j+d,n); 1++)¢{ 
iby H{k] [1& =-WeHLI1Ik] = dco:: derivative (dco::derivative(y) [kjd]) [léd]; 
18 dco:: derivative (dco::value(x[1])) [14d] = 0.0; 
19 } 
20 } 
21 } 
22 } 
23 [ah 


Listing 3.17: Seeding of vector tangent over vector tangent mode for arbitrary matrix size n 


and tangent vector size d. 
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typedef dco::gais<dco::gtis<double>::type>:: type ADtype; 
extern ADtype f(ADtype* x,int n); 
void t2s_ais(int n, ADtype* x, doublex* H){ 
Jj seed all directions . cyvaluave © a times 
for(int i=0; i<n; it+)f{ 
dco::derivative(dco::value(x[i])) = 1.0; 
ADtype y = f(x,n); 
dco::value(dco::derivative(x[i]J)) = 1.0; 
ADmode:: global_tape->interpret_adjoint () ; 
ene (time jG a] Sine parte) al 
Hlilljl) = deo. .déerivative (dco: derivative (,)); 
Ir 
dco:: derivative (dco::value(x[i])) = 0.0; 
Jr 
J; 


Listing 3.18: Seeding of scalar tangent over scalar adjoint mode. 


OpenFOAM. They did not require any changes in the code base, compared to the first order 
models (except the handling of checkpoints, if checkpointing is required). This leads us to believe 
that the computation of derivatives of order three and higher can also readily be implemented, by 
introducing the relevant nested data types. ‘The required configuration options to enable second 
order derivatives in discrete adjoint OpenFOAM are listed in Appendix A.2. 

In our observations, using second order AD to compute Hessians to speed up the convergence 
of optimization methods proved to be ineffective, due to the run time overhead introduced by the 
additional evaluations of the augmented primal. The potentially lower number of optimization 
steps required do not outweigh the increase in run time per optimization step. Furthermore, the 
convergence path of the topology optimizer to a local minimum proved to be quite noisy, leading 
to poor convergence of second order optimization methods, as they tend to choose small gradient 
step sizes. 

The run time factors for a higher order solver of the simpleFoam solver are listed in ‘Table 3.7. 
Listed is only the run time for a single evaluation of the augmented primal. To obtain a full 
Hessian, the models must be evaluated repeatedly. 





Higher order sensitivities can be beneficial for parametric optimizations with limited number 
of parameters, as the run time overhead directly scales with the number of parameters for the 





Table 3.7: Run times for a single derivative evaluation in first and second order tangent mode. 
Run time factor for tangent vector mode (vector size 16) is for a single gradient entry. 


Solver Run time (s) Factor 
simpleFoam 1.14 1 
tisSimpleFoam 28.73 3.71 
tivSimpleFoam A51.47 3.65 
t2stisSimpleFoam 100.29 12.96 
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tangent over adjoint model. In the parametric optimization procedure, introduced in Section 4.6, 
approximately constructed Hessians are used. In the future those approximations could be readily 
replaced by exact derivatives. 


3.7 Profiling of Primal and Adjoint CFD Solvers 


3.7.1 Compilation of Discrete Adjoint OpenFOAM 


The compilation of (discrete adjoint) OpenFOAM requires a small subset of C++11 and C++14 
features (mostly for template resolution). Therefore, a somewhat recent compiler is required to 
compile the discrete adjoint OpenFOAM framework. ‘The compilation has been tested with gcc, 
clang, and the Intel icc compiler and different MPI implementations (OpenMPI, Intel MPI). 





In ‘Table 3.8 the compilation time for the OpenFOAM core package and a subset of solvers for 
adjoint mode (als), tangent mode (tls) and passive mode are listed. For the release binaries, 
optimization flags (-03) are set and debug symbols are disabled. 

Between passive mode and als mode, the compile time increases by roughly 30%, regardless of 
the used compiler. For adjoint and passive mode, clang compiles the fastest, for tangent mode 
the compile times between all compilers are roughly on par. 








For all configurations (including passive mode), a template instantiation depth of at least 41 is 
required. In absence of a documented compiler statistic reporting the instantiation depth, this num- 
ber was obtained by gradually increasing the allowed template depth via the -ftemplate-depth 
compiler flag, until no errors occur during compilation. ‘This finding underlines the complexity 
and reliance on templating of the code, making modes of AD other than operator overloading 
very challenging. 


3.7.2 Identifying Hotspots 


In order to understand the impact of AD on the run time and memory behavior of a given 
problem, it is important to accurately quantify where most of the run time and memory is spent 
during program execution. For run time profiling, several different approaches were tried. Namely 
profiling with gprof, the callgrind tool of valgrind and gperftools. All of these methods 
work by polling the program state at fixed (high frequency) intervals, extrapolating from these 
intervals how much time is spent in which subroutines. The function names and other program 
internals are obtained from the debug symbols inside the executable. 





Table 3.8: Compilation times of discrete adjoint OpenFOAM (optimized build with -03) on 
4 cores with 8 threads for passive, adjoint (A1S) and tangent (T1S) mode with different 
supported compilers. 


Compiler A1S (min) T1S (min) Passive (min) 


g++ 4.9 42.05 33.60 31.05 
gt++ 5.4 40.55 32.00 29.60 
g++ 6.3 40.50 32.40 30.20 
clang 3.8 37.50 32.55 26.20 
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For assessing the memory consumption related to the adjoint mode, we use function level 
instrumentation. Function instrumentation allows to inject additional code, which is executed at 
entry and exit of each function (also for inlined functions). This can be used to precisely track 
the tape size, as well as current execution time at the entry and exit of each function call. 

Three different approaches to inject the additional function calls into the code were investigated: 


Using constructors and destructors. An auxiliary object is constructed at the entry of each 
function. The constructor and destructor (which is called once the function has returned 
and the object runs out of scope) of the object can be used to call the custom functions, 
which perform the run time and memory tracking. The auxiliary function is inserted into 
the code by performing a code transformation with the clang-rewriter tool’. This tool 
allows to identify function declarations in the AST, subsequently inject statements into the 
AST and transform them back to code. One advantage of this method is that the name and 
signature of the executed functions can be passed to the profiler directly as a string. ‘The 
disadvantage is that the code rewriting process is complex and time consuming. It tends to 
miss certain functions, e.g. functions which are generated by C preprocessor macros. 





Using the -finstrument-functions feature of gcc and clang. The -finstrument-functions 
compile flag defines the functions func_enter and func_exit. ‘These functions are automat- 
ically called at the entry and exit of each function call. They can be overloaded to perform 
the desired actions. ‘The advantage is that the function hooks are directly injected by the 
compiler, and thus no function calls are missed. One disadvantage is that the function 
name and signature of the instrumented functions are not available readily, but have to be 
reconstructed from the debug symbols in the executable. ‘The lookup from debug symbols 
is rather expensive, though the overhead for repeated lookups can be kept low with a hash 
map implementation. 





Using the -fxray-instrument feature of clang-5.0. This feature, recently added to Ilvm clang, 
allows to instrument functions much like -finstrument-functions, but gives greater 
control over the granularity of instrumentation through file and function lists. Furthermore, 
it respects an instrumentation threshold that allows to exclude functions with very high 
call counts but minimal impact (e.g. operator []). 


Here we focus on the second approach, as it produced the most reliable results. ‘The third option 
would be preferred, due to the better granularity control, but the implementation in Ilvm seems 
incomplete at this point in time and documentation is lacking. 

Instrumenting all functions produces a lot of data, which on a very fine granularity level is not 





of much use and considerably slows down program execution, as the instrument functions are not 
inlined by the compiler. ‘Therefore, we deliberately disabled the instrumentation for regions of the 
code that mostly concern the handling, calculation, and storage of data on a very low level (e.g. 
operator[] on vectors). The influence of these omitted functions is not lost, but accumulated 
into functions higher up in the call tree. Instrumentation was omitted for the src/OSspecific/ 
and src/OpenFOAM/ folders in the source tree. Instrumentation of functions outside of the scope 
of OpenFOAM was disabled by blacklisting code in the system folders /usr, /1ib and /etc. 

In instrumentation mode, dco/c++ creates a trace of the program execution, which lists all 





encountered functions in chronological order, as well as the connectivity of the functions. From 
this information, the full call graph of the program can be reconstructed. For every (instrumented) 
function entered and exited, the following information is stored: 


“https: //clang.llvm.org/doxygen/classclang 1 1Rewriter.html 
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Number of children: Number of functions called by the function. 

Memory size: Number of tape entries created by the function excluding children. 
Cumulative memory size: Number of tape entries created by the function including children. 
Input count: Number of adjoint inputs. 

Output count: Number of adjoint outputs. 

Function name: Name of the function, stripped from function and template arguments. 


To obtain a run time profile of adjointSimpleFoam, we run the instrumentation on the laminar 
problem of the angled duct introduced in Section 3.1.2. Due to the high run time overhead of the 
instrumentation mode, only one iteration of the adjointSimpleFoam solver is traced. 

In Figure 3.27 the memory sizes for all (direct) child functions of the main function are shown. 
The blue bars show the tape memory used for a single step of the adjointSimpleFoam solver 
in black-box mode. The red bars show the same functions, but with symbolically differentiated 
linear solvers. Consequently the red bars are considerably lower (observe the logarithmic scale of 
the x axis) than the blue ones for the linear solver calls and identical for the rest. Consequently, 
the main complexity (at least in terms of memory consumption) shifts away from the linear 
solvers to different places in the code. 

In Figure 3.28 we show the same information for the Pitz-Daily example, with k-e turbulence 
model. We see the same general behavior as with the laminar case, but with an additional 
memory spike for the correction of the turbulence model. The kEpsilon::correct() routine 
embeds the two linear solvers, required to solve the turbulence equations. Therefore, the memory 
consumption differs between black-box and symbolically differentiated linear solvers. 

The full information of the instrumentation trace is best explored interactively, allowing to limit 
the high information density to regions of interest. A screenshot of a tool developed to visualize 
the call history is shown in Figure 3.26. The figure shows a breakdown of the function calls, 
including sub calls of up to level eight, for one iteration on the Pitz-Daily case. The hierarchical 
function calls are arranged in a sunburst diagram. The area of the slices correspond to run time 
or tape memory size. 


Dasher Ae 





Figure 3.26: Screenshot of interactive instrumentation visualization tool. 
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fv Matrix::relax 
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Figure 3.27: ‘Tape size consumed by function calls in adjointSimpleFoam for the angled duct 
case. Black-box differentiation in blue and symbolic linear solvers in red. 
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Figure 3.28: Tape size consumed by function calls in adjointSimpleFoam for the turbulent 
Pitz-Daily case. Black-box differentiation in blue and symbolic linear solvers in red. 
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3.7.3 Application of Profiling Results 


Inspecting the detailed breakdown of function cost, one can begin to identify routines which 
can be further optimized by exploiting application insight. As an exemplary case study, we will 
inspect the function lduMatrix: :H(), which ranks highly in the list of functions obtained by the 
instrumentation. This function calculates the negative product of the off diagonal entries L,U of 
a sparse matrix A = L+ D+U in 1lduFormat with a vector w: 


Hy =—(L+U)p=(D-A)w. 


The product is implemented component wise in a loop over all faces in src/OpenFOAM/matrices/ 
lduMatrix/lduMatrix/lduMatrixUpdateMatrixTemplates.C as: where Hpsi holds the result 
of the product, lower and upper hold the matrix coefficients J, and u of the lower and upper 
triangular matrices L and U, and 1Addr and uAddr store the row and column indices L and U of 
the matrix coefficients. 

One use case for the H() operator is the assembly of the pressure correction equation from the 
calculated velocities (e.g. in simpleFoam). 

In this application, lower and upper are scalar fields and psi is a vector field (i.e. the 
individual entries psi[1Addr[face]] are in R*). The adjoints can be calculated symbolically 
by the following code, where a1l_lower, ail_upper, al_psi and ai_Hpsi are the adjoint vectors 
corresponding to lower, upper, psi and Hpsi. This function is a good candidate for introducing 
symbolic adjoints, due to its limited scope and easy to derive derivative. ‘The adjoint code was 
implemented by applying AD to the computational kernel from Listing 3.19 by hand, according 
to the rules introduced in Section 2.7.2. For more complex codes, a source code transformation 
tool could be used. 

The scalar vector multiplication in Lines 3—4 in Listing 3.19 induce scalar products (calculated 
by the OpenFOAM ampersand operator) in the adjoint code (Listing 3.20) in Lines 4 and 8. This 
is illustrated by the following example with scalar A, and vectors x,y € R”: 


y = Ax 


7 Oy Be n—1 


7 Oy\* _ _ 
x= Ax -y=Ay. 


Listing 3.20 shows the hand adjoined loop over the faces, calculating the adjoints of the matrix 
entries L and U and the vector yw. Due to the split of primal and adjoint calculation, additional 


// Input: lowezgumupper, psi 
for (label face=0; face<nFaces; facett) 
1 
HpsiluAddr[face]] -= lower[face]*psillAddr[face]]; 
HpsillAddr[face]] -= upper[lface]*psiluAddrl[face]]; 
i 
// Output Hpsi 


Listing 3.19: Implementation of the HQ) operator. 
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for (Foam::label face=0; face<nFaces; facett+){ 
// adjoin HpsiluAddr[face]] -= lower[face]*psillAddr[face]]; 
ail_lower[face] -= psillowerAddr[face]] & al_HpsilupperAddrl[face]]; 
ai_psillowerAddr[face]] -= lowerlface] * a1t_HpsilupperAddr[face]]; 


// adjoin HpsillAddr[face]] -= upper[face]*psiluAddr[face]]; 
ai_upper[face] -= psilupperAddr[face]] & al_HpsillowerAddr[face]]; 
ai_psilupperAddr[face]] -= upperl[face] * a1t_HpsillowerAddr[face]]; 


Listing 3.20: Loop over faces to calculate adjoints of H(). 


boilerplate code is needed to correctly insert the symbolic adjoint calculation into the tape. This 
code, in slightly simplified form, is given in Listings 3.21 and 3.22, showing the creation of the 
gap in the tape and the symbolic calculation of adjoints during the interpretation. The process 
using adjoint callback functions is very similar to the implementation of SDLS. 


The code calculates the product Hpsi, and thus this vector is registered as an output of the 
function H(). Its adjoints become an input to the calculation of the symbolic adjoint. 





Due to the product rule, the primal values of lower, upper, and psi are needed in the reverse 
section. Copies of those vectors are saved in a checkpoint. ‘To access elements of the adjoint 
vectors at the correct positions, the LDU addressing needs to be available in the symbolic adjoint 
function as well. The addressing of the matrix indices is assumed to be constant throughout 
the program execution. ‘Thus, only a pointer to it is stored, instead of a full copy. If the mesh 
topology, and thus the connectivity of the faces, changes during program execution, full copies 
have to be saved. 





The vectors lower, upper, and psi are the inputs to HQ) and are registered as inputs with the ad- 
joint helper object D. Those adjoints have to be incremented with the calculated symbolic adjoints 
at the end of the adjoint callback function. 


The size of the checkpoints, input, and output vectors all correspond to the number of cells 
in the mesh. In contrast, the number of tape entries created by the loop, which calculates the 
matrix vector product, over the faces obviously depends on the number of faces in the mesh 
(two assignments are recorded for each face). The symbolic treatment avoids creating those tape 
entries. ‘he improvement achieved by the symbolic implementation is therefore expected to be 
higher the denser the matrix becomes, as the density is determined by the ratio between faces 
and cells. The savings should thus be best for complex polyhedral 3D meshes, where each cell is 
connected to multiple other cells. 








In ‘Table 3.9 we show the memory consumption and run time for the 2D reference cases from 
Section 3.1.2, as well as for the OpenFOAM motorbike case, which has a more complex 3D mesh. 
The savings observed in practice by the symbolic implementation of H() are pretty minor (< 5%), 
due to the rather high amount of data which needs to be checkpointed. However, no cases with 
decreased performance were observed. ‘Thus, the optimization is still worthwhile and serves as a 
template for further optimizations. 
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1 Foam::Field<Foam::vector>> Foam::lduMatrix::H(const Field<Foam::vector>& psi) 

2 i 

3 Field<vector> Hpsi(lduAddr().size(), Zero); 

4 if (lowerPtr_ || upperPtr_){ // only needed if matrix not diagonal 

5 const label nFaces = upper().size(); 

6 

id ADmode::global_tape->switch_to_passive() ; 

8 D = ADmode::global_tape->create_callback_object<ADmode:: 
extermalladjointl_obyect_t -() ; 

9 // register adjoint inputs 

10 forAll(lower(), i) 

i D->register_input (lowerPtr[i]); 

12 

13 forAll (upper ©, i) 

14 D->register_input (upperPtrl[i]); 

15 

16 cope ALL (Govan sa) )) af 

iy D->register_input (psiPtr[i][0]); 

18 D->register_input(psiPtr[li][1]); 

19 D->register_input (psiPtr[i][2]); 

20 } 

21 

22 D->write_data(lower()); 

23 D->write_data(upper ()); 

24 D->write_data(psi) ; 

25 D->write_data(&(lduAddr().lowerAddr())); 

26 D->write_data(&(lduAddr().upperAddr())); 

27 

28 for (label face=0; face<nFaces; face++){ 

29 HpsilLuAddr[face]] -= lower[face]*psiPtr[lAddr[face]]; 

30 HpsillAddr[face]] -= upper[lface]*psiPtr[luAddr[face]]; 

31 } 

32 

33 forAll(Hpsi, i) 4 

34 Hpsili][0O] = D->register_output (dco::passive_value(Hpsili][0])); 

35 Hpsili][1] = D->register_output (dco::passive_value(Hpsili][1])); 

36 Hpsili][2] = D->register_output (dco:: passive_value(Hpsili][2])); 

37 } 

38 ADmode:: global_tape->insert_callback<ADmode::external_adjoint_object_t>(gapH 
»D); 

39 ADmode::global_tape->switch_to_active() ; 

40| } 

Al return Hpsi; 

42) } 





Listing 3.21: Primal code for H(), augmented to enable symbolic differentiation in the reverse 
interpretation. 
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1 inline void gapH(typename Foam::ADmode::external_adjoint_object_t *D){ 


2 
3 
4 
5 
6 
ag 
8 
9 


const Foam::scalarField& lower = D->read_data<Foam::scalarField>() ; 
const Foam::scalarField& upper = D->read_data<Foam::scalarField>() ; 
const Foam::vectorField& psi = D->read_data<Foam::vectorField>() ; 


const Foam::labelUList& lowerAddr = *(D->read_data<Foam::labelUList*>()); 
*(D->read_data<Foam::labelUList*>()); 


const Foam::labelUList& upperAddr 


Foam::scalarField ai_lower(lower.size() ,Foam:: Zero) ; 
Foam::scalarField al_upper(upper.size() ,Foam:: Zero) ; 
Foam::vectorField al_psi(psi.size() ,Foam:: Zero) ; 
Foam::vectorField ail_Hpsi(psi.size() ,Foam:: Zero) ; 


forAlilCaleHpsa , 1) 
for (int 9-0; <3; 40) 
ai_HpsililLlj] = D->get_output_adjoint (); 


Foam::label nFaces = upper.size(); 

for (Foam::label face=0; face<nFaces; facet+){ 
ai_lower[face] -= psillowerAddr[face]] & al_HpsilupperAddrl[face]]; 
ai_psillowerAddr[face]] -= lowerlface] * a1_HpsilupperAddr[face]]; 


ail_upper[face] -= psilupperAddr[face]] & al_HpsillowerAddr[face]]; 
ai_psilupperAddr[face]] -= upperlface] * a1t_HpsillowerAddr[face]]; 
ii 


forAll (lower ,i) 
D->increment_input_adjoint (dco:: passive_value(ai_lower[i])); 


forAll Cupper .2 ) 
D->increment_input_adjoint (dco::passive_value(al_upperl[i])); 


forAll Cosi 51) 


for (inte) —O5 I<og +) 
D->increment_input_adjoint (dco::passive_value(ai_psilillj])); 


Listing 3.22: Symbolic adjoint code for H(). 


Table 3.9: Comparison between regular HQ) and symbolic adjoint implementation. 





Test case Variant Adjoint vector size Total tape size (MB) Run time (s) 
Ancled duff regular H() 177467477 6093.35 25.75 
© symbolic HO 171887480 5966.35 25.68 
Pitz-Dail regular H() 121031438 3951.65 10.56 
Y symbolic HO 118963913 3897.06 10.50 
Motorbike regular H() 709475801 29049.55 91.53 
symbolic H() 686295043 24420.04 91.50 
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3.8 Overcoming AD Memory Limits 


3.8.1 File Tape 


The AD tool dco/c++ supports multiple options to store the tape in memory (RAM or disk): 


Blob Tape: Use an adjoint stack of fixed length. Memory is reserved when the tape is created. 
Most performant option, as no bounds need to be checked. 





Chunk Tape: Use an adjoint stack which consists of multiple chunks, that are allocated on 
demand. Most flexible option, slightly less performant. 


File Tape: Use chunks, but buffer full chunks to files on the file system. File IO is handled by 
the kernel and chunks may be fully buffered in memory. Significantly slower than both 
blob and chunk tape, but allows running big calculations when checkpointing and offloading 
with AMPI is not feasible. 


The file tape can be utilized when memory demands are high and checkpointing is hard to 
implement. One such case would be, if for an iterative simulation one time step does not fit into 
the tape, requiring a more granular checkpoint level. ‘To get reasonable performance from the 
file tape, fast memory is needed. ‘The access patterns are sequential writes during the recording 
phase and sequential reads from the adjoint stack and random access read and writes on the 
adjoint vector during the interpretation phase. 

To predict the performance of the file tape, multiple tests were performed on different architec- 
tures. Studied were the performance of a SSD RAID (6* 1024 GB in RAID 0) and an Intel NVME 
installation on the RW'TH compute cluster. For RAID 0, the capacity of a volume is the sum of 
the capacities of the involved disks. The latency and throughput is improved by striping, i.e. data 
of consecutive writes is distributed over multiple disks, reducing the IO bandwidth needed on the 
individual disks. For random access, it also improves latency as multiple read/write operations 
can be performed at the same time. 





Table 3.10: SSD benchmark reading/writing of 10GB of data in 10 MB blocks with GNU dd. 
Average is taken over ten samples and rounded to next multiple of 5 MB. 


Mode ddflag SSD IO (MB/s) NVME IO (MB/s) 


write no flags 950 1570 
direct 1765 1675 
dsync 830 705 
sync 710 740 
nocache 910 1250 

read _no flags 390 1580 
direct - - 
dsync 390 190 
sync 340 1635 
nocache 900 1580 
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Figure 3.29: Read and write IO usage of the SSD/NVME devices for 40 iteration steps. Approx. 
300 GB of data is written to disk during the augmented forward run, and read back during 
the adjoint propagation phase. 


The sustained read and write performance for the SSD and NVME systems, benchmarked by 
the GNU dd command, is shown in Table 3.10. Different modes of disk access are specified by the 
-iflag/-oflag. Data is read/written in consecutive chunks of block size 10 MB. For small blocks, 
the read and write performance is significantly reduced, however as dco/c++ reads and writes the 
adjoint stack in chunks (corresponding to the chunks of a regular tape), the full read and write 
bandwidth should be used by the dco/c++ file tape implementation. For the SSD system, the OS 
caches are flushed after each benchmark run. We do not have root access to the NVME systems, 
the caches can thus not be manually flushed. For the NVME system, the RAM is artificially filled 
as much as possible by another process, to avoid skewing of the results by system level read/write 
buffers. The numbers obtained by these benchmarks correspond to the maximum read/write 
rates observed with dco/c++, with the exception of SSD write speed, where the full bandwidth 
indicated by dd in direct mode could not be achieved. 








To evaluate the file tape performance, we study a case run with adjointSimpleShapeFoam on 
a S-bend geometry for 40 iteration steps. The case consumes 307 GB of tape memory, which 
is written to the hard disk in chunks of one GB, (i.e. one file is created on the disk for every 
chunk). The adjoint vector is kept fully in RAM, due to the non consecutive memory access 
pattern involved in the reverse interpretation of the tape. The high latency (compared to RAM) 
of disk based storage makes non consecutive (random) memory access expensive. For this case, 
the adjoint vector occupies 70GB of RAM. For bigger cases, the size of the adjoint vector will 
quickly become a bottleneck. A technique to overcome this bottleneck will be presented in the 
next section. 











The IO bandwidth (as measured by iotop) during augmented forward run and adjoint reverse 
propagation is shown in Figure 3.29. As SDLS (see Section 3.4) is enabled, new entries for the 
adjoint stack are only generated outside of the linear solver iterations. During the augmented 
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forward run the NVME system consistently reaches a write speed of 1.8 GB/s. This is fast enough, 
such that the pauses in the stream of write operations due to the symbolic differentiation of linear 
solvers become visible in the IO bandwidth. The SSD system reaches a level of 650 MB/s, after 
an initial peak of higher bursts. We suspect these peaks to be a combination of OS buffering 
and the SSD write caches, which both become saturated after roughly 100 GB of data has been 
written. For the SSD system, no idle periods in the IO bandwidth are visible, as the Linux kernel 
buffers the write operations in memory. Apparently the write performance is not high enough 
to completely drain those buffers during the idle periods. However, the write performance for 
the SSD systems is high enough not to majorly impact the forward run time, compared to the 
NVME system. 

The read performance of both the NVME and SSD system is considerably lower than the write 
performance, maxing out at 300 GB/s for the SSD based system and 550 GB/s for the NVME 
system. In contrast to the write operations, the read operations are blocking. Therefore, if a 
chunk of memory is not already contained in the OS cache, the calculation will stall until the 
chunk is completely fetched from disk. The read performance can potentially be improved by 
prefetching some data, reducing idle time in the IO subsystem. The lower read performance of 
the SSD system clearly effects the run time of the adjoint propagation, as it takes considerably 
longer on the SSD system than on the NVME system. 


3.8.2 Adjoint Vector Compression 


As already mentioned in the previous section, the adjoint vector, which in dco/c++ is stored in 
RAM to retain acceptable random access performance, limits the usefulness of offloading the 
adjoint stack to secondary storage. 

In order to calculate the adjoints for each entry of the adjoint vector, the entries of the adjoint 
vector connected to it by outgoing edges must be accessible. For iterative methods, where many 
changes in the states are only local to the current iteration and do not directly influence any 
values in the next iterations, the required adjoint vector size can be bounded by the longest edge 
in the tape. Once the tape interpretation has moved past a certain element in the adjoint vector 
(that is all partial derivatives of the element have been incremented along its outgoing edges), 
its location in the adjoint vector can be reassigned to another element, which comes later in the 
interpretation procedure and has not yet been incremented. ‘The number of elements required 





to be present in the compressed adjoint vector at the same time is bound by the maximum 





distance between two elements, connected by an edge, in the uncompressed adjoint vector. The 
concept of the compressed adjoint vector was introduced in |NL18]. This thesis will focus on the 
implementation in the context of complex iterative algorithms, the performance cost associated 
with the adjoint vector compression, and a novel implementation to mostly recover the added cost. 





We will first illustrate the adjoint vector compression with an example, before showing it more 
rigorously. As a simple iterative algorithm we consider the Babylonian root finding method. This 
fixed point iteration approximates the square root of a positive real number RT > R*™ : 2 = a 
by the following iteration procedure: 

1 a 
t=5(2+7). 


This iteration procedure can be straightforwardly derived from Newtons method (see Section 2.9.2), 
applied to find the root of f(x) = x? — a, as shown in Equation (3.6). 
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(wi) _ | tira _l (=, 7 <) (3.6) 


Lit] = Ly — f'(i) a 20; 9 
It was developed independent of Newtons method several centuries earlier by Babylonian and 
Greek mathematicians, hence the name. 
From an initial guess x9 = a = 2 we find the square root 73 V2 with an error of 0.00015% 
after only 3 iterations. To introduce some intermediate variables, which are local to the iteration, 
we decompose the fixed point iteration into the following equivalent SAC: 





ADtype x = a; // initial guess 
for€int i = 0; i<3; it+){ 
ADtype vi = a/x; 
ADtype v2 = vi + x; 
x = 0.5*v2; 
Ir 


Listing 3.23: SAC of Babylonian root finding algorithm for three iterations. 


Using the adjoint method of AD we calculate the approximation of the derivative 


aye) _ 
Ox ——_ 2/2 


by adjoining the three iterations of the fix point iteration up to an error of 0.00177%. 

Figure 3.31 shows the tape structure generated by the three steps of the algorithm. ‘The 
parameter a is incremented from each iteration step, the adjoint of a must therefore be available 
in the adjoint vector during the interpretation of all iterations. However, the individual iterations 
do not depend on any other iteration steps, except of the state of the directly preceding iteration. 
Outgoing edges of an entry of the adjoint vector only influence entries which are located before it 
in the adjoint vector (that is the partial derivatives were created earlier in the primal evaluation 
phase). Entries below the current interpretation location are never read from or incremented 
again. Those adjoints can be safely discarded, assuming they are not desired for some other 
adjoint calculation, and the space in the adjoint vector may be reused for another entry. We will 
now formalize this observation. 
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Theorem 6. 

Let | denote the maximum distance between two entries e;,e;,1 > 7 of the adjoint vector that are 
connected by an edge (e;,e;) € E. Then the number of elements, that need to be retained in the 
adjoint vector, 1s bounded by sharply [+ 1. 


Proof. By induction we will show that a rolling window over the adjoint vector of length / + 1 
is enough to perform all increments. Let S = [s9,...,5n—1] € IR” be the full adjoint vector 
and S*? = ISmax(0,n—i—l)> ++ +1 8n—i—1] be a sub vector including at most / + 1 consecutive elements 
of S starting at index max(0,n —7i-—1). This can be pictured as a rolling window moving 
backwards over the full adjoint vector. 


Base Case: Incrementation of adjoints connected to adjoint entry s”~*~! for i = 0: 
All adjoint vector entries s; with (sn_1,s;) € E’ have to be incremented. As the distance 
between s,—; and s; is at most 1, the adjoint vector S° = [s,_1,...,8,—1] contains all 
required entries s;. Note how the element from which all edges relevant for this step 
originate is the last entry of the rolling window. 





Inductive step: incrementation of adjoints connected to adjoint entry s”~*"! fori =k-+1: 
The last element of the previous inductive step is not needed any more, because all outgoing 
edges s; with (s,z,s;) € EH have been incremented and there can not be any backward 
pointing edges (s’, s’) ¢ E for all 7 > i. We can thus transform S* into S**! by removing 
the last element and inserting one more element at the front, namely s,,_j_(,41). By the 
definition of | the vector S*t+! again includes all elements s; with (8p_1-(n41); 83) © £, 
required to increment all entries connected to the last entry of the adjoint vector S*t1. 


The induction terminates when all entries of the adjoint vector have been fully incremented 
(for 7 =n — 1). CI 





The naive adjoint vector compression breaks down, if there are edges in the tape that span multiple 
iteration steps. ‘This is commonly the case if each state depends on a set of parameters, that is 
defined at the beginning of the program. For illustration, see the dashed arrows in Figure 3.30. 
‘To overcome this, the set of parameters is made available at each point of the tape interpretation. 
Assuming that the first p entries of the adjoint vector S are parameters, the size of the compressed 
adjoint vector can again be bound by S* € R's+!+?, with the following new definitions for S* 
and [,: 


l, — max 1 i= ) 
(s;,8;)€E,j>p 


ah 
Ss = So, -++)5p—-1,5n—l,—itpo-++> oa ; 


We call the set of parameters fixed at the start of the adjoint vector perpetuated parameters |NL18]. 
Then /, is the length of the longest edge in the tape, not connected to a perpetuated parameter. 

In practical implementation, the transformation of vector S* to S‘t! would either need copying 
of data, or a more complex linked data structure that allows to remove the first element and 
append a new element at constant cost. ‘To avoid this added complexity, instead the addressing 
into the vector is implemented relying on the cyclic properties of the modulo operation. 

This allows to reuse the same allocated memory for a sequence of vectors S; without the need 
to copy any values. ‘This is best illustrated with an example. The cyclic shifting of the adjoint 
vector is illustrated in ‘Table 3.11. 
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Table 3.11: Adjoint vector of length eight, with longest edge three. Rolling window compression 
(left) and modulo shifting (right). Current position in interpretation underlined. 
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Uncompressed tape representation of the Babylon root example. 
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Figure 3.31: Adjoint vector compressed to the longest edge between states, excluding edges from 
states to parameters. Dashed lines indicate references to the virtual uncompressed adjoint 
vector. 





The interpretation phase, using the adjoint vector compression technique, for the Babylonian 
example is illustrated in Figure 3.31. There is only one perpetuated parameter a. ‘The longest 
edge, not connected to parameter a, spans two nodes. ‘The required size for the compressed 
adjoint vector is thus n,+i+1=1+2+1=4. 

To further illustrate the principle of this approach, an implementation of a basic AD tool, 
applied to the Babylonian root problem, is included in Appendix D. The tool implements both the 
tape interpretation for a regular tape and a modulo compressed tape. For the exact transformation 
of the full adjoint vector to the individual entries of the modulo compressed adjoint vector, refer 
to this code. The implementation in dco/c++ is functionally identical, however not as transparent 
due to the added complexity of using a template engine. 

The main benefit of the adjoint vector compression is that the part of the tape that needs to be 
stored in RAM can be made independent of the number of iterations. ‘This allows to differentiate 
through a high number of iterations by offloading the growing part of the tape to HDD/SSD 
storage while keeping the latency sensitive adjoint vector in RAM completely. 


Implementing Adjoint Vector Compression in OpenFOAM 


The only change required in the discrete adjoint OpenFOAM framework, in order to activate the 
compression of the adjoint vector, is to replace the adjoint datatype dco: :gals<double>: :type 
with the dco: :gais_mod<double>::type data type. ‘The compression of the adjoint vector 
is not enabled by default, as it adds a run time penalty to the tape interpretation phase. 
The added modulo operations add additional operations and prohibit the CPU from utilizing 
speculative execution. 
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Figure 3.32: Run time of 16 iterations of the Pitz-Daily case using the uncompressed (als) and 
the modulo compressed (als _ mod) adjoint vector implementation. 


As a heuristic to determine the set of parameters which need to be perpetuated, dco/c++ 
treats all variables that are explicitly registered with the tape as parameters. After changing the 
adjoint data type, the remaining code of adjointSimpleFoam can be left unchanged, as the first 
action after creating the tape is already to register all parameters a, making them perpetuated 
by default. 

Before we examine the memory savings achieved by the adjoint vector in detail, we will first 
look at the run time. The run times for 16 iterations of the adjointSimpleFoam solver on the 
Pitz-Daily case, with and without adjoint vector compression, is shown in Figure 3.32. While 
the run time of the augmented primal evaluation stays nearly identical, the time required for 
the interpretation of the tape grows considerably. This illustrates the additional computational 
complexity introduced by the modulo operation, performed with each increment of adjoint 
variables. ‘The time required for the allocation of the adjoint vector decreases, already hinting at 
a reduction in RAM usage. 

The extent of the run time penalty, introduced by the modulo operator, and consequently an 
optimization to reduce the overhead of the reverse adjoint propagation, will be presented in the 
following section. 

















3.8.3 Bitwise Modulo Optimization 





The introduction of the compressed adjoint vector increases the run time of the reverse adjoint 
propagation phase. Discrete adjoint OpenFOAM seems to be particularly susceptible to this 
increase in run time, other synthetic benchmarks in the dco/c++ benchmark suite exhibit a less 
significant run time increase. The root cause for the run time increase is the execution of the 
modulo operator in every adjoint incrementation. The data flow reversal, at least in parts of the 
calculation, is not memory bound, such that this additional numerical operation has a significant 
impact on the total run time. A naive implementation of the modulo operator might look like 
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Table 3.12: Calculation of (138 mod 8) = 5 using the bitwise operation optimization for divisors 
of power two. 


bz bg 01 bo 

a 1 1 0 1 

6 1 O O O 

b-1 0 1 1 1 
a&(b-1) 0 1 0 1 


the following |Knu97]: 
a mod b=a— |e. 


The actual implementation by the compiler might be optimized, but will still generally require an 
integer or floating point division. 

The run time of the adjoint propagation phase can be reduced by expanding the adjoint vector 
size to the next power of two, allowing to replace the modulo operation by a more efficient bitwise 
operation. On all modern platforms bitwise operators are executed very efficiently, especially 
compared to the floating point division operation involved in the calculation of the modulo 
remainder. In the worst case, the adjoint vector grows by a factor of two, when expanding the 
size to the next power of two. Due to the small adjoint vector size, compared to the stack memory 
size, this should be negligible in most cases. We will now show that the equivalence between 
modulo operation and bitwise AND operation holds for divisors which are powers of two. 





Theorem 7. 
LeticN, b=2' andaé€Nt. Then the following equivalence holds: 


amodb=a& (b—-1), 
where & is the bitwise AND operator. 


Proof. The bitmask b — 1 masks all bits lower than the single significant digit corresponding to 0. 
The bitwise AND operation then strips away all digits of a left of and including the digit in 0, 
removing all integer multiples of b from a. ‘This yields the desired remainder a — | ¢ | b: 


b 
i=] . 
b-1=) 2 
j=0 
flogs(a)| 


a& (b-—1)=a- g=a—|= Ib. 





= 

a 
The bitwise AND calculation, replacing a modulo operation, is illustrated in Table 3.12 on the 
example (13 mod 2°) =). 


To quantify the impact of the operations, we will first analyze the impact of the modulo and 
bitwise AND operator on a synthetic benchmark and its disassembly. The loops in Listing 3.24 
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77 modi ike 

for Cint 9 =O" 71 <<) 1024-5444) 4 
ao s= ae, ib 

} 

// bitwise AND 

fOr (int) 1-070 49 1024) a4) 4 
a=a & (b-1); 

} 


Listing 3.24: Synthetic benchmark for modulo and bitwise AND operation. 


are benchmarked using the google benchmark framework? with random positive integers for a 
and b = 2°. 

As can be seen in the disassembly in Listing 3.25, on systems with a x86 or IA64 based processor 
the compiler schedules a call to the idiv1 |Int16] instruction, which computes the quotient, as 
well as the remainder of two signed integer numbers. ‘The computation can be slightly sped up 
(approx. 5%) by switching a and 6 to signed data types. Measuring the execution time of individual 
instructions on modern architectures is not straightforward, due to instruction fusing and out of 
order execution. The following benchmarks should thus only be interpreted qualitatively. 

The for-loops around the operations are necessary to capture a measurable time delta for 
the bitwise AND operation at all. The benchmark framework polls the loops several hundred 
thousand times to get a reliable average of the run time. The benchmark tool reports an average 
run time of 6328 ns for the first loop and a run time of 100 ns for the second loop, with a standard 
deviation of below one percent. ‘The run time of the second loop can be further reduced to 89ns 
by eliminating the subtraction of one (as the adjoint vector size is known when entering the data 
flow reversal the subtraction can be performed once instead of at each calculation of the modulus). 
Absent of any memory access (a and 0 are held in registers) the bitwise AND operation is faster 
than the modulo operation by a factor of above 60. Assuming the full clock rate of 3.3 GHz of 
the machine can be utilized for the benchmark, this nets an average execution time of 0.25 CPU 
cycles per bitwise AND instruction and 20 cycles per regular modulo instruction, indicating that 
the compiler is able to efficiently vectorize the instructions. 

Switching from the synthetic benchmarks to the application in CFD codes, we expect the savings 
by the bitwise AND operation to be much smaller, as not all operands are stored in processor 
registers. ‘Therefore, memory latency is introduced which will affect both implementations equally. 

In Figure 3.33 the earlier picture showing 16 iterations of the adjointSimpleFoam solver is 
updated with the newly obtained results from the bitwise AND operator. ‘The modulo operations 
added a significant run time overhead to the regular adjoint solver. Almost the whole overhead can 
be recovered by switching to the bitwise AND operation instead. As seen previously the allocation 
of the adjoint vector decreases in run time significantly for both modulo implementations, due to 
the much smaller adjoint vector. 





*https://github.com/google/benchmark 
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VOid LOO (int. a, ant. b)t 
int c=a%,b; 
cae Gl = g 48 (Coal) 3 
i 
VOa det oOoO int fast bf 
02 55 push Arbp 
1: 48 89 e5 mov AGSP 4540p 
4: 89 7d ec mov hedi ,-0x14 (4rbp) 
Ce Soe foes mov hesi,-0x18 (4rbp) 
dic co <=) a /. be 
a: 8b 45 ec mov -Ox1l4(/rbp) ,,eax 
d: 99 cltd 
e: £7 7d e8 idivl -0x18(/%rbp) // signed integer division 
11; $9 55 £8 mov hedx ,-0x8(/rbp) 
mae cl Say fe (Coil) < 
14: 8b 45 e8 mov -Ox18(/rbp) ,,eax 
17: 83 e8 O1 sub $0x1 ,,eax / Mmsubteract V1 
ta: 23 45 ec and -Ox14(/rbp) ,,eax // bitwise AND 
id: 89 45 fe mov heax ,-Ox4(/rbp) 
Ir 


Listing 3.25: Disassembly of Lines 1-4, as generated by gt+ -c -00 -g. Optimization will 
remove some of the move instructions, however the arithmetic operations remain unchanged. 
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Figure 3.33: Run time of 16 iterations of the Pitz-Daily case using the modulo (als mod) and 
bitwise AND (als_ bitw) adjoint vector compression. 
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The four Figures 3.34-3.37 show the same case, with different number of iterations between 
one and 64. The maximum size of 64 iterations is chosen such that the tape fills the 128 GB of 
RAM of the test machine almost completely when not utilizing SDLS. For 64 iterations, the tape 
consumes around 100 GB of RAM, the uncompressed adjoint vector around 19GB. Utilizing the 
modulo operation, the size of the adjoint vector compresses down to 365 MB. When increasing 
the adjoint vector length to the next power of two, the size of the adjoint vector grows to 512 MB. 
The figures clearly show, that the compressed adjoint vector size stays constant for increasing 
iteration counts. The run time for the interpretation increases slightly when utilizing the bitwise 
AND optimization, but grows with the same rate as the uncompressed version. All effects of the 
adjoint vector compression, both in terms of run time and compression, are less pronounced for 
the SDLS enabled runs, as the ratio of instructions treated by dco/c++ compared to the overall 
operations is lower in those cases. 

Summarizing, the modified version of the adjoint vector compression allows to regain most 
of the performance, that was lost when the regular modulo adjoint vector compression was 
introduced. The adjoint vector compression makes large calculations feasible, which would have 
been limited by the memory size of the adjoint vector before. 
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Figure 3.34: RAM usage without SDLS for uncompressed and compressed adjoint vector using 
the modulo and bitwise AND operators. 
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Figure 3.35: RAM usage with SDLS for uncompressed and compressed adjoint vector using the 
modulo and bitwise AND operators. 
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Figure 3.36: Run time without SDLS for uncompressed and compressed adjoint vector using 
the modulo and bitwise AND operators. 
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Figure 3.37: Run time with SDLS for uncompressed and compressed adjoint vector using the 
modulo and bitwise AND operators. 
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4 Implementing Efficient Algorithms for Steady Flows 


After introducing AD to the calculation of full iteration histories, we will now discuss approaches 
that will improve efficiency of the computations for steady state flow cases. 


4.1 Reverse Accumulation 


4.1.1 Introduction 


Reverse Accumulation allows to iteratively accumulate the adjoints of a problem which contrac- 
tively converges to a steady solution (fix point) by adjoining the last iteration step repeatedly. 
The algorithm is extensively documented in the literature |Chr94; Chr92; Gil92], here we focus 
on the application of the algorithm to typical CFD optimization cases. ‘To this end, we assume 
the parameters to be the topology optimization parameters @ € R”¢. 

For the purpose of describing the reverse accumulation algorithm, we again separate our 
problem into a pre-processing step x” = P(a), (iterative) processing step 


x” — F(x°,a) = f” (x*—!, a) nfs] (x*~?, a) o...0 f' (P(a), a) , 








and post-processing step y = J(x*,a) = J(F(P(a)), a). The independent (state) variables x 
are initialized in the pre-processing step, potentially depending on the parameters a@. For laminar 
steady flow, the independents are the velocity, pressure, and face flux vectors x = (U,p, @). From 
the output of the pre-processing step, the state is iteratively converged to the fixed point x”, 
bringing the residual of the Navier-Stokes equations towards zero. ‘The iteration function f can, 
potentially for every iteration step, switch between different execution branches, e.g. due to 
upwinding in the discretization schemes, therefore a different function f’ is assumed for every 
iteration step. After the converged state is reached, the cost function 7 is calculated from the 
last state x*, and potentially also from a. 

As already stated previously in Section 3.2.2, applying AD to the whole iteration process of k 
iterations w.r.t. the parameters a@ yields 


apt af ax! aft 


= A.l 
da Ox! da Oa’ ay) 








with the recursion formula 








of® dx*-1 4 of* k = 1 


dx* Oxk-l1 da Oa 
da Of On" ee 
Ox° Oa —_ 


Reverse Accumulation uses the knowledge, that for a problem which has reached a converged 


state x* = x* the Jacobian 
Oxk 


V f(x") — Oxk-1 
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fri, 








Figure 4.1: DAG of the iterations used for reverse accumulation. x’ denotes the penultimate 
state from which on taping is enabled. The final state x* is calculated from x! by the fixed 
iteration step f*. Compare to the black-box differentiation in Figure 3.5. 


of the state x” = f(x*—') differentiated w.r.t. to the previous state x*~! has also converged to 
a fixed state V f*. Furthermore, the location of the fix point x* is independent of the starting 
point x?, and therefore 








dx* 
_8_ | ig~ 0 ; 
| dx? 
and consequently 
dy 
——|} +0. 
| dx? 








Let x* be the last iterate of a fixed point iteration, xj the penultimate state, and f* the 
last iteration producing the state x* from x'. Instead of evaluating the full chain (4.1) the 
alternative chain 


da \ dxi Oa Oxt Oa Oxt Oa Oa 


























(4.2) 


can be calculated. This is the earlier recursion formula explicitly unrolled for fixed Jacobians 
Of*/Ox! and Of*/Oq@ instead of Oy ox and Of’ /Oa. The chain can be evaluated for an 
arbitrary number of iterations k. Convergence predictions are given in |Chr94|. In our application 
we choose k high enough to ensure convergence of the adjoint fixed point iteration. 

The main advantage of the reverse accumulation approach is that only one iteration needs to be 
captured in the tape. Highlighting the passive and active sections of the procedure, the generation 
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of the output y from x° is illustrated in Figure 4.1. Passive computations are connected by 
dashed lines, computations which need to be captured inside the tape are drawn solid. 

The alternative chain (4.2) can be conveniently evaluated by reverse mode AD. Interpreting the 
tape for the iteration step f* : R"**"¢ — R™~, which generates x* from x", yields incremental 


of \> 
+ (35) -S 
_, (af\' 
Q + ( an Si 
where s € R”™ is an arbitrary adjoint seed vector. 


By choosing the first seed vector as s° = (07/0x*)", we construct the following iteration 
which evaluates (4.2) by repeatedly using the result x of the previous iteration as the seed vector 


for the next iteration: 
s-gi_(@F "60. OF Of” 
Oxt Ox* \ Oxi 


projections of the form 





| 
| 





a 


























eo _@ (Of) 3 _ aS (ary 

Ox! Ox* \ Ox! (4.3) 
be (OL @ p wT (Of \* 
_— (5) -  ax* Vaxt } 


While the adjoint x is reset to zero after each iteration, the adjoint @ is allowed to accumulate 
and yields the desired adjoint approximation for (OJ /Ox* )(df*/da@): 











ae. OF (of) OIF AT Of" 
a 0p = + 






























































Oa Oa “OW Fe Ox* Oa 

ae or DI a7 (Of* af af 

2 1 L,Y 
_— +(4) da (+55 Oa 
=e afe\* AJ ag (af aftafe (aft? Of 

a ) 
a +($) = $4 58 (EOE axt) Ba ae 
qk OF , OF (aft | af af | (Af \” af af)" af” 

— Oa Ox* \ da  OAxt da Oxt } da 7 Oxi Oa 

















SIF a7 aft (aftr? OP \ OF 
, ba | Ox (1498 + (55) Tenet | ei Oa 
With f* being a contractive function, the norm (af*/axt)"] will tend to zero for k > ov, 


making x” an indicator for the convergence of @. 


137 


4 Implementing Efficient Algorithms for Steady Flows 


4.1.2 Implementation in a CFD Setting 


In the following the procedure of accumulating the adjoints is detailed with focus on the usage of an 
AD tool. The seeding and calculation of the adjoints 7 of the post-processing step is left unchanged 
from the standard black-box or checkpointed approaches, yielding the adjoints x* = (O07 /Ox* ye -1 
and & = (97/da)* -1 of the cost function J w.r.t. the final state x* and the parameters a. 

The tape of the post-processing step can be discarded, once the adjoints x* have been calculated. 
The tape of the final iteration step is interpreted from x* back to x', without discarding the 
tape. This calculates the adjoints x', but also begins to accumulate the desired adjoints & by 
incrementing them with the adjoint chain @ = @+(07/0x*)(Of*/da). Next, the adjoints x! are 
extracted from the tape and written to a temporary storage field. This step is slightly complicated 
by the fact, that usually iterative solvers overwrite the existing state with the newly calculated 
one, i.e. x’t! = f(x", a) is actually implemented aliased as x := f(x,a). While the adjoints x! 
exist in the tape, they can not be addressed by the variables x (i.e. dco: :derivative(x) in 
dco/c++ notation), as this would instead yield x* (memory aliasing). Therefore, the location 
of x* in the tape must be saved after first registering the state x in the tape. This procedure 
is similar to the extraction of adjoints of the state using checkpointing. For the OpenFOAM 
implementation, we can therefore reuse parts of the adjoint checkpointing interface for the reverse 
accumulation iteration. 

After extracting x!, the adjoints of all tape entries between x! and x* in the adjoint vector 
are reset to zero, leaving only the already partially accumulated adjoints @ as non-zeroes in the 
adjoint vector. 

Next, the stored adjoints of the state x” are written back into the adjoint state x”t!. The 
tape is then re-interpreted from x* to x', yielding a new adjoint state x! and incrementing the 
adjoints of the parameters a. 

Summarizing the above procedure the following steps needs to be implemented by a CFD solver 
utilizing reverse accumulation, enabling to evaluate the adjoint chains (4.3) and (4.4): 











e Calculate n — 1 iterations in passive mode up to x* (choose n such that solution has 
sufficiently converged). 


e Register all required parameters @ in tape. 
e Store current position of tape > TT. 
e Register the state variables x! in tape. 
e Calculate the nth iteration in augmented forward mode, yielding x*. 
e Store the current position of the tape > T*. 
e Evaluate the cost function J and seed adjoint 7 = 1. 
e To evaluate Cl we proceed as follows: 
— Interpret tape from final position up to 7%. 
— Get adjoints of final state and store them x, = x”. 
1. Set adjoints of the state x* = xg. 
2. Interpret tape from position T* to TT. 


3. Extract adjoints of initial state and store them X, = xX’. 
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4. Reset the values of all adjoints from t* to 7; to zero. 
— Repeat steps (1-4) until ||x,|] < e. 


e Cl is now available in a. 


Utilizing reverse accumulation, the calculation can be sped up considerably, as no checkpointing of 
an iteration evolution is required. The tape re-evaluation is quite efficient compared to augmented 
forward steps, further improving runtime. However, it can be difficult to find an iteration step 
which is a suitable candidate for reverse iteration, if the residuals are noisy. An example for 
the convergence of reverse accumulation compared to the black-box approach is given in the 
following section. 

The source code of an OpenFOAM solver, implementing reverse accumulation, using the 
aforementioned checkpointing interface, is given in Appendix C.1. 





4.2 Piggyback Adjoint Iteration 


The concept of piggyback adjoints in the context of AD was first introduced in [GF03] and further 
detailed in |GW08]. In this approach, the adjoints are propagated alongside the primal values, 
somewhat mirroring the behavior of the continuous adjoint, where the adjoint equations are 
solved alongside the primal equations. In a one-shot optimization setting [Bos+14], the design 
parameters are immediately optimized using the partially converged derivative information. 

The updates of primal, adjoint, and design state are therefore evaluated in one coupled iteration. 
With the functions f’ : R’* x R? > R” which calculate one iteration step of the primal, and 
g: IR" X R™ X R* which calculates the adjoints of the states, as well as the gradient required 
for the update step, the iteration procedure can be outlined as follows: 





for k = 0,7. ..2, ae: 


xktl dl gh (x”, x", a") 
att =a,—P 1 GgE (x®, x*, a") 


where P is a suitable preconditioner, to ensure the convergence and stability of the design opti- 
mization. 

When applying AD, the function g is not given explicitly, but is evaluated by adjoining the 
iteration f”, as well as the calculation of the objective y = J(x*t") after each iteration step. 

The piggyback approach is closely related to reverse accumulation. It differs in that not a fixed 
(last) iteration step of a fixed point iteration is repeatedly adjoined. Instead always the most 
recent iteration f* of the augmented primal is used to obtain the next iterate for the adjoints x*t1. 
New iteration steps are repeatedly calculated and adjoined until the change in both x and x 
fall under a prescribed threshold. Only after this threshold is reached, an update of the design 
parameters @ is performed. The change in @ will in turn increase the residuals of both x and x, 
starting another round of inner piggyback iterations. 

Even without applying an optimization, one advantage of the piggyback approach is that no 
fixed point has to be identified in advance. The method lends itself well to one-shot optimization, 
as the optimization of the parameters @ can be started with a not completely converged gradient. 
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Figure 4.2: Sensitivities obtained by tangent and adjoint modes. The adjoint is piggybacked 
from the initial state and reverse accumulated by repeatedly adjoining primal iteration step 
800. 


The piggyback method has been implemented in discrete adjoint OpenFOAM for the SIMPLE 
algorithm, derived from the simpleFoam solver. As with reverse accumulation solvers, the 
checkpointing interface can be used to help with the handling of the adjoints. Figure 4.2 shows a 
comparison of the convergence of the sum of sensitivities for black-box tangents (which are identical 
to the black-box adjoints), reverse accumulation starting after 800 iterations, and piggybacking 
(without design updates, that is P~! is a zero matrix). The derivatives are calculated on the 2D 
Pitz-Daily case, with SDLS enabled for piggyback and reverse accumulation solvers. ‘The change in 
the derivatives from iteration to iteration is detailed in Figure 4.3. Reverse accumulation converges 
down to machine precision. ‘The convergence of the black-box differentiation and piggyback 
bottoms out after roughly 600 iterations, however the derivatives have converged sufficiently to not 
observe any significant changes in the adjoints. The obtained derivative residuals roughly match 
the chosen linear solver tolerance, a stricter tolerance level will further improve the residuals. 

Further applications of the piggyback approach, including optimization using steepest descent 
algorithm are given in the case study presented in Section 5.1. 
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Figure 4.3: Change in the sum of sensitivities between subsequent iterations. Initial residual 
scaled to one. 





4.3 Verification 


While the results of the sensitivity calculations obtained so far look plausible, they certainly still 
need to be verified. We will achieve this using the following steps: 


e Check the correct implementation of AD on the algorithms, by comparing results obtained 
by the adjoint method with results obtained by tangent mode and FD. 


e Verify that the differentiated algorithms match the physics of the continuous adjoint, by 
comparing to results obtained with the continuous adjoint method. 
The adjointShapeOptimizationFoam solver, supplied with OpenFOAM, is used for calcu- 
lating the continuous adjoints. 


4.3.1 Mesh Independence of Adjoint Sensitivity 


For a steady state problem, the flow fields, obtained by a FVM discretization, should, for increasing 
mesh resolution, converge towards a final state, as truncation errors decay. As the fields are 
evaluated at different positions for each discretization, we evaluate the volume integrals of the 
fields instead to judge convergence. 

The angled duct case is evaluated for mesh refinement levels 5 (325 cells) to 80 (83 200 cells). 
The primal calculation is performed with passive simpleFoam iterations, starting from a pre- 
initialized potential solution. The primal iterations are run until both velocity and pressure fields 
have reached a prescribed convergence tolerance. Afterwards one augmented forward primal 
step is executed and then repeatedly adjoined using reverse accumulation. ‘To speed up the 
calculation, the mesh is decomposed into 8 processor domains. Figure 4.4 shows the number of 
iterations required to achieve the prescribed solver tolerances. The number of primal evaluations 
scales roughly linearly with the number of cells in the domain. ‘The number of adjoint reverse 
accumulations required rises slower, and after a certain point seems to grow logarithmically. 

The change in the velocity, pressure, and sensitivity fields, evaluated as volume integrals, is 
detailed in Figure 4.5. Shown is the change in volume integral, compared to the previous mesh 
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Figure 4.4: Number of primal SIMPLE iterations and number of adjoint reverse accumulation 
iterations needed to bring case to convergence. 


resolution. ‘The curves are normalized and shown in a double logarithmic scale. The change in 
all fields decreases, as the FVM approximations converge towards the solution of the continuous 
problem. The pressure convergence lags behind the velocity, which is common with SIMPLE 
algorithms and is amplified by the lower relaxation factor chosen for the pressure correction 
equation. The formulation of the cost function is dominated by the influence of the pressure, 
therefore it is plausible that the convergence rate of the sensitivity seems strongly linked to the 
convergence rate of the primal pressure. 

Summarizing, the adjoint sensitivity field converges towards a fixed point, as the mesh resolution 
increases. It does so with a convergence speed comparable to the primal iteration. ‘The number 
of iterations required to achieve adjoint convergence are on par, or lower, than the primal. 
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Figure 4.5: Change in velocity, pressure, and adjoint sensitivity field, compared to previous 
mesh resolution. First value is normalized to one. 
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4.3.2 Validation Against Tangent and FD Models 


To validate the implementation of AD, we first use the laminar angled duct testcase. Units are 
introduced to the dimensionless geometry by choosing L = 1m, with the origin at the lower left 
corner of the geometry. 

In Figure 4.6 we show the values for the derivative d7/da at different discrete locations along 
two evaluation lines crossing the domain. The first line Ly connects the points (0, 0.55, 0.05) 
and (5,0.55,0.05), the second line L2 connects (4.55, 0, 0.05) to (4.55, 5, 0.05) (all coordinates in 
meters). The coordinates of the endpoints are chosen, such that the line passes the cell midpoints 
of the cells it penetrates. ‘The location of the lines L; and Lo, as well as the full sensitivity field 
are Shown in Figure 4.7. 

Figure 4.6 compares the results obtained by 400 iterations of adjointSimpleCheckpointingFoam 
to the results obtained by a tangent and FD implementation. For the adjoint results, the symbol- 
ical differentiation of the linear solvers is disabled, to ensure comparability to the tangents with 
up to machine precision. ‘The tangent implementation uses the vector mode of dco/c++, allowing 
to evaluate all 16 reference points with one solver run. A tangent vector size of 16 was utilized, 
using the 16th entry to calculate the sum of all sensitivities. 

For FD, a one sided perturbation with h = 10~° was used. The tangent and FD evaluation 
points are offset by some margin, so that they do not overlap in the plot. The adjoint sensitivities 
are plotted without interpolating between the cells (to allow us to precisely match the tangent 
and FD data points), giving a piecewise constant sensitivity evolution along the lines Ly and Lg. 

The lines intersect the inflow and outflow boundary. At the inflow and outflow boundary, we 
observe a discontinuous behavior in the adjoint field. This is matched by the tangents and FD. 
It is thus not an error in the differentiation, but rather an issue of the primal implementation. 
The artifacts presumably arise from the implementation of the boundary conditions, which is 
continuous for the primal but not necessarily for the adjoint. The artifacts are limited to the 
cells adjacent to the inflow, outflow, and their neighboring cells. Further up- and downstream no 
obvious influence of those discontinuities can be observed. The wall boundary conditions do not 
exhibit such behavior. 

In ‘Table 4.1 we summarize the adjoints for five exemplary points. The tangents match the 
adjoints up to some orders of magnitude of the machine precision. Slight deviations are induced 
due to the different order of application of arithmetic operations between adjoint and tangent mode, 


and the use of the -ffast-math compiler flag which can lead to additional truncation errors!. 





Table 4.1: Adjoints, tangents, and FD for five exemplary cells. Differences between adjoints and 
tangents are marked bold. 


Cell index Adjoint Tangent FD Error of FD 


970 13351.8234523369  13351.8234523345  13351.6289 0.0015 % 
1013 14929.8557699385  14929.8557699349  14929.9356 0.0005 % 
2785 8605.97203473893  8605.97203473516  8605.8119 0.0019 % 
2817 603.880863106184 603.880863106741 603.6694 0.0350 % 
A099 -81.383599564168 = -81.383599564306 = -81.4823 0.1213 % 


“https: //gcc.gnu.org /wiki/FloatingPointMath 
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Figure 4.6: Sensitivities on two lines through the domain, computed with adjoint mode without 
interpolation between cells. The results are verified with 15 points computed by T1V and FD 
mode respectively. 
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Figure 4.7: Sensitivity field of angled duct case. Evaluation lines L; and Lz are shown in white. 
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Figure 4.8: Sum of sensitivities computed with FD, tangent mode, and adjoint mode. FD with 
h = 107° unreliable, all other curves match. 


Figure 4.8 shows the evolution of the sum of sensitivities 





. dJ (x!) 
> = 
2a 
over 150 iterations. This sum can be conveniently calculated by FD and tangent mode with a 
run-time factor of O(1) - cost(J(a@)), by seeding all inputs at the same time: 


So ge ye = St, 27. 
da 


——ge; == e; —_— —_ 
~ da da — da 
i} i} 


In adjoint mode, all sensitivities of a specific iteration step, and thus also the sum, are calculated 
in O(1) - cost(Z(a@)) as well. However, to obtain the sum after every iteration step, the tape has 
to be completely evaluated from the current position back to the inputs after each iteration step. 
This raises the complexity to the number of iterations. The sensitivities after each iteration are 
only needed for this verification task, in practice one would only evaluate the tape once, after the 
primal iteration has finished. 

The tangent and adjoint sums match up to machine precision for each iteration step. FD 
for h = 10~° also matches very closely. FD for h = 10~° exhibits significant noise, however the 
sensitivities obtained (or at least the sum) remain stable and oscillate around the correct value. 


4.3.3 Validation Against Continuous Adjoint Solver 


Next, we validate against the continuous adjoint solver included in OpenFOAM. This solver 
is based on the principles described in [Oth08], the implementation is further documented 
in |OVWO7|. It implements topology optimization for ducted flows and optimizes for power 
loss using a steepest descent approach with fixed step width. For an introduction to the primal 
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Figure 4.9: Sensitivity fields obtained with discrete piggyback solver. Power loss integrated over 
the inlet on the left, power loss integrated over inlet and outlet on the right. Contours of zero 
sensitivity in white. While the field differs at the outflow, regions of negative sensitivity are 
almost identical. 





and continuous adjoint equations, refer to Section 2.5. The cost function is hard coded into the 
boundary conditions adjointOutletVelocity and adjointOutletPressure. The primal and 
adjoint equations are iterated concurrently, the resulting sensitivities are immediately used to 
update the design field @ (this is the continuous equivalent to the discrete piggyback approach). 
The solver calculates the adjoint velocity and pressure U and p, the sensitivities of the individual 
design parameters a; are then calculated as the scalar product of primal and adjoint velocity 
field: 





dJ 
da; 





= A;(U;-U;). 


For the first part of the validation study, we set the step width of the steepest descent optimizer 
to zero. Thus, the sensitivity field for a fixed (zero) field @ is calculated. 

The discrete solver is set up to match the cost function and boundary conditions of the stock 
adjoint solver. ‘This requires to constrain the computation of the power loss to the inlet. If the 
outlet is included, the sensitivity field changes substantially, while still indicating the same regions 
of optimization. ‘This effect is shown in Figure 4.9. 

The sensitivities are again evaluated along the lines Ly and Lo. Figure 4.10 shows the 
sensitivities for the continuous and discrete solvers after convergence (2000 iterations). One can 
see, that the sensitivities line up very well. ‘The irregularities introduced by the discrete solver at 
the inflow and outflow boundaries do not introduce any effects in the inner flow domain, which 
would lead to obvious differences to the continuous solution. 

Figure 4.11 shows a condensed y-axis of the previous plot, focusing on the most important 
region of negative sensitivity. While we see some differences in magnitude between the continuous 
and discrete adjoint solution here, the sign of the sensitivities is consistent between the adjoint 
and discrete calculations. A topology optimization approach would thus penalize the same cells, 
albeit with a slightly different magnitude. For a graphical representation of these regions, we 
show the sensitivity field for the continuous and discrete solver in Figure 4.12. White iso lines 
indicate the zero crossings of the sensitivity, bounding regions of negative sensitivity. The results 
are qualitatively identical to the results obtained with the discrete solver, albeit showing a slightly 
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Figure 4.10: Sensitivities along L; and Lo obtained using the continuous and discrete adjoint 
approach. 
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Figure 4.11: Magnification of the negative section of Figure 4.10. 


147 


4 Implementing Efficient Algorithms for Steady Flows 





seMns 
1.7 10e4+04 





Figure 4.12: Sensitivity field obtained with the continuous (left) and discrete approach (right). 
Iso-lines of zero sensitivity in white mark the regions of interest for topology optimization. 


larger zone of negative sensitivity near the upper right corner of the flow domain. 

In Figures 4.13 and 4.14 we compare the convergence speed of the continuous primal and 
adjoint equations to the convergence of the piggyback approach. As stated earlier, the piggyback 
approach iterates the adjoint along the primals in a similar fashion to the implicitly coupled 
continuous adjoint, making this comparison an obvious choice. The former figure shows the 
convergence of the sum of sensitivities for a mesh of 11 700 cells, the latter for a finer mesh of 
187 200 cells. While the continuous and discrete solver converge roughly along the same trajectory, 
the discrete solver does so with significantly less oscillations, thus arriving earlier at a point where 
the adjoints can be considered reliable. This potentially leads to a better approximation of the 
gradient in a one shot optimization setting. Lowering of the under-relaxation factors of the primal 
and adjoint continuous equations might lead to a reduction of the oscillations, at the expense of 
slower convergence. 
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Figure 4.13: Convergence of the sums of sensitivities of piggybacking and continuous adjoint 
for medium resolution case. 
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Figure 4.14: Convergence of the sums of sensitivities of piggybacking and continuous adjoint 
for fine resolution case. 
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Next, we consider the Pitz-Daily case. For this example, we can not expect a perfect match 
with the continuous results, as the continuous solver available in the public domain uses a 
frozen turbulence assumption, which assumes that the influence of the turbulence onto the 
sensitivities is negligible and can therefore be omitted. The derivation and implementation of 
continuous equations for turbulence models is complex and sometimes not even possible in a closed 
form |Car+10; Nem+11]. The frozen turbulence assumption can be approximately replicated 
with the discrete method, by pausing the tape recording during the calculation of turbulence. 
The turbulent quantities are thus treated as if they were passive. 

The primal flow and the turbulent kinetic energy within the Pitz-Daily geometry is shown in 
Figure 4.15. The sensitivity results of the simulation after 8000 steps (from a zero initialized 
solution) are shown in Figure 4.16. 

The sensitivities computed with the continuous adjoint solver are shown on top. Below that are 
the results obtained with the discrete method. The second image is computed by the piggyback 
approach with differentiated k-e turbulence model. The third image is computed with the same 
solver, but with the taping of the turbulence model switched off. In this configuration the discrete 
model thus replicates the frozen turbulence assumption of the continuous solver. The three images 





look essentially alike, especially the region of negative sensitivity indicated by the white contour 
lines is very similar. The biggest conceivable difference is located at the location of the step. At 
this position a solution singularity in the sensitivity can be observed. The maximum sensitivity, 
obtained at the singularity point by the continuous solver, is roughly 30% higher than the discrete 
sensitivity obtained with both frozen and fully differentiated turbulence. Further downstream the 
sensitivities match very well. 

For further insight, we again plot the sensitivities over lines through the computational domain. 
We observe five vertical lines, cutting through the domain at different distances from the inlet. 
The first line L; is located 0.001 m right of the step, capturing the influence of the singularity 
point. The second to fifth line (Lz to Ls) are located at x = {0.05 m, 0.1m, 0.15 m, 0.2m} behind 
the step respectively. For the first line, both discrete solutions are very similar, indicating that 
the effect of the turbulence on the sensitivities is low. However, the discrete adjoints do not 
match the continuous solution too well. Nevertheless, the sign of the sensitivities is consistent 
between the continuous and discrete results. Away from the singularity (which is located at 0.5 
along the normalized line distance) the results match better. For lines Lz — L5, downstream of 
the singularity, the frozen turbulence and fully differentiated discrete results start to significantly 
differ. From Figure 4.15 we can see that this coincides with the regions of high turbulent kinetic 
energy k. ‘The result obtained by the discrete frozen turbulence assumption is in turn now very 





similar to the continuous result, showing a maximum difference of 7% and an average difference 
of below 2% along the line. Furthermore, the (interpolated) lines of the discrete solution match 
all kinks and features of the continuous lines. 
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Figure 4.15: Velocity magnitude (absolute length of the velocity vector, top) and turbulent 


kinetic energy (bottom) of the Pitz-Daily case. 
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Figure 4.16: Sensitivity fields obtained with the continuous (top), discrete (middle), and discrete 
approach with frozen turbulence (bottom). Iso-lines of zero sensitivity in white mark the 


regions of interest for potential topology optimization. 
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Figure 4.17: Sensitivities along (from top to bottom) lines Ly to Ls. The continuous adjoints 
match the discrete adjoints obtained with frozen turbulence. 
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Figure 4.18: Sensitivities along line L5, obtained by the discrete adjoint model and matched by 
FD. The continuous model with frozen turbulence (drawn dashed) clearly differs. 


As we could not check the correctness of the fully differentiated turbulence models with the 
continuous adjoint model, we again compare those values against FD and the tangent model. In 
Figure 4.18 we exemplary verify the fully differentiated turbulence model with FD for 13 points 
on Ls. The values obtained by FD match the fully discrete adjoint and clearly differ from the 
frozen turbulence model. 


4.3.4 Convergence of FD 


As derived in Section 2.7.1, FD approximates the derivative up to a truncation error of order O(h) 
for one sided differences and O(h”) for two-sided differences. Thus, the difference between FD 
and the derivatives calculated by AD should shrink according to the relations 





JT(a+he;)—TJ(a) dJ/(a) 








h > 
IT (a+ Ns — hej) _ a = 19 = O(R?), 


for some index 7. We will now investigate if these relations hold for our implementation. 

Figure 4.19 shows the difference between the FD approximation and the derivative calculated 
by tangent mode for the cell at location P = (0.02, —0.02,0.0005) of the Pitz-Daily case. Plotted 
are the absolute values of the difference rj and rg of FD and tangent, scaled by the tangent, for a 
range of values h. 

The step width of h is chosen in the range [10~°, 10°]. For relatively big values of h, the forward 
and central finite differences adhere strictly to the reduction factors O(h) and O(h?). For h in 
the range [10~7, 10"], the two sided differences, while still providing a better approximation than 
one sided, behave a bit erratically. We suspect that this is due to the different treatment of 
the a term for positive and negative signs. While positive values of a contribute to the system 
matrix and increase the diagonal dominance of the matrix, negative values are put onto the right 
hand side of the momentum equations. For h < 10~°, the quality of the approximation begins to 
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Figure 4.19: Difference between FD and (tangent) AD for decreasing h. 


deteriorate. For even lower values, the FD values become noisy and eventually unusable. This 
highlights the difficulties involved with FD of finding a suitable step size h. For stiff problems, 
the values of h should be scaled by the central coeficient ag of the FVM discretization, that is 
the corresponding diagonal entry of the discretization matrix. 


4.4 Shape Adjoints 


Shape optimization, in contrast to topology optimization, aims to improve the geometry of the 
domain directly, by manipulating the position of the nodes on the outer shell. ‘This has the 
advantage, that the physical wall boundary conditions are retained, allowing the wall stresses 
to be evaluated more accurately. Other boundary conditions like thermodynamic heat flows 
can be implemented much easier (or no change is required at all), if a physical wall is present. 
Usually, the points on the surface are moved in the direction of the normals of the corresponding 
faces. When the points on the outer surface move, the interior points have to be adapted as well, 
such that the mesh quality does not degrade, leading to unreliable results or lack of convergence. 
Two of the simplest approaches to mesh movement are to solve a system of spring /dampener 
equations or to apply a Laplacian smoothing. The downside of shape optimization is that the 
mesh movement limits the range of optimization, as the design will retain some resemblance to 
the original design, making it likely that a local minimum, instead of the global minimum, is 
hit. It is also challenging to preserve design features during the mesh morphing process, as the 
morphing tends to smooth out sharp features. 

To facilitate the calculation of shape sensitivities in OpenFOAM, we register the positions 
of the individual points in the mesh as soon as they are created. ‘The mesh is set up in 
meshes/polyMesh/polyMesh.C, and the points are created in the constructors 








Foam::polyMesh::polyMesh(const IOobject& io){L...]} 
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and 


Foam::polyMesh::polyMesh 

( 
const I0object& io, 
const Xfer<pointField>& points, 
const Xfer<faceList>& faces, 
const Xfer<labelList>& owner, 
const Xfer<labelList>& neighbour , 
const bool syncPar 


DOr Any, 


respectively. 
After the mesh and boundaries have been created the points on the boundary can be registered 
in the tape, making them parameters for the following calculations. 


forAll(boundary_,patchI)f{ 
forAll(boundary_[patchI],faceJ){ // loop over all boundary faces 
const labelList 11 = boundary_[patchI][faceJ]; 
forAl1(11,j){ // register points of boundary faces 
ADmode:: global_tape->register_variable( points_[1l1l[j]J][0] ); 
ADmode:: global_tape->register_variable( points_[1l1l[jlJ]Lli] ); 
ADmode::global_tape->register_variable( points_[1l1l[j]J][2] ); 
Jr 
i 
i; 


One could also register all points of the whole mesh. However, as we are only interested to move 
the points on the surface, and the inner points are adapted by a mesh smoothing technique, 
we only register the points on the surface to save some tape memory. The set of parameters is 
thusy = Q € R°*”er. 

After the adjoint propagation is completed, the adjoints of the individual points of the mesh can 
be extracted. However, one is usually not interested in the raw adjoints but wants to constrain 
the movement of points to the normal direction of the associated faces. ‘Those adjoints can not be 
directly read from the tape, but can be calculated in a post-processing step from the adjoints of 
the individual points q. For each boundary face, the adjoints of the points contained in the patch 
are interpolated to the face interior by averaging the vectors, giving the face centered adjoint 
sensitivity vector qr. Once the face center vector is calculated, it can be constrained to the face 
normal direction by taking the scalar product of qr with the face normal n. We define this scalar 
product as the shape sensitivity G € RT: 








Or. ‘NFR, 
(= oe O<i<nr, 
Ap. 
where Af is the area of the boundary face, required to obtain a mesh independent sensitivity. 
For better compatibility with post processing tools, the resulting face defined quantity is copied 


to the face adjacent cell. The interpolation procedure is outlined in Listing 4.1. 


4.4.1 Checkpointing of Shape Adjoints 


Conceptually, the application of checkpointing remains unchanged from the case of topology 
optimization. Compared to topology optimization, the pre-processor stage is much more complex. 
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// loop over all boundary patches 
forAll(mesh.boundary(),bi)f{ 
// loop over all faces in boundary bi 


forAll(mesh.boundary()[bi],i)¢f 
// Vast ot point indices, ico, face aon ba 
const labelList face_points = mesh. boundary () [bi].patch() [il]; 
// cell in domain corresponding to boundary face 
const label face_cell = mesh.boundary() [bi].faceCells() [il]; 
const Foam::vector face_normal = mesh.boundary() [bi] .nf() (0) Lil; 
Foam::vector sensVec(0,0,0); 
forAll(face_points ,fp){ 
const point& pt = mesh.points()[face_points[fp]J]; 
sensVec[0] += dco::derivative(pt[0]); 
sensVec[1] += dco::derivative(pt[1]); 
sensVec[2] += dco::derivative (pt[2]); 
Ir 
sensVec /= face_points.size(); 
// scalar product of sensitivity vector with face normal 
sens[face_cell] = (sensVec & face_normal) / mesh.boundary() [bi].magSf() [il]; 
Ir 
} 


Listing 4.1: Interpolation of sensitivity vectors, defined at points, to the corresponding faces. 
The shape sensitivity is computed by computing the scalar product of the resulting averaged 
sensitivity vector with the face normal vector. 


In this stage the parameters, that is the location of the individual points of the mesh (contained 
in the primitive mesh), are used at various locations in the code to construct the CFD mesh 
representation. This mesh construction phase is only executed once and can not be restored from 
a checkpoint easily, therefore it is immediately included in the tape. Following the pre-processing 
phase, the tape is switched off and the usual checkpointed iteration phase begins. After all 
iteration steps have been adjoined, the remaining tape of the pre-processor is adjoined, yielding 
the adjoints of the parameters. 

A naive implementation yields results, which are not consistent with black-box adjoints, 
indicating that some dependencies are missed. ‘hose missing dependencies have been identified as 
the non-orthogonal correction vectors (compare Section 2.1.4) by manually comparing the tapes 
of black-box and checkpointed adjoint. ‘The reason the dependencies are missed is the presence of 
on demand functions in OpenFOAM. Several data fields in the mesh object are stored in dynamic 
memory, and are only constructed once they are first requested by their access routine. 

The following access functions in the fvMesh class create their fields on demand: 

















e C(): Constructs the cell center vector, 





e Cf(): Constructs the face center vector, 


e V(): Constructs the cell volume vector, 





e Sf(): Constructs the face area vectors, 





e magSf(): Constructs the magnitude of face area vectors, 


e deltaCoeffs(): Constructs delta coefficients, 
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void init_mesh(Foam::fvMesh& mesh) { 
mesh.Sf(); mesh. magSf () ; 
mesh.C(); mesh.Cf(); 
mesh.V(); mesh.deltaCoeffs(); 
mesh.nonOrthDeltaCoeffs(); mesh.nonOrthCorrectionVectors() ; 


t 


int main(int argc, char *argv[]) 


{ 
#include "setRootCase.H" 
#include "createTime.H" 


dco::ais::global_tape = dco::ais::tape_t::create(); 
#include "createMesh.H" 
#include "createFields.H" 


init_mesh(mesh) ; 
dco::ais::global_tape->switch_to_passive() ; 
[...] // checkpointed SIMPLE algorithm 

Ir 


Listing 4.2: Forcing the early on demand construction of the fvMesh fields by calling their access 
routines. 


e nonOrthDeltaCoeffs(): Constructs the non orthogonal delta coefficients, 


e nonOrthCorrectionVectors(): Constructs the non orthogonal correction vectors. 


Most of these functions are first accessed during the pre-processor phase, and thus the construction 
of the fields is captured by the tape. However, the non-orthogonal correction vectors are first 
constructed when discretizing the gradient operator in the momentum equations, using the 
corrected surface-normal gradient scheme. ‘The first occurrence of this discretization is in the first 
SIMPLE iteration, at which point the tape has already been switched off, to calculate the passive 
iterations needed for the checkpointing iteration. When the nonOrthCorrectionVectors() access 
function is subsequently called while the tape is active, only a reference to the field created earlier 
is returned. ‘Therefore the dependence of the correction vectors on the parameters is lost. 





To fix this problem, we explicitly call all on demand generator functions of the fvMesh instance, 
after the pre-processing is finished but before the tape is switched off. This might be redundant 
for some functions, if the field has already been initialized. However, as in that case only a 
reference is returned, which is subsequently ignored, the run time and memory cost of those 
additional calls is negligible. The actual constructors generating the data are private to the 
fvMesh class, and would require modifications inside the OpenFOAM code base in order to be 
accessible from our solvers. ‘Therefore we simply trigger dummy calls to the accessor routines, 
which have the side effect of creating the required data fields. ‘The changes required in order to 
obtain a consistent checkpointed shape adjoint are presented in Listing 4.2. 














The same fixes apply when using the checkpointing interface to implement reverse accumulation 
or piggybacking. Figure 4.20 shows sensitivity results over iteration count for a single point on 
an airfoil surface (compare to Section 4.4). Sensitivities are obtained by tangent mode, adjoint 
mode with checkpointing (due to the cost involved only evaluated every 20 iteration steps) and 
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Figure 4.20: Iteration history of shape sensitivity obtained by tangent mode, adjoint mode, and 
static piggybacking. 


static piggybacking (no design update). All converge to the same solution, with tangent and 
checkpointed results without SDLS being identical within machine precision. The adjoint solution 
with SDLS exhibits a maximum difference from the tangent solution of 0.6% and the final value 
matches up to 0.002%. 


4.4.2 Verification of Shape Adjoints 
Numerical Verification using Higher Order AD 


Using the second order differentiation model, the AD implementation of the adjoints can be 
verified against tangents inside the same solver. As a by-product to the second order derivatives 
generated by the tangent over adjoint model, the model includes both the first order tangent and 
adjoint models. 


x1) = VSP ya) 
y2) = Vf x® 


Ignoring the second order derivative components, and using the correct seeding, the tangents and 
adjoints can be calculated within the same solver. ‘To get representative adjoints, the full SIMPLE 
iteration history has to be adjoined back to the inputs after each iteration step. Using the scalar 
tangent mode, only one tangent (i.e. the x,y or z component of a single point) can be verified 
at a given time. Only a subset of points can thus be verified in a reasonable amount of time. 
We applied this approach and did not observe any discrepancies beyond the usual floating point 
precision differences. After the correctness of the AD model implementation has been checked, 
we focus on showing that the results are also consistent to results obtained by the continuous 
adjoint method. 
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Figure 4.21: Mesh of a cylinder in laminar flow. Inlet on the left, no-slip wall on the cylinder 
surface, zero pressure outlet on the right, top, and bottom boundaries. ‘The mesh used for the 
calculation has double the resolution. 


Verification Against Continuous Adjoints 


To verify the shape adjoints, laminar steady flows around a cylinder in a channel were studied. 
This is a well studied problem, which exhibits different kinds of behaviors for different flow 
conditions. For Reynolds numbers of one and lower, the flow closely resembles a potential flow 
and stays attached to the cylinder. For Reynolds numbers of around 10, a recirculation area 
begins to form behind the cylinder, however the flow remains steady. For Reynolds numbers of 100 
and above, the flow becomes transient. ‘The cylinder begins to shed vortices with a characteristic 
frequency (characterized by the dimensionless Strouhal number St), due to the oscillating pressure 
field in the wake of the cylinder. ‘This flow phenomenon is commonly known as the von Karman 
vortex street |Von54|. While still laminar at first, for growing Reynolds numbers the transient 
flow becomes increasingly turbulent. 

To obtain steady solutions, which exhibit different flow characteristics, two cases with Reynolds 
numbers Re = 2 and Re = 20 were studied. A structured block mesh around a cylinder of unit 
diameter was chosen. The structure of the mesh is presented in Figure 4.21, however the actual 
mesh used for the calculations has double the spatial resolution (and thus four times as many 
cells). The flow enters the domain through an inlet on the left, with a prescribed flat velocity 
profile. It exits the domain mainly through the outlet on the right, however also the lower and 
upper boundaries are configured as outlets. The domain is possibly not big enough to eliminate 
all influence of the outflow boundaries onto the flow near the cylinder. This would be required 
to reliably determine the frequency of vortex shedding. However, as we are interested in steady 
cases and want to compare the sensitivities obtained by different approaches of differentiation, a 
minor influence of the boundary conditions on the flow is deemed irrelevant. 





The laminar flow around the cylinder for both cases is illustrated in Figure 4.22. ‘The former case 
exhibits flow around the cylinder with the streamlines of the flow near the cylinder following the 
cylinder surface tangentially. The latter flow is still laminar and steady, however the streamlines 
detach somewhere after the point of maximum cylinder width and two recirculation areas of 
opposing rotation form in the wake of the cylinder. Further downstream the streamlines converge 
together again. 
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Figure 4.22: Laminar flow around cylinder for Re = 2 (left) and Re = 20 (right). For the lower 
viscosity, a recalculation area begins to form in the wake of the cylinder. 


Sensitivities are calculated with respect to the power loss between the inlet and outlet, as 
introduced in Section 3.1.3. Similar results can be obtained by choosing the drag on the 
surface of the cylinder as cost function. ‘The discrete results are obtained by applying the 
piggyback method to the simulation, without performing design updates, for 500 iterations. 
Continuous adjoint results are obtained by running a modified version of the stock OpenFOAM 
solver adjointShapeOptimizationFoam, expanded to calculate the wall sensitivities according to 
Equation (2.11) after each iteration step. 

The results are presented in Figures 4.23 and 4.24. The former shows the sensitivities in a 
Cartesian plot, the latter in a polar plot. For both plots the abscissa, ranging from —7z to 7, is 
the position along the surface of the cylinder, starting from the stagnation point at the outmost 
left position of the cylinder (due to the unit radius it is also the angle between the x-axis and the 
position on the cylinder surface in rad). The ordinate gives the sensitivity of the cost function to 
translation of the cylinder surface points in surface normal direction. A negative value indicates a 
movement in negative normal direction, squishing the cylinder together and reducing the volume 
of the cylinder. A positive value expands the cylinder in direction of the normal and thus increases 








the volume. 
The results of the discrete adjoint Gp have been scaled by a uniform factor A to best match Go: 


min Bo — MBplly- 


Due to the linearity of the adjoint momentum equation, the result of the continuous adjoint 
calculation is linearly dependent on the adjoint inlet velocity, which can be arbitrarily chosen. 
Therefore a linear factor between the discrete and continuous solution does not indicate a problem, 
and would be eliminated by the step size control of an optimization scheme. 

The results show a very good match between the adjoints produced by the discrete adjoint and 
the continuous adjoint, especially considering that they are produced in a considerably different 
way. While the match between the discrete and continuous adjoint is best judged from the 
Cartesian plot, the influence on the shape can be better recognized in the polar plot. For the low 
Reynolds number case, the results indicate that an optimization would steer toward an ellipsoidal 
shape, in order to lower the surface area of the obstacle presented to the flow. If we assume the 
origin to be the center of the cylinder, the magnitude of the mesh movements is both symmetric 











around the x-axis and y-axis. This matches the symmetry of the geometry, mesh, and flow field. 
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— Discrete Re = 2 
— Continuous Re = 2 
— Discrete Re = 20 
— Continuous Re = 20 


Adjoint shape sensitivity 6 
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Figure 4.23: Sensitivities over location (in radians) on the cylinder surface in Cartesian coordinate 
system. Origin is the stagnation point on the front of the cylinder. 
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Figure 4.24: Sensitivities over location (in radians) on the cylinder surface in polar coordinate 


system. 
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Figure 4.25: Shape optimization with fixed volume constraint of the unit cylinder. 


For the higher Reynolds number case, the flow field is not symmetric around the y-axis anymore 
and neither are the sensitivities. The sensitivities indicate a shape more formed like a hourglass 
(note, that the linearized gradient only guarantees an improvement of the cost function for an 
infinitely small perturbation, an actual optimized geometry might look different), likely to prevent 
the separation of the flow from the cylinder and to combat the formation of the recirculation area. 
For this solution, there are regions of positive sensitivity, indicating regions where a redirection of 
the flow is more important than to minimize the surface area of the obstacle. 








4.4.3 Shape Optimization 


After having established the consistency of the discrete shape adjoints, we will now apply the 
gradients obtained to optimize the cylinder geometry. A mesh morphing strategy, developed 
in [Mol18], using adaptations of existing OpenFOAM mesh morphers, will be used. The sensitivi- 
ties are supplied to the morpher to determine the amount of movement of the surface nodes. ‘The 
remaining nodes are moved using a Laplacian smoothing technique [Sor+04], distributing the 
surface movement into the domain, decreasing the movement with increasing boundary distance. 
The connectivity of the mesh remains unchanged during the optimization. ‘The sensitivity results 
obtained earlier suggest, consistent to intuition, to reduce the volume of the cylinder in order to 
obtain a lower drag on the cylinder body. Obviously the optimal solution would be to have no 
obstacle to the flow at all. In order to obtain a more meaningful optimization target, we will 
constrain the volume of the (unit) cylinder to its initial volume 7th. To enforce the constraint, we 
choose a simple penalization approach with 





JI = In (x,q) + A(Vo— Vi), 


where Vo is the sum of all cell volumes in the initial configuration, Vj; the sum of cell volumes in 
the deformed state and \ a suitable scalar penalization parameter. Cell volumes are positive by 
definition, no absolute value is thus needed in the summation. Due to the fixed boundaries, the 
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Figure 4.26: Normalized drag and penalized drag on the cylinder surface over 200 iteration 
steps. 


change of sum of cell volumes is directly proportional (with inverse sign) to the change in cylinder 
volume. We start with a rather low A and repeatedly increase it during the optimization process 
to stronger enforce the constraint. ‘The application of the mesh morpher is consistent to a steepest 
descent method, as it moves the surface points by the gradient, scaled by a constant factor. 


The optimization loop repeatedly calls the (piggyback) solver to obtain a gradient and the 
mesh morpher to translate the sensitivities into an updated mesh. Using the established discrete 
adjoint framework, it is feasible to combine both steps into one application, potentially executing 
a mesh update (with small movements to ensure convergence stability) after each piggybacking 
step. A similar combination of different utilities has been carried out to combine the blockMesh 
and simpleFoam utilities, yielding a solver which allows to directly optimize for parameters of 
the blockMesh mesh description. This approach is detailed in Section 4.6. 


The movement of the cylinder surface from the baseline to an optimized configuration is shown 
in Figure 4.25. After 100 iterations of the shape optimization procedure, the drag is reduced 
by 5.8%, the volume constraint is violated by 0.85%. After 200 optimizer iterations, the drag is 
reduced by 6.6%, while the volume of the cylinder is much nearer to the target volume (0.12% 
violation of the volume constraint). 

The gradual reduction in drag is shown in Figure 4.26. Due to the penalization, the constraint 
violation never exceeds 3%. As the gradient 0J/0G nears zero, and the penalization parameter A 
is increased, the gap between J and Jp closes. 





One would suspect, that a globally optimal solution would narrow the cylinder even more. 
However, with a fixed mesh topology, for which only the point positions are morphed, the mesh 
quality will degrade for big displacements, leading to distorted meshes and eventually divergence 
of the solvers. Note, how the center of gravity of the deformed cylinder is not fixed and moves 
slightly in flow direction. If the cylinder is supposed to stay fixed, the squared movement of the 
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Figure 4.27: Baseline mesh and morphed meshes. Mesh after 200 iterations is barely regular 
with highly distorted cells. The geometry should be remeshed at this point. 





center of gravity could be introduced as an additional penalization term. 

The original and morphed meshes are depicted in Figure 4.27. ‘The mesh morpher keeps the 
mesh quality acceptable, such that e.g. no negative volumes occur, however after 200 iterations 
the quality has significantly degraded and the cylinder should be remeshed. 





4.5 Discrete Adjoint Residual Approach 


In this Section we will show the steps required to efficiently implement the discrete adjoint residual 
formulation, introduced in Section 2.10, in OpenFOAM. The big dimension but sparse nature of 
the involved Jacobians motivates using coloring techniques. 

First, we efficiently determine the non-zero pattern of the residual Jacobian. Second, we color 
the resulting Jacobian, by applying information obtained either directly from the mesh or from an 
intermediate graph representation. Third, we compute the Jacobian entries using AD or FD and 
use the resulting linear system to compute the desired sensitivities. Last, we present applications 
of this methodology to our reference cases. 








4.5.1 Calculation of Residuals in OpenFOAM 


In the following derivations, we focus on the case of steady laminar flows, and the parameter set 
required for topology optimization. ‘The flow is thus characterized by the velocity and pressure 
fields. ‘To incorporate turbulence or other physical quantities, the states, and therefore also 
the sparse Jacobian of the residuals, can be expanded. ‘The face flux field @ depends on the 
velocities, but can not readily expressed by it with an explicit formula, because it is iteratively 
corrected (see Section 2.3). Therefore, to obtain accurate adjoints, the face flux is introduced as 
an independent variable to the residual Jacobian [RU13]. The state x is assembled from both 
cell centered quantities (u,p) and face centered quantities (face flux ¢), yielding the resulting 
state vector x = (U,p, @) € R*"¢t”F. Consequently the residual R = (Ry, R,, Ry) € R*"Ct"? 
is also split between cell centered (Ry, R,) and face centered (Rg) entries. 
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The residual vector Ry of the momentum equation can be calculated, either using the built in 
residual function of OpenFOAM, 


fvVectorMatrix UEqn( 
fvm::div(phi, U) 
+ turbulence ->divDevReff (U) 
+ fym:  Spicalphia.. U) 
- fvc::grad(p) 
ye 
volVectorField URes = UEqn.residual (); 


or by explicitly calculating the residual of the linear equation system: 


volVectorField URes = (UEqn & U) + fvc::grad(p); 





Similarly the residual of the mass conservation equation R, can be calculated, either using the 
fvMatrix residual function, 


fvScalarMatrix pEqn( 

fvm::laplacian(rAtU();, p) —-— fve:: div (philyAéA) 
ye 
volScalarField pRes = pEqn.residual () ; 


or explicitly as: 
volScalarField pRes = (fvm::laplacian(rAU, p) & p) - fvc::div(phiHbyA) ; 


The update of the face flux is not calculated by solving a linear equation, but by an explicit 
update formula. For a converged case, the difference between two subsequent iterations of @ can 
be interpreted as the residual Rg: 





Rs = g' _ g'! 
With the update formula of the face flux in OpenFOAM, 
volScalarField rAU(1.0/UEqn.A()); 
volVectorField HbyA(constrainHbyA (rAU*UEqn.H(), U, p)); 


surfaceScalarField phiHbyA("phiHbyA", fvc::flux(HbyA)); 
phi = phiHbyA - fvc::flux(rAU*xfvc::grad(p)); 


the calculation of the residual can be implemented in OpenFOAM as: 


surfaceScalarField phiRes = (phiHbyA - fvc::flux(rAU*fvc::grad(p))) - phi; 


4.5.2 Prediction of Jacobian Sparsity Pattern 


In order to efficiently compress the Jacobian of the residuals, as described in Section 2.11, the 
sparsity pattern, that is the individual positions of the non-zero entries, of the Jacobian has to 
be known. A naive way to determine the sparsity pattern is to calculate the Jacobian entries 
densely, and then check which entries are non-zero. ‘The resulting sparsity pattern can then be 
used to recompute the Jacobian more efficiently. This obviously is not very efficient and requires 
that the Jacobian sparsity pattern is reused multiple times to yield any improvement. 
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For general purpose computer programs, the sparsity pattern can be obtained more efficiently by, 
instead of calculating the actual entries of the Jacobian, only determining the boolean dependence 
of the outputs on the inputs. For a function y = f(x), with x € R”,y © R”, an output y; 
depends on an input x; if the partial derivative Oy;/Ox; = Ji; #0, and thus this dependence 
implies a non-zero in the Jacobian. The determination of boolean dependence can be implemented 
as the (forward or reverse) propagation of dependency sets. This process is outlined below. 

Let f : R” — R” be implemented by elemental functions y with intermediate variables v. We 
assign to each variable a set D,, C N. For the 7-th input, we initialize D,, = {i}. For a unary 
function vg, = yz(v;), the dependency is propagated unchanged from vu; to vz: 














D D.. 


UE = Ont) — 





For binary functions vy = ~(vi,v;), the dependency set is determined by the union of the 
dependency sets of its inputs: 


D 7 oon ore 


Uk=—Pk (v; 50; ) 





Instead of implementing the dependency directly as sets, in dco/c++ a similar approach is chosen 
where the sets are modeled as bitsets of fixed length. This increases the memory footprint of the 
program and may necessitate to split the dependency calculation into multiple parts (similar to a 








driver for tangent vector mode), but reduces the run time for the individual union operations on 
the dependencies from O(log(n)) to O(1), due to the direct memory lookup of the bitset. 
Second order dependencies (needed for the assembly of Hessians) can be obtained by similar, 
but computationally more expensive methods [Var11]. 
For OpenFOAM meshes, the determination of the full Jacobian sparsity pattern, using the 
dco/c++ pattern data type, consumes a significant amount of time. However, if the dimensions 








of the finite volume stencils are known, the sparsity pattern can be exactly constructed from 
the mesh connectivity information. A more efficient way to determine the full sparsity pattern 
is thus to use the pattern type, to obtain the stencil size of an arbitrary cell inside the domain. 
The full sparsity pattern can then be constructed from this reference stencil and the mesh 
connectivity information. 

The stencil sizes required for the calculation of the desired Jacobian are shown in Table 4.2. As 
can be seen, the stencil is rather compact, with the biggest stencils required for the calculation 
of R, and Rg. The definitions for cell and face centered stencils are illustrated in Figure 4.28. 

The resulting block structure of the Jacobian of the state residuals is shown in Figure 4.29. An 
exemplary sparsity pattern, exhibiting these blocks, for the angled duct case with 325 cells, is 
presented in Figure 4.30. 








Table 4.2: Stencil size for the individual Jacobian residual blocks. Superscript * indicates a cell 
centered cell stencil, + a cell centered face stencil, t a face centered cell stencil, and ¢ a face 
centered face stencil. 








R; 32 iF 2 a 
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(a) Cell centered stencils (*,+) (b) Face centered face stencil (o) (c) Face centered cell stencil (1) 





Figure 4.28: Face stencil around central cell (a), face stencil around central face (b), cell stencil 
around central face (c). 
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Figure 4.29: Sub blocks of the residual Jacobian. 
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Figure 4.30: Jacobian of residual of the reference case with ng = 325, nr = 600, resulting in a 
RPPxX15 sparse matrix with n,, = 50350 non-zero entries. Sub-blocks of the Jacobian are 
indicated with different shades of gray background. 
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4.5.3 Calculation of Discrete Adjoint Residual 


The discrete adjoint residual approach was implemented using the adjoint mode of AD, allowing 
to determine the Jacobian of the residuals. For validation and performance comparison, it was 
also implemented in tangent mode and with FD. Colorings of the Jacobian are obtained by using 
graph coloring algorithms, implemented in ColPack |Geb+13]. When coloring and compressing 
the Jacobian with ColPack (using smallest last heuristic as ordering) we observe slightly fewer 
colors when compressing columns than rows, giving the tangent mode and FD a slight advantage. 
In order to evaluate the matrix vector product (OR/da)* - Ax from Equation (2.19) in tangent 
mode and FD, the matrix (OR/da)* needs to be explicitly calculated. To efficiently calculate 
this matrix, a separate coloring for the parameters @ is performed, yielding significantly less colors 
than needed to compress the full residual Jacobian, due to the stencil size of the parameter being 
limited to one. The calculation of the residual Jacobian is fastest in (one sided) FD mode, followed 
closely by the adjoint mode, which performs well due to the high amount of re-interpretation 
(one seed and tape interpretation for each color but no additional tape recording). Tangent 
mode exhibits a constant run time overhead compared to FD. Utilizing tangent vector mode, 
the performance should be competitive to FD. For simplicity of the driver, this was not pursued 
further. ‘The adjoint mode can be utilized in a vector mode as well, allowing reverse propagation of 
different seeds at the same time. This can potentially be used to further speed up the calculation 
of the Jacobian, at the cost of higher memory usage. 

Preparing for the solution of the linear system (OR/Ox)* - Ax = (O7/Ox)* , the coefficients 
of the Jacobian are stored in an Eigen |GJ+10]| sparse matrix. Storage in a native OpenFOAM 
format would be preferable, particularly to preserve parallelism during the solution, however 
OpenFOAM lacks convenient general purpose linear equation solvers for fields which are not 
connected to a specific geometry. The solution of the linear equation system is calculated with 
either the Eigen SparseLU or Eigen BiCGStab (with incomplete LU preconditioner) solvers. 
In our implementation, the overall run time is heavily dominated by the solution of the linear 
system, making the overhead of the AD tool less significant. The memory requirement to reverse 
the residual evaluation by AD is considerably lower than to tape a full SIMPLE iteration step. 
The memory required for the tape is of the same order of magnitude as the space needed to 
store the full sparse Jacobian matrix. For reference, to reverse one full iteration of the finer 
case, introduced below, about 610 MB of tape space (with SDLS enabled) are needed, while the 
tape size required to capture the calculation of the FVM residual is 408 MB. The total memory 
consumption including the storage and solution of the sparse matrix is 1250 MB. 

Therefore, despite requiring multiple evaluations of the tape to obtain the full Jacobian, the 
adjoint method is competitive with FD, in both run time and memory consumption. 

To illustrate the results of the discrete adjoint residual solver, we once again turn to the angled 
duct example. Because this case is a laminar 2D case, the state vector consists of x = (uz, Uy, Pp, P) 
and consequently the size of the residual Jacobian is (83n¢c + nf) X (8NC + NP). 

We investigate two different mesh resolutions, a coarse mesh with ng = 2925 cells and 
nr = 5700 internal faces and a fine mesh with no = 46800 cells and ng = 93000 internal 
faces. A ColPack bipartite graph representation is build from the sparsity pattern, then partial 
row/column distance two coloring is applied to obtain a suitable Jacobian coloring. Coloring the 
columns of the coarse Jacobian using ColPack yields 90 colors, coloring the rows 102. The fine 
Jacobian yields slightly more colors, namely 95 for coloring the columns and 106 for coloring the 
rows. As the general connectivity of the mesh is not changed by the mesh refinement, the lower 











169 


4 Implementing Efficient Algorithms for Steady Flows 


Ue bagrit cle 
F?ide-03 


J.CCSa407 


“ALOR ARGAA 


L* a Ca 
Lola S 





(a) Adjoint velocity (b) Adjoint pressure 


SEAS SCI 
eee 


a SS 
ATA 


[. 


20 


7 
- 0CQe+3) 


tae 


aro? 


_ 
-Lrseerde 





(c) Sensitivity $2 (d) Sign of sensitivity o( $2) 


Figure 4.31: Solutions for the fine angled duct case obtained by the discrete adjoint residual. 


limit for the number of colors likely does not increase with finer meshes; However, the coloring 
heuristic performs slightly worse for the finer mesh. 


Figure 4.31 shows the sensitivity results of the angled duct case, as obtained by the discrete 
adjoint residual method. ‘The first row of figures shows the adjoint velocity and pressure, which 
can be extracted from the solution Ax, = (U, D, od) and have the same physical meaning as the 
adjoint velocities and pressures defined for the continuous adjoints. The second row shows the 
final sensitivities d7/da, as well as the sign of the sensitivity for easier cross reference. 





The sensitivity results are identical for tangent and adjoint mode (up to machine precision) 
and align with the FD results very well. The results also match the results obtained by both the 
discrete black-box differentiation and the continuous adjoint presented in Section 4.3.3. 


Note that the adjoint velocities u obtained by this method are vector quantities. ‘They can be 
used to evaluate Equation (2.11) to obtain shape derivatives, equivalent to the continuous adjoint 
approach, circumventing the need to differentiate through the generation of the mesh from the 
individual points. 


Table 4.3 lists the run times of the following phases for adjoint, tangent, and FD mode: 
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Table 4.3: Run times of the different stages; Memory consumption for adjoint, tangent, and 
FD solver for the coarse and fine level angular duct. Passive run time is calculated over 200 
iteration steps. 


Coarse Case A1S T1S FD 

Colors 102 90 / 23 90 / 23 
Passive 6.638 4.938 2.268 
Pattern 0.5858 0.60s 0.638 
Diff 0.748 1.168 0.348 
Solve 2.818 3.158 2.488 
Total run time 10.90s 10.448 6.178 


Max memory 176.39MB 155.54MB- 131.35 MB 


Fine Case AlS TIS FD 


Colors 106 95 / 25 95 / 25 
Passive 122.888 100.00 s 40.528 
Pattern 9.948 10.248 11.478 
Diff 14.99s 20.668 5.688 
Solve 147.408 166.428 152.858 
Total run time 303.118 307.008 217.008 


Max memory 1254.34MB 871.55MB_ 805.51 MB 


Passive evaluation: Iteration of the case from initial condition to a converged state. 





Assembly of sparsity pattern: Assembly of the expected sparsity pattern of the Jacobian from 
mesh connectivity. 


Coloring: Conversion of sparsity pattern to ColPack graph format and partial coloring of the 
bipartite graph. 


Differentiation: Calculation of the Jacobians Jx and Ja (Jaq only required for tangent mode and 
FD). 


Solution: Solution of the linear system and calculation of (2.18). 


All stages are performed in one solver. In practice it can be useful to separate the stages, as 
the sparsity pattern and coloring remain constant for a specific mesh and are independent of 
e.g. boundary conditions. ‘They can thus be reused for different configurations of the same case. 
The iteration procedure can be started from a partially or fully converged state (which can be 
created by a fully passive version of OpenFOAM) instead of the initial state, lowering the time 
for passive evaluation. 
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4.5.4 Directly Obtaining Colors from the Mesh Representation 


In order to reduce the computation time of the Jacobian, a coloring approach, as presented in 
Section 2.11, is used to compress rows or columns of the Jacobian. Previously we used the external 
software package ColPack to obtain a suitable coloring. ‘This necessitated the calculation of the 
non-zero pattern, as well as the construction of a graph structure from the Jacobian non-zero 
pattern. We will now introduce a method to directly obtain a feasible coloring from the FVM 
mesh representation. 


Mesh connectivity graph 


To analyze coloring problems, arising from the discretization of CFD problems, one would like 
to utilize already well known results from graph theory. ‘Therefore, it is desirable to introduce 
a graph representation of the mesh connectivity, as some properties defined on graphs can be 
reused on the mesh description. 





Definition 19 (Mesh connectivity graph). 
Let M = (L,U) be a mesh addressing given in LDU format, with L,U € N"¥. We define the 
corresponding mesh connectivity graph Gy = (Vu, Eu), consisting of: 


e One node for each cell in the mesh: V = {v; |i =0,...,ne — 1}; 


e Lach edge in the graph corresponds to an interior face connecting two cells, that 1s a pair 
(172) = (L, U): k= 1 (Chex Cre) | ~=0,...,nF- IN: 


A cell is directly adjacent to another cell if it shares a face in the mesh. This corresponds to an 
edge in the mesh connectivity graph. The distance between two cells (cj, c;) can be conveniently 
defined as the length of the shortest path in the mesh connectivity graph connecting both cells. 





Definition 20. 

Let v; and v; be cells of a finite volume mesh, that is they are vertices in Gy. We define the 
distance d = D(uj,v;) € N between cells vi and v; as the length of the shortest path in Gy 
connecting the nodes corresponding to both cells. 


Figure 4.32 gives an example 3 X 3 mesh with 9 cells and 12 internal faces, its corresponding 
mesh connectivity graph, and its internal LDU representation. 


Application of Mesh Connectivity to CFD problems 


Definition 21 (Cell Stencil). 

In CFD the stencil of discretization is defined by the cell neighborhood around a cell which 
influences the value of the solution at this specific cell in the next iteration step. The size of this 
stencil 1s defined as the maximum distance between the two cells in the cell adjacency graph. 


An example of two non overlapping stencils of size two on a structured mesh is shown in 
Figure 4.33. ‘The spreading of information over consecutive iteration steps, as well as the 
distance 2d neighborhood needed for the later proof, is illustrated in Figure 4.34. 

We will show that a distance 2d coloring on the cell adjacency graph corresponds to a partial 
distance two coloring of the bipartite graph of the Jacobian. ‘Thus it can be used to compress the 
Jacobian matrix of the residuals. The same can easily be shown for face stencils. 
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Figure 4.32: Mesh connectivity graph, derived from ldu Addressing corresponding to 3 x 3 
mesh. 









































Figure 4.33: Non overlapping stencils of size two around two cells located at distance four. 
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Figure 4.34: Mesh connectivity graph of 1D mesh. Node 2 and 7 are connected with dashed 
edges to all nodes reachable by distance 2d = 4. 
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Figure 4.35: Bipartite graph for 1D structured mesh with nc = 10 and finite volume stencil of 
size two. Nodes rg and rz (or cg and c7 for column compression) can share the same color, 
as no path of length two exists between them. Blue edges indicate matrix entries created by 
stencils of size one, red edges additional matrix entries created by stencil of size two. 








Theorem 8. 

A distance 2d coloring, where d is the maximum stencil size used in the finite volume discretization, 
on the mesh connectivity graph Gy = (Vy, Em), can be used to compress the Jacobian of the 
residual of an equation discretized by this stencil. 


Proof. We will transform the coloring problem on Gy into an equivalent problem on the bipartite 
adjacency graph, which can be colored according to Lemma 1. Let two cells u;,u; be at distance 
d or less in the mesh connectivity graph, that is, starting from cell 7, cell 7 can be reached by 
crossing at most d faces. Then the undirected edges (r;,c;) € Eg and (r;,¢;) € Ep are both part 
of the bipartite graph G'g = (Vg, Eg) of the Jacobian induced by the finite volume stencils (as v; 
is inside the stencil of v; and vice versa). 








Let two cells u,;,u; be at distance of 2d or less then there exists a path in the bipartite graph 
of length two ry > cy > ry with Duy, vg) <d and D(vz, vj") < d. Thus, cells vy and vj, may 
not share a common color for a direct recoverable row compression (Lemma 1). Due to symmetry, 
the same argument holds for column compression and the path cy > rz — cj. 

Now let the two cells vj, vj, be at distance of at least 2d+1. There exists no path ry > cp > rj) 
in the bipartite graph, as for each k either D(cy, cy) > d or D(cx, cj’) > d, or both. Thus, cells vy 
and vj can share a color without breaking the distance two condition on the bipartite graph. 

Therefore, a mesh connectivity graph, colored such that no nodes at distance 2d or lower are 
colored with the same color, corresponds to a bipartite graph with a valid (partial) distance two 
coloring. Such a coloring can be used to compress the Jacobian calculation (Lemma 1). LJ 
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Figure 4.36: Distance-0, distance-1, and distance-2 stencils in three dimensions. Stencils 
exploded along depth axis. 


With the mesh connectivity graph, a valid coloring can be obtained directly on the mesh 
description (see Section 4.5.2). The OpenFOAM LDU mesh description implicitly models the 
mesh connectivity graph. The following four ordering heuristics, which are ordered from least to 
most computationally expensive, have been implemented directly in an OpenFOAM solver. 


Natural Ordering: Color cells in order of their cell numbering. 
Random Ordering: Color cells according to a random permutation of their cell numbering. 


Approximate Largest First: Compute the size of the distance-1 neighborhood for each cell and 
sort big to small. Color in this order. ‘The distance-1 neighborhood is already available in 
the OpenFOAM mesh representation, the evaluation of the neighborhood size is thus cheap. 


True Largest First: Compute the size of the full distance-4 neighborhood for each cell, and sort 
big to small. Color in this order. 


First we investigate the coloring performance for a structured hexahedral n X n X n mesh of 
the unit cube. A distance two stencil (blue), as well as all cells colored with color zero (red) for 
the 10 x 10 x 10 unit cube, are shown in Figure 4.39. A graphical representation of structured 3D 
stencils up to distance-2 are given in Figure 4.36. 

All above heuristics exhibit a run time behavior linear in the number of cells ng = n° of the 
mesh. This is to be expected, as each cell is individually colored, and the cost per cell is constant 
as is argued below. For every cell, a breadth-first search |Moo59| (BFS) is performed, however 
the BFS is stopped after a fixed number of steps (the desired coloring distance). Thus the run 
time of a single BFS is independent of ng. Instead it depends on the connectivity of the mesh 
and the type of the used cells. ‘The linear run time behavior can be seen in Figure 4.37, where 
the resolution of the unit cube is scaled from N = 10° to N = 100%. The corresponding number 
of colors required to color the Jacobian are shown in Figure 4.38. The number of colors required 
remains largely constant as nc rises. 
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Figure 4.37: Run time behavior of the presented heuristics. Run time scales linear with no. 
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Figure 4.38: Number of colors required to color the unit cube for the presented heuristics. 
Number of cells nc dashed on second axis. 
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Figure 4.39: Unit cubed meshed with 10 x 10 x 10 cells, colored with 36 colors. Only cells of 
the first color (red), as well as one stencil (blue) is shown. 


For unstructured meshes, the true largest first heuristic performs very well. For structured 
meshes, it performs worse, which is explained by the uniform size of the cell neighborhoods, 
yielding many nodes with the same degree. In this case the heuristic basically falls back to a 
natural ordering (natural ordering is used as tie-breaker), except near boundary patches, where 
the number of cell neighbors is lower. 














In the implementation it has been observed, that using a flat (vector based) set representation 
instead of the red-black tree |Cor+09]| based implementation in the C++ standard library |Pla+00] is 
considerably more efficient when performing the BFS on the cell neighborhood. An implementation 
of the BFS search around a given cell in the mesh representation is given in Listing 4.3. ‘To 
improve performance and reduce memory, the recursive BFS algorithm is explicitly unrolled to a 
loop based implementation, 

The size of the cell neighborhood is determined by the number of neighboring cells. For 
structured hexahedral meshes, the size of the neighborhood is determined by 


2d+1 1D 
mn = ¢ d?4+(d+1)? 2D 
5(4d? — 6d? +8d—3) 3D, 


where d is the stencil size. ‘The full distance-4 neighborhood consequently contains 41 cells for 2D 
and 129 cells for 3D structured hexahedral meshes. 

For unstructured meshes, the size of the cell neighborhood is obviously much more varied. Here 
the largest first heuristic performs well. While this heuristic takes considerably longer to evaluate, 
it produces an ordering that yields significantly fewer colors. This is illustrated by the motorbike 
and VW Polo cases in Table 4.4. The motorbike case consists of 352 863 cells and has a maximum 
cell neighborhood of size 455, due to the polyhedral nature of the mesh. The VW Polo case is hex 
dominated, with tetrahedrons connecting the interior elements to the boundary surfaces. This 
mesh contains 7.75 million cells, the maximum size of the distance-4 cell neighborhood is 595. 

The run time of the coloring heuristics is dominated by the BF'S during the coloring stage. 
Only for the true largest first coloring heuristic the creation of the ordering takes a significant 
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Table 4.4: Number of required colors and run time in seconds for the different ordering heuristics. 


Cube 10° Cube 100° Pitz-Daily 
Heuristic Colors Runtime Colors Runtime Colors Run time 
Natural Ordering 4] 0.03 45 19 19 0.07 
Random Ordering A8 0.02 58 27.4 25 0.08 
Approximate Largest First 44 0.03 62 18.4 25 0.08 
True Largest First 36 0.04 62 o2@ 20 0.13 
Pitz-Daily 3D Motorbike VW Polo 
Heuristic Colors Runtime Colors Runtime Colors Run time 
Natural Ordering 48 2.04 99 13.55 110 307.23 
Random Ordering 56 3.18 90 12974 101 384.95 
Approximate Largest First 60 2.15 96 14.09 115 315.02 
True Largest First AS 3.84 19 25.89 87 622.12 





amount of time, as it also evaluates the cell neighborhood using BFS. ‘The overall run time for 
the true largest first scheme can potentially be improved, at the cost of much higher RAM usage, 
by saving the BFS results during the ordering phase and reusing them during coloring. 

The true largest first distance-4 coloring on the mesh produces results which are competitive 
to colorings obtained by ColPack |Geb+13] on the adjacency graph of the Jacobian. For the VW 
Polo case, which is the biggest case we considered, ColPack produced 85 colors, compared to 
the 87 colors obtained by the coloring implemented directly in OpenFOAM. A visualization of the 
cell colors of the near wall cells, mapped onto the surface of the car body is shown in Figure 4.40. 


1 set<label> calc_cell_neighborhood_BFS (label root, label k, fvMesh& mesh) { 


2 set<label> neigh; 

3 neigh.insert (root); 

4, set newCells = neigh; // copy root set 

5 for(int i= 0; i < k; i+t+)f{ 

6 set<label> tmpCells = newCells; // iterate over all cells of old level 
6 newCells.clear(); // empty set for next level 

8 for(label it : tmpCells){ 

9 labelList& newNeigh = mesh.cellCells (it); 

10 forAll(newNeigh ,j)f 

11 // returns pair<iterator,bool>, true if element not previously in set 
12 auto ret = neigh.insert (newNeighl[j]); 

13 // only add new elements for further exploration in next level 

14 if(i < k-1 && ret.second) 

15 newCells.insert (newNeigh[j]); 

16 i 

17 } 

is| fF 

19 return ne ear ; 

20 } 


Listing 4.3: BFS algorithm to identify the distance k neighborhood around cell index root. 
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Figure 4.40: Cell colors of distance-4 coloring of near wall cells. Plot is split at the y = 0 plane. 
On the y > 0 half of the plot only cells of color zero are shown in red. 


4.6 Parametric Optimization 


4.6.1 Introduction 


In addition to high dimensional optimizations, carried out by topology and shape optimization, 
the discrete AD model can also be applied to a parametric optimization setting. Parametric 
optimizations use a set of parameters, which model the shape of the geometry in some way. For 
example, a pipe could be modeled by a spline, defining the centerline of the pipe, and a set of 
radii, which model the cross section of the pipe. 

Compared to the state vector x € R"*, the dimension of the parameter vector y € R”” is 
rather low, with 1 <n, <n. Therefore a calculation using either tangent mode or adjoint mode 
is feasible. ‘he parameters are an input to the mesher, which transforms the parameters to the 
full mesh representation. 

To circumvent the need to differentiate the mesher and solver at the same time, a hybrid 
approach is possible. In this setting, the sensitivities of the generated points Q (output of 
mesher M) w.r.t. the design parameters dQ/dy are generated using tangent (vector) mode 
at cost O(m) - cost(M(v)). The sensitivity of the (scalar) cost function, with respect to all 
points d7/dQ, is generated in adjoint mode at cost O(1) - cost(J(F(y))). The final sensitivities 
can then be calculated by the following product: 


dJ OF OM 
dy OQ a 
An advantage of parametric designs is that, assuming a reasonable compact parametrization is 


chosen, the final design can be straightforwardly modeled in CAD tools by changing the parameters 
of the initial design to the final optimized values. The reduced parameter set of parametric 
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Figure 4.41: Illustration of a B-spline, defined by five control points (red). Spline in solid blue, 
corresponding control polygon dashed. For the optimization, only the inner three control 
points are treated as parameters. 


optimization also allows the application of advanced optimization schemes, as parameters can 
more easily be constrained and Hessian approximations can be assembled more effectively. ‘The 
remeshing after every optimization step ensures that the mesh quality stays consistent, which 
for a shape optimization, moving all surface points, is much more challenging and might require 
excessive amounts of smoothing of the surface features. 


4.6.2 Parametric BlockMesh Optimizer 


As a proof of concept, we differentiate through a combined solver of the mesher blockMesh and 
flow solver simpleFoam, using adjoint AD. 

The blockMesh mesh description syntax is a plain text description of blocks, faces, and 
edges. From this description, a mesh is generated by the blockMesh utility, using hexahedral 
elements. The resulting meshes resemble structured meshes (OpenFOAM meshes are always 
stored unstructured). Edges can be curved, where the curves are parametrized by splines and 





their corresponding control points. Different spline interpretations are available including Bézier 
curves, B-splines and Catmull-Rom splines. For this proof of concept, we only considered B-spline 
curves |De 78], as they best retain tangential relations in the geometry. The other spline types 
can be differentiated analogously. B-splines are defined by a control polygon, that is a piecewise 
linear path through the control points. ‘The spline connects the start to the endpoint, but does 
not pass through the intermediate control points. Such a spline is illustrated in Figure 4.41. The 
function defined by the control points, as well as its derivative are continuous, unless points are 
multiply defined. 

All spline control points are registered as parameters in the tape, as soon as the block edges 
are created from the control points. The mesh utility afterwards evaluates the spline at the 
required intervals and constructs the mesh primitives (points, faces, cells, etc.). After the mesher 
has finished, it returns a polyMesh object which holds the mesh information. Instead of reading 
the mesh information from the mesh files as usual, the finite volume discretization is instead 
constructed from this polyMesh object, leaving the derivative information between mesher and 
solver intact. 

The mesh generation with blockMesh is inherently limited to serial execution. ‘To retain 
the parallelism of the adjoint flow solver, the points generated by the serial mesher need to 
be distributed to the parallel nodes. ‘The dependencies of the points on the parameters can 
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Figure 4.42: Distribution of the global point fields Q, created by the mesher M, to local point 
fields, required for parallel solver execution *. Adjoints are seeded into y on processor Pp and 
then propagated back to the parameters +. 
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be retained by allowing AMPI to capture the communication. To facilitate this, we load the 
distributed finite volume mesh from an existing domain decomposition, but update the processor 
local point fields from the global field generated by the mesher (which triggers a recalculation of the 
primitive mesh). The global mesh point vector is thus split into several local parts and distributed 
by (A)MPI to the corresponding processors. This introduces a communication overhead, as well 
as the need to hold the whole point mesh on one processor (also required by the mesher anyway). 

The derivatives d7/dQ of the serial or parallel solver S can be calculated by any mean feasible 
to adjoin the iteration history. This step is essentially identical to the calculation of the full 
shape adjoints. For the proof of concept, we choose the piggyback approach, repeatedly adjoining 
steps of the SIMPLE algorithm until the adjoint of the state x is sufficiently small. The product 
with dM/dy can be calculated with a single interpretation of the remaining tape of the mesher. 


4.6.3 Application to Pitz-Daily Geometry 


As an application, we again use the Pitz-Daily backward facing step case, used for topology 
optimization in previous sections. The step is replaced by a ramp, such that the width of the 
nozzle gradually expands. Depending on the steepness of the ramp, the flow might still detach 
or remain attached to the lower wall. Except for the change of mesh topology, the boundary 
conditions are kept identical to the case previously discussed. As a first step, to find a sensible 
initial shape, the angle of the ramp is varied, while keeping the walls straight. As can be seen 
in Figure 4.43, the optimal region is quite broad. Depending on the starting position a local 
optimization will run into different local minima, located at different angles but with only slightly 
differing values of the cost function. ‘The parameter yo = So, is chosen from the middle of the 
optimal region as 108°. The global optimum is located in this region, both according to a brute 
force evaluation of the cost functions for different angles, evaluated at 65 different positions 
of So, between 50mm and 180mm, and according to a gradient based optimization starting 
at So, = 50mm, employing random perturbations after convergence to escape local minima (basin 
hopping |WD97]|). As the variation of So, changes the length ratio between the different blocks, 
the cells in x-direction are dynamically redistributed between blocks B, and Bo (see Figure 4.44), 
retaining mesh quality while keeping the global cell count constant. 

Next, the shape of the wall is modeled by a B-spline with three interior control points. In 
the initial configuration the control points are placed, such that the wall is tangential to the 
connecting walls. The control points are allowed to move in x and y-direction, giving the optimizer 
seven DOF for the spline control points. ‘The gradient is assembled from the derivatives of the cost 
function, here again power loss, with respect to the positions of the control points, as well as the 








x-coordinate of the ramp endpoint, to allow corrections of the ramp slope if required. The vector 
of parameters is thus 7y = So. Bl 2 1ys 92n) O2ys O3e S3,,| € R’, with the corresponding gradient 
of same dimension V4.7 € R’. The control points of the splines are bounded by [ymin, 0]? to avoid 
mesh breaking overshoots of the solution (which except for badly behaved FD approximations of 
the gradient was not strictly needed). 

The optimization was carried out using the SLSQP solver |[Kra94|, implemented in the 
SciPy |[JOP+01| optimization framework, as it both implements parameter bounds and lin- 
ear constraints and proved more reliable than other solvers. For the unconstrained (but bounded) 
case, the SLSQP method resembles the (Pseudo-) Newton method. Using a gradient obtained 
by adjoint mode and a Hessian internally approximated by SciPy, the optimization required 77 
passive and 26 active evaluations of the flow field to converge to the prescribed tolerance bounds. 
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Figure 4.43: Influence of the ramp angle on the power loss. ‘The optimum is located in a broad 
shallow valley; To find the global minimum, depending on the starting position, a global 
optimization might be needed. 





Figure 4.44: Parametrized backward facing step model. Ramp is parametrized by B-Spline with 
three control points and seven DOF. Design baseline in blue, optimized geometry in red. For 
better visibility, the y-axis is scaled by a factor of two. 
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Figure 4.45: Flow inside the optimized Pitz-Daily geometry. 


The initial and final geometry are shown in Figure 4.44. Changing from the initial design to 
the final geometry only changes the geometry slightly, but still improves the power loss by about 
6.5%. The flow through the optimized geometry is shown in Figure 4.45. The solution closely 
resembles solutions obtained by topology optimization. 

While the optimization produces reasonable results, it tends to converge into local minima. 
This is evident from the fact that the optimization does not deviate much from the starting values 
of the splines in x-direction, regardless on how they are chosen. It is thus advisable to try several 
initial configurations or to employ a global optimization strategy, e.g. basin hopping. 
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In this chapter two different case studies are presented, highlighting the different approaches 
to optimization, and the flexibility the discrete adjoint framework allows. The former case is a 
topology optimization of a finely resolved 3D geometry, showcasing the parallel scalability of the 
adjoint solvers. ‘The latter case is a shape sensitivity analysis of a 2D airfoil at a high Reynolds 
number, showcasing the influence of turbulence and the stability of the discrete adjoint approach. 
The chapter is closed with a brief overview over other applications considered with the discrete 
adjoint OpenFOAM framework. 


5.1 Topology Optimization of 3D Pitz-Daily Case 


In this section a three dimensional version of the Pitz-Daily case is topology optimized. Particular 
emphasis is placed on the scaling behavior of the computation on multiple MPI nodes. 


5.1.1 Case Configuration 


The test case is derived from the 2D Pitz-Daily case, introduced in Section 3.1.2, by extruding it 
by 0.25 m in the z-direction. This case is also used by |AU16] to access the scaling of the standard 
OpenFOAM version on the Hazel Hen' cluster of HLRS Stuttgart. In its coarsest configuration, 
the test case is meshed with a 122 250 cell blockMesh (the 12 225 cells of the 2D case multiplied 
by 10 cells in the z-direction). It can then be refined by uniformly increasing the number of 
cells along each dimension of the geometry. Due to the nature of the blocks for this particular 
test case, the refinement has to be carried out with an integer factor. ‘The number of cells for 
refinement levels 1 to 5 are listed in ‘Table 5.1. 





Table 5.1: Number of cells for different refinement levels of the Pitz-Daily 3D test case 


Refinement Level Cells 
1 122 250 
2 978 000 
3 3 300 750 
4 7 824 000 
5 15 281 250 


The Reynolds number is set to Re = 25000 and the k-e turbulence model is used to obtain a 
non-transient solution. The cost function is implemented as the (total) power loss between inlet 
and outlet, as defined in Section 3.1.3. 


‘https: //www.hlrs.de/de/systems/cray-xc40-hazel-hen/ 
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Figure 5.1: Geometry of the Pitz-Daily 3D test case with blocks (bold black lines) and cells 
(gray lines) for refinement level one. Inflow on the upper left, outflow on the right. The 
geometry is extruded 0.25m in z-direction. 


5.1.2 First Simulations 





To assess the convergence speed of the primal and derivatives, we first run a tangent simulation 
on the base configuration. The initial flow field is set to a solution of the potential flow equation 
(potentialFoam). We then run a tangent solver for 400 iterations. By seeding the tangents of all 
parameters a; to one, we obtain the sum of all partial derivatives 





F=VI(0)-a— NH 


with one evaluation of the tangent augmented simulation code. The convergence history of the 
sensitivity J is depicted in Figure 5.2. We see no major changes in both the primals and the sum 
of tangents after 1500 iterations. ‘The sensitivities do not lag behind the primals significantly. 
However, the momentum equation lags behind the pressure correction by a few hundred iterations, 
hinting that the momentum equations might be too strongly relaxed. 

After having determined the approximate convergence rate with the tangent simulation, we 
now switch to the adjoint mode. ‘The flow is initialized to a mostly converged state, obtained by 
2500 passive iterations of the simpleFoam solver. We run the adjoint simulation with a uniform 
starting field a, initialized to zero. First we run for 400 iterations, to check if the adjoints match 
the values predicted by the tangent simulations. Figure 5.2 shows that the sensitivities obtained 
by piggybacking match the value obtained by tangent mode. Further the sensitivity obtained 
with reverse accumulation by repeatedly adjoining the 400th iteration step match the results 
obtained by piggy-backing and tangent mode. 


5.1.3 Run Time and Optimization Results 


For the optimization, we utilize the piggyback algorithm (introduced in Section 4.2), which is run 
until both the primals and adjoints are sufficiently converged. 

Table 5.2 lists the run time factors and memory usage of the piggyback simulation. Shown is 
the average run time of a single iteration step of the simpleFoam and piggySimpleFoam solvers. 
By introducing the dco/c++ data type into simpleFoam, without executing an augmentation of 
the forward section, the run time increases by a factor of over two and the memory consumption 
by a factor of 1.6. 
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Figure 5.2: Sensitivities obtained by tangent and adjoint modes, with piggybacking from the 
initial state, and reverse accumulation by repeatedly adjoining time step 400. 


Table 5.2: Global run time and memory, including factors, for the Pitz-Daily 3D case. 


simpleFoam passive 
SimpleFoam active 
piggySimpleFoam w. SDLS 
piggySimpleFoam w.o. SDLS 


Run time(s) Factor Memory(MB) _ Factor 


26.92 1.00 239.14 1.00 
79.08 odes, 381.93 1.60 
218.64 8.12 9979.15 25.00 
463.48 i22 38474.52 160.89 
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Table 5.3: Run time and run time factors (compared to simpleFoam) of the individual solver com- 
ponents. The run times and run time factors for the individual linear equation system solvers 
are given for the augmented forward and reverse interpretation (only for SDLS) execution. 


simpleFoam simpleFoam active Piggyback w. SDLS Piggyback w.o. SDLS 


Run time (s) Runtime (s) Factor Runtime (s) Factor Run time(s) Factor 


Total 26.91 79.81 2.82 213.00 7.92 463.48 7 22 
Augm. forward 25.80 73.80 2.86 134.69 5.22 296.26 11.48 
U 6.39 15.43 2.41 16.78 2.62 62.69 9.80 
p 7.74 19.36 2.50 20.98 2.71 108.65 14.04 
k 2.14 5.10 2.38 5.14 2.40 21.18 9.88 
epsilon 1.68 4.01 2.38 4.65 2.76 16.09 9.57 
Seed — — — 3.47 — 15.42 —— 
Interpretation — — — 66.39 — 143.32 — 
U reverse — — — 16.86 — — = 
p reverse — — — 14.96 — — — 
k reverse — — — 9.01 — — — 
epsilon reverse = = — 4.52 ——- — —— 


Enabling the augmented forward and reverse interpretation for the piggySimpleFoam solver 
increases the run time factor to a factor of approximately eight, and the memory factor to 25. 
This is still considerably better than with black-box differentiated linear solvers, which consumes 
double the run time and over six times more memory. 


The run times are broken down to the individual solver phases in ‘Table 5.3. The solver run is 
broken down into the forward phase, seeding phase and interpretation phase. The forward phase 
is further broken down into the individual linear solver calls (velocity, pressure and turbulence 
(k-e)). The seeding phase is dominated by the first allocation and initialization of the adjoint 
vector, and therefore profits from SDLS, due to reduction in size of the adjoint vector. For the 
SDLS case, the interpret phase also breaks down the individual linear solver calls, executed from 
the adjoint callback objects. As with the primal, the solution time of the adjoint equation systems 
are dominated by the velocity and pressure systems, the turbulence equations are comparatively 
cheap to solve. 


When both the primal and adjoint have converged, we update the parameters @ corresponding 
to the steepest descent algorithm. ‘The values for @ are clamped below zero to eliminate non- 
physical momentum sources and capped at a maximum value to obtain a solution which is 
upwards bounded. 





Figure 5.5 shows the convergence history of the cost function 7(a@) over 2000 steepest-descent 
iterations. The average total pressure drop between inlet and outlet drops from 18.74m?s~? in 


the baseline version to 11.49m?s~? in the optimized version. 


The top and bottom part of Figure 5.3 show the velocity field and total pressure contours for 
the baseline and optimized design respectively, while Figure 5.4 shows the distribution of the 
penalization parameter a for the optimized case. The gap in the penalization field in the area 
after the step is an artifact visible at different mesh refinement levels and presumably tries to shift 
some flow from the centerline of the duct more towards the near- and far-field of the geometry. 
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Figure 5.3: Velocity plot on the z-midplane of the initial (top) and optimized (bottom) configu- 
ration. White contour lines show levels of total pressure. 





Figure 5.4: Geometry with penalized regions where a > 0. 
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Figure 5.5: Convergence of the cost function J for the optimization of the 3D Pitz Daily test 
case, improving the predicted power loss by 38%. 
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Figure 5.6: Mesh decomposition for 48 and 192 processors using scotch decomposition. Processor 
boundaries are shown in black, remaining mesh colored by processor id. 


Table 5.4: Relevant Hardware of the RWTH Aachen Compute Cluster. 


Section CLX-MPI CLX-SMP 


LSF Node ‘Type c24m128 c144m1024 

+# Nodes 600 6 
#Sockets per Node 2 8 

CPU Codename Intel Broadwell EP Intel Broadwell EX 
CPU Model E5-2650v4 E'7-8860v4 
Clock Speed (GHz) 2.2 2.2 
##Cores per Chip 12 18 

##Cores per Node 24 144 
Memory per Node (GB) 128 1024 
Memory per Core (GB) 5.33 (eel 


5.1.4 Scaling Behavior on HPC Cluster 


Having established the feasibility of this test case for topology optimization we now use this 
case for benchmarking the scaling behavior of the implementation. The case is decomposed onto 
multiple processors using the ptScotch decomposition algorithm |CP08]. The decomposition for 
48 and 192 processors is depicted in Figure 5.6. 

In Figure 5.8 we show the memory consumption and run time results for refinement level three. 
We again observe a major reduction of tape memory when utilizing the symbolically differentiated 
linear solvers (3088 GB down to 253GB). For not well conditioned problems, which require more 
solver iterations, the memory improvements are even higher, as the memory consumption without 
symbolically differentiated solvers is directly dependent on the number of solver iterations. Also 
one rogue iteration, that needs more linear iterations than average to complete, may kill the 
whole simulation due to lacking RAM space. 

With SDLS we also generally see a reduction in run time, due to improved linear solver efficiency 
(calculation in passive mode), less memory allocation, less adjoint propagation, and less adjoint 
communication, which outweighs the need to solve additional linear equation systems during 
the adjoint propagation. In our case, the adjoint propagation phase is slightly slower due to 
the additional equation systems which need to be solved. ‘This is offset by the more efficient 
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augmented primal section, leading to an overall reduction in run time of roughly 40%. 

Both the run time and the memory consumption are dominated by the solution of the pressure 
equation, for which the geometric-algebraic multi-grid solver (GAMG) is used. The remaining 
equations are solved with a Gauss-Seidel solver variant, which is often used in near transient 
cases due to its stability. 

In Figure 5.7 we show the scaling behavior of our implementation on the RWTH University 
compute cluster. Benchmarked are refinement levels three and four of the test case. Table 5.4 lists 
the two systems types most relevant to the application of AD. They both provide a relatively high 
amount of RAM per core with 5.33 GB and 7.11 GB respectively. With currently 600 installed 
nodes, the former system is the most common node of the RWTH cluster. It supplies 24 cores per 
node and 128 GB of RAM, making it a good general purpose choice. The latter system provides 
1024 GB of RAM per node, making it the preferred choice if a very high amount of RAM is 
needed locally. However, many of the 144 cores need to remain idle to utilize the full memory for 
single threads, which is punished by the job queueing system. 

The simulation was run on the former machines supplying 128 GB each. As the finer simulation 
(level 4) consumes about 650GB RAM (see Figure 5.8), for cases decomposed onto few processors 
(n = {12,24,48,96}), we cannot use all physical cores of the nodes, as not enough RAM is 
available locally. For those cases, we only place 3, 6 and 12 threads respectively per node, leaving 
the remaining cores unutilized. The total available memory bandwidth is thus shared by less 
threads for these cases, at the expense of more communication between distant nodes (connected 
by InfiniBand). 

For n = {192,384,768}, we can fully saturate the nodes with 24 threads each. For the 
benchmark, we time 20 piggyback iterations and calculate the average time needed for both the 
augmented primal and adjoint propagation phases. In the figure we see scaling of both phases. 




















For reference, also the average run time for passive calculation with simpleFoam is shown. For 
the most part, the scaling behavior of the discrete adjoint and passive version are comparable. 
The passive calculation stops scaling earlier than the discrete adjoint, because fewer operations 
need to be performed during each iteration for the passive solver. The run time of the passive 
calculation is thus dominated by the communication overhead earlier. 

For the coarser case of refinement level three, scaling stops after 192 threads for the passive 
computation and after 384 threads for the discrete adjoint. At this point no further scaling is to 
be expected, as the number of cells per thread has already fallen below 10000. For the finer case, 
scaling stops for the passive computation after 192 threads, but continues onto 768 threads for 
the discrete adjoint. At this stage each individual thread only holds around 12000 cells. We thus 
expect the scaling of the discrete adjoint to stop beyond that point for this case as well, as the 
numeric work load for each process becomes too small. 

Scaling of (primal) OpenFOAM has been shown to extend into thousands of processors [Dur+15]. 
For even higher processor numbers, some design decisions limit the scalability |Cull1; AU16]. 
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Figure 5.7: Run time scaling of the discrete adjoint on the RWTH cluster. The average run 
time out of twenty piggyback steps is shown for the recording and interpretation phases. For 
reference, also the scaling behavior of the passive simpleFoam solver and the theoretical ideal 


scaling is shown. 
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Figure 5.8: Memory consumption (over all processes) and run time of the Pitz-Daily example 
for 768 processors with (top) and without (bottom) symbolically differentiated linear solvers. 


5.2 Shape Sensitivities of NACA Airfoil 





In this section we apply the procedures introduced in Section 4.4 to generate the surface sensitivities 
of a NACA airfoil. The flow is calculated using the Spalart-Allmaras turbulence model, the 
sensitivities of the airfoil are evaluated w.r.t. viscous lift and drag. 


5.2.1 Modelling of NACA Airfoils 


The concept of NACA airfoils was introduced by the National Advisory Committee for Aeronautics 
(NACA) in the early 20th century, to efficiently describe different airfoil shapes. The most 
commonly used parametrization is the 4-digit parametrization, e.g. NACA 4412, which parametrizes 
an asymmetric 2D airfoil. A special case is the generation of symmetric airfoil profiles, where 
the distance from the camber line is equal for both the upper and lower surface. ‘hose profiles 
are described by only one parameter t, encoded with two digits, e.g. NACA 0012. Due to the 
symmetric pressure profile at zero angle of attack (that is the angle between spanwise direction of 
the airfoil and the direction of the freestream flow), symmetric profiles produce no lift. 
For a symmetric airfoil, the distance from the camber line is defined as 








ye = 5t (0.296902 — 0.1260a — 0.351627 + 0.2843a° — 0.10362) | 


giving the coordinates of the upper and lower surfaces as x, = 2, ty = 2, yr = —Yyt(2) 
and yy = +y:(2). 


Camber line 
— Upper Surface 
— Lower Surface 





Figure 5.9: Cross section of asymmetric NACA 4412 airfoil (left) and symmetric NACA 0012 (right) 
with zero angle of attack. 
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Figure 5.10: Illustration of the O-type meshing of the airfoil. Zoom on the boundary layers on 
the right. For the actual mesh, the radius of the bounding circle is considerably bigger and 
the boundary layers are finer. 


The wing cross section and the camber line for the asymmetric NACA 4412 and symmetric 
NACA 0012 wings are shown in Figure 5.9. In the following, we will further investigate the 
NACA 0012 airfoil. A symmetric airfoil is useful to verify the adjoint implementation, as one 
expects a symmetric sensitivity field at zero angle of attack. 

An O-type mesh |TWMB85] is generated for a circular domain around the airfoil, with a 
radius of thirty times the wing length. The spacing of the boundary layers yields a y+ value of 
approximately 0.1 on average and a maximum y-+ value of 0.25 at the chosen Reynolds number 
of Re = 2- 10°. The angle of attack is varied by changing the direction of the incoming flow. An 
illustration of the mesh layout is shown in Figure 5.10. In order to make the mesh features clearly 
visible, the radius of the circular domain is lowered and the mesh resolution is reduced in the 
figure. ‘he resulting mesh consists of approximately 80000 cells, most of which are needed to 
form suitable boundary layers and to refine the mesh regions in the wake after the trailing edge. 

As cost function the lift and drag of the wing are considered. The lift force on a wing is defined 
as the aerodynamic force of the fluid exerted onto the wing, perpendicular to the freestream 
flow. Analogously the drag force on a wing is defined as the aerodynamic force parallel to the 
freestream flow. 











For low velocities, the aerodynamic force is dominated by the pressure forces, which act normal 
to the skin surface. The total pressure force acting on the wing can be calculated by integrating 
along the airfoil surface I: 


Fy = § ow) ndw. 


For higher velocities, the contribution of viscous effects, that is forces caused by the shear stresses 
between the fluid layers, to the aerodynamic forces become non-negligible. This part of the forces 
is called friction or shear forces F,, which act tangentially to the skin surface: 


EF. = $ s(w) t dw , 
ie 


where s is the friction force at each location on the airfoil. The calculation formula for the 
friction force depends on the chosen turbulence model, but can in general be obtained from the 
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Figure 5.11: Airfoil with a = 15 deg angle of attack. For the left airfoil, the freestream is aligned 


with the x-axis, for the right airfoil the camber line is aligned with the x-axis. As a result the 
lift and drag vectors are tilted to the coordinate system for the right airfoil. 





implementation of the viscous stress tensor. The total (viscous) aerodynamic force F, = F, + Fs 
is then the sum of pressure and friction force. 

For an angle of attack a, the lift induced by the aerodynamic forces is defined as the pressure 
force projected in the direction perpendicular to the freestream facing upwards: 





— sin(a) 
L, =F,- | cos(a) | , 
0 





and the drag as the force parallel and opposed to the freestream: 


cos(a) 
D, =F, - | sin(a) 
0 


A graphical representation of the drag and lift vectors is given in Figure 5.11. 


5.2.2 Primal and Sensitivity Results 


The pressure distribution along the airfoil surface for three different angles of attack is shown in 
Figure 5.12. The zero surface integral between the pressure on the upper and lower surface of the 
airfoil, for zero angle of attack, shows that no lift is produced by the airfoil in this configuration. 

The sensitivities have been found to strongly depend on a finely resolved boundary layer (y* < 1), 
even more so than the primal pressure distribution. Also the sensitivity fields for drag differ 
fundamentally between viscous and non-viscous formulation of the cost function. ‘This is to be 
expected, because at the chosen Reynolds numbers the viscous drag contributes a considerable 
amount to the total drag. ‘The lift is less influenced by the viscous effects. 

To obtain a simulation state which converges with reverse accumulation, 1.e. a state where 
the adjoint iteration is contractive, the relaxation factors for the primal equations had to be 
slightly lowered. While the primal converges smoothly for under relaxation factors of 0.9 for the 
momentum equations (using the consistent SIMPLEC scheme) and 0.7 for the Spalart-Allmaras 
turbulence equations, convergence of the adjoint could only be obtained after lowering the under 
relaxation factors for the primal equations to 0.8 and 0.6 respectively. That is, the under-relaxed 
field (here for the pressure) p4*! is calculated with under relaxiation factor \ as 


Pu = Api + (1—A)p'. 





The sensitivity results for the drag w.r.t. movement of the surface nodes in surface normal 
direction are shown in Figure 5.14. Negative sensitivities indicate the desire to move the surface 
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Figure 5.12: Pressure distribution along the surface of the NACA0012 wing for zero, two and 
four degree angle of attack. Pressure on lower surface solid and dashed on upper surface. ‘The 
area enclosed by the lower and upper pressure curve corresponds to the total lift of the wing 
(excluding the viscous forces). 


nodes against the airfoil surface normal direction, lowering the cross section of the airfoil. As 
expected, the sensitivities for zero angle of attack are symmetric along the centerline of the airfoil. 
For two and four degree angle of attack, the sensitivities are asymmetric, with the lower chord 
of the airfoil exhibiting higher sensitivity values. Overall the sensitivities are visibly correlated 
to the pressure distribution. The sensitivities are smooth along the airfoil surface, except for a 
singularity point limited to the trailing edge of the geometry. 

Using the Spalart-Allmaras turbulence model, an issue with the calculation of the wall dis- 
tance y*, needed for the calculation of the turbulence equations, was identified. If this calculation 
is left unchanged, it introduces considerable noise into the adjoints, especially near the leading 
edge. If the calculation of the wall distance is treated with a frozen adjoint assumption, the 
adjoints become very smooth, while still retaining the same overall shape. Both the uncorrected 
and corrected sensitivities for a coarse mesh with two degree angle of attack can be seen in Fig- 
ure 5.13. The fix almost completely removes the instabilities observed around the trailing edge of 
the airfoil, only leaving a singularity around the trailing edge. ‘To make sure the correction does 
not completely remove the adjoint sensitivities of the turbulence model from the calculation, the 
same configuration is run with a frozen turbulence assumption. ‘The results of this calculation are 
also shown in Figure 5.13. This reveals that the sensitivities of the airfoil obtained with frozen 
turbulence, while still indicating that the cross section of the airfoil needs to be decreased to 
improve drag, exhibit a completely different behavior. In particular the frozen adjoint solution 





(even with the previous fix to wall distance calculation applied) exhibits issues at the leading 
edge, which do not appear with the fully differentiated Spalart-Allmaras turbulence model. ‘To 
summarize, calculating the wall distance with a frozen adjoint calculation only changes the overall 


196 


5.2 Shape Sensitivities of NACA Airfoil 


—(0).4 


—0.6 


Sensitivity 





— Regular meshWave wall distance 
P — Fixed meshWave wall distance 
aa —— Frozen turbulence 





01 0 O1 02 03 04 05 06 0.7 08 09 1 11 


Normalized position on wing centerline 


Figure 5.13: Sensitivity of the drag w.r.t. movement of surface nodes of the NACA 0012 airfoil in 
surface normal direction. The noisy green curve is calculated without the modified adjoint 
wall distance, the smooth orange curve with the modification. 


perception of the adjoints a little, while considerably improving smoothness of the adjoints. In 
contrast, introducing a completely frozen turbulence assumption changes the overall appearance 
of the adjoint sensitivities and exhibits additional issues. 

The results for the lift on the same case are presented in Figure 5.15. For zero angle of attack, 
the results are symmetrical around the x-axis, indicating that the top surface needs to be moved 
outwards to increase lift, while the lower surface needs to be pushed inwards. ‘his is consistent 
to the shape of an asymmetric NACA wing. For increasing angle of attack, the sensitivity 
distribution shifts in positive direction. Again the sensitivities exhibit issues at the trailing edge 
and are smooth otherwise. 

The sensitivities can be subsequently used as an input to a mesh morpher, e.g. the one 
introduced in Section 4.4, with the aim to obtain an improved geometry. 
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Figure 5.14: Sensitivities of drag w.r.t. movement of surface nodes of the NACA 0012 airfoil in 


surface normal direction. 


-10° 


— 0° angle of attack 
— 2° angle of attack 
— 4° angle of attack 


Sensitivity 





—0.1 0 O1 02 03 04 05 06 07 08 09 1 11 


Normalized position on wing centerline 


Figure 5.15: Sensitivities of lift w.r.t. movement of surface nodes of the NACA 0012 airfoil in 


surface normal direction. 
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5.3 Further Applications of Discrete Adjoint OpenFOAM 


Discrete adjoint OpenFOAM has been applied to a variety of different applications by other 
researchers, or by students writing their thesis at STCE. 

The tangent and adjoint model has been used to differentiate through the OpenCascade 
CAD environment, also including the whole OpenFOAM mesh generation procedure, using the 
snappyHexMesh mesher. The solution process is differentiated using the adjointSimpleFoam 
solver. This allows to optimize with parameters defined directly in the CAD environment, by 
using the spline control points of the intermediate NURBS representation as parameters |Gez16]. 

A parametric optimization study has been performed, coupling the sensitivities obtained by 
shape adjoints on a wing shaped rudder, with the proprietary parametric optimization framework 
CAESES by Friendship Systems AG. Studied were the thickness and twist of the wing in a flow 
with low angle of attack |FT16]. 

Some of the concepts presented in this thesis have been applied to the Foam-extend project, 
allowing to run an even wider range of solvers, as well as advanced discretization methods, such 
as explicitly coupled block solvers [STN]. Furthermore, discrete adjoint OpenFOAM has been 
used to obtain transient adjoints in the context of the aboutFLOW project |EU16]. Higher order 
approaches and the accumulation of full Hessians have been explored in [Pee16]. 

The discrete shape optimization approach, using the Spalart-Allmaras turbulence model, 
was used to obtain sensitivities and improve the drag of the student competition solar car 
Sonnenwagen |Mol18]. A shape sensitivity, with respect to the drag of the car in a wind tunnel 
configuration, is shown in Figure 5.16, demonstrating the applicability to complex geometries. 
Similarly, the sensitivities of a flow around the rear wing of a touring car has been studied 
in |Pes16]. 

Discrete adjoint OpenFOAM is available as open-source under the GNU GPLv3?. 








Figure 5.16: Shape sensitivities w.r.t. drag on Sonnenwagen solar competition car geome- 
try |Mol18]. Red cells need to be moved outwards, blue cells inwards. 


* stce.rwth-aachen.de/foam 
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6.1 Summary 


Starting from a black-box application of AD, introduced as a proof of applicability, different 
strategies were explored to lower the runtime and memory impact of the adjoint mode of 
AD. For steady state cases, the fixed point properties of the solution algorithms have been 
extensively exploited, using reverse accumulation, the piggyback method, or the discrete adjoint 
residual approach. 

For the residual approach, different coloring techniques have been implemented to speed up the 
calculation of the Jacobians using either tangent or adjoint mode, exploiting the sparse nature of 
FVM matrices. For transient cases, or if the convergence of the steady primal case is not sufficient 
to reliably obtain adjoints using the aforementioned methods, the whole iteration history can 
be adjoined with low memory overhead, while still retaining acceptable run time behavior, by 
using binomial checkpointing. The differentiated versions of all solution methods, which rely on 
embedded linear solvers, profit immensely from the optimizations implemented with SDLS. 

Per time step, piggyback run time factors under ten can be achieved, when compared to a 
passive primal execution. The memory consumption of the adjoint also generally is increased by a 
factor of around ten. Using reverse accumulation, the run time factor between adjoint and primal 
is lower, as no augmented primal needs to be calculated for each reverse accumulation iteration. 

During the implementation of the adjoint framework, an emphasis was to retain the parallel 
scalability of the primal and adjoint code. In particular this involved the introduction of AMPI 
to the discrete adjoint CFD framework. In addition to providing improved run time behavior, the 
decomposition of cases onto multiple processing nodes spreads the memory demand onto multiple 
machines. This both improves the memory throughput and also removes the demand for machines 
with unusually high amounts of available RAM. Using these capabilities, cases on complex 
meshes with over 7 million cells could be calculated. If, even with all applied optimizations, 
the RAM demand remains higher than what is available in hardware, the dco/c++ tape can be 
offoaded onto secondary storage (preferably low latency, high throughput, such as SSD storage). 
Due to its random access nature, the adjoint vector is retained in RAM. In order to remove 
the bottleneck of the adjoint vector outgrowing the amount of available RAM, adjoint vector 
compression techniques have been discussed and implemented. ‘The improved adjoint vector 
techniques proved to be very effective when adjoining long iteration histories, without majorly 
impacting the run time of the augmented primal and reverse propagation. Having the discrete 
adjoint available for a whole simulation environment proves to be very valuable. While arguably 
not as efficient as an approach tailored specifically to a specific application, it provides much 
more flexibility. 





Another advantage of a tool based AD implementation is the availability of different models of 
differentiation on the same code base. A feature added to the primal is immediately available in 
adjoint, tangent, and higher order versions, by just changing some environment variables. 

The discrete adjoint framework has been applied to a variety of CFD cases, ranging from 
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ducted flows to external aerodynamics. The availability of fully differentiated turbulence models 
makes the discrete adjoint method attractive for applications, where the turbulent quantities are 
believed to have a major impact on the objective. A variety of cost functions, mostly regarding 
external aerodynamic flows, have been implemented. New and blended cost functions can be 
readily implemented, due to the flexibility of the discrete adjoint. Penalization approaches can be 
used to implement basic constraints. Using the on demand compilation features of OpenFOAM, 
this can potentially even be done on a case by case basis, implementing the cost function as part 
of the case configuration. In addition to the high dimensional sensitivities, produced by topology 
and shape optimization, a parametric optimization approach was implemented, demonstrating 
the full AD treatment of a meshing tool. 


6.2 Outlook 


The discrete adjoint approaches presented in this thesis can be applied to a variety of different 
applications, either within the OpenFOAM framework, or in the general CFD field. The 
application to large scale transient simulations remains challenging, but also has the potential 
to create results not obtainable by other methods. Furthermore, for transient applications, the 
continuous adjoint approach faces some of the same challenges as the discrete adjoint, closing the 
performance gap. With the advances in the file tape and parallel adjoints made in this thesis, the 
computation of sensitivities for very large and complex geometries becomes feasible. With the 
developed profiling methods, further avenues of optimization for the AD implementation should 
be identified and implemented. In addition to the already employed forward activity analysis, 
additional optimizations can be applied to the tape, improving performance, especially if the tape 





is evaluated multiple times. 

To expand the capabilities of the discrete adjoint framework, the application to multi-physical 
optimization is promising. The flexibility of the discrete adjoint makes it applicable to a wide 
variety of simulation approaches and code bases already in existence, without requiring major 
code rewrites. A planned development is the incorporation of the discrete adjoint into coupled 
heat transfer (CHT) problems. 

Another advanced topic is the incorporation of robust optimization, to obtain designs which 
are feasible under a variety of operating conditions. The availability of higher order adjoints 
facilitates the usage of sophisticated robust optimization schemes. 

More complex constrained optimization methods should be explored, in order to generate 
solutions which can be produced cost effectively (design to manufacture), or which adhere to 
certain design constraints (e.g. fixed amount of volume). The already implemented parametric 
optimization capabilities could be used to directly couple the simulated geometries to their CAD 
representations, allowing for a more construction driven optimization approach. 

Besides optimization problems, other applications of the adjoint methods are conceivable, and 
already pursued by other researchers, using the discrete adjoint OpenFOAM framework. For 
example, the application of adjoint error estimators for adaptive mesh refinement, or the usage of 





adjoints for uncertainty quantification. 
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A Developer Documentation 


A.1 Location of Discrete Adjoint OpenFOAM Solvers 


Discrete adjoint OpenFOAM strives to integrate into the native OpenFOAM framework 
as seamlessly as possible. ‘The general folder layout remains unchanged from the native 
OpenFOAM implementation. Let $FOAM_INST_DIR point to the OpenFOAM base direc- 
tory, then the solvers unique to discrete adjoint OpenFOAM are located in $FOAM_INST_ 
DIR/applications/discreteAdjointOpenFOAM. Adjoint solvers are located in $FOAM_INST_ 
DIR/applications/disreteAdjointOpenFOAM/adjoint, tangent solvers in $FOAM_INST_DIR/ 
applications/disreteAdjointOpenFOAM/tangent, passive solvers (e.g. for finite differences) 
in $FOAM_INST_DIR/applications/disreteAdjointOpenFOAM/passive, and support libraries 
(for checkpointing and the calculation of cost functions) in $FOAM_INST_DIR/applications/ 
disreteAdjointOpenFOAM/libs. The dco/c++ header files are located in $FOAM_INST_DIR/src/ 
OpenFOAM/dco. 

The most relevant subdirectories for the usage and configuration of discrete adjoint OpenFOAM 
are listed below: 


. $FOAM_INST_DIR 
applications 
discreteAdjointOpenFOAM 
adjoint 
experimental 
fd 
libs 
-— libCostfunction 
libCheckpointing 
passive 
tangent 
tools 
solvers 
etc 
bashrc 
derivativesSettings.sh 
Src 
|__ OpenFOAM 
L_ dco 
|__ dco.hpp 
tutorials 
wmake 


Figure A.1: Folder structure of discrete adjoint OpenFOAM, with root at $FOAM_INST_DIR. 
Folders not immediately relevant are omitted for space reasons. 
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A.2 Compile Options 


Discrete adjoint OpenFOAM uses the wmake build system of OpenFOAM. The environment of 
OpenFOAM is set by sourcing the etc/bashrc script file. Discrete adjoint OpenFOAM extends 
the environment variables of OpenFOAM (identifiable by the prefixes WM_* and FOAM_*) with a 
set of variables with the prefix DOF_*. 

The following variables are currently implemented: 





DOF_AD_OPTION: Choice of AD option to be used. 
One of {A1S | T1S | T1V | T2A1S | T2T1S | Passive }, default: A1S. 


DOF_COMPILER: Choice of compiler, gets passed along to WM_COMPILE_OPTION. 
One of {Gcc | Clang | Icc}, default: Gcc. 


DOF_COMPILE_OPTION: Specify level of optimization. 
One of {Opt | Debug | Prof } default: Opt. 


DOF_BUILD_PROCS number of threads on localhost for parallel compilation, default: 8. 


The flags for dco/c++ are set in the environment variable DOF_DCO_FLAGS. he DOF_DCO_FLAGS 
are set by the bashrc script, depending on the choice of DOF_AD_OPTION. ‘The flags are added to 
the compilation flags and configure dco/c++, such that only the needed features are instantiated. 
The choice of flags is detailed in the following subsections. 





A.2.1 Passive Mode 


For passive mode, all non-essential features of dco/c++ are disabled. Neither adjoint nor tangent 
data types are available. To enable the use of the same code base for all variants, some dco/c++ 
functions such as dco: :passive_value() are still used. However they should be optimized out 





by the compiler. 


export DOF_DCO_FLAGS="-DDCO_NO_DEFAULT" 


A.2.2 First order adjoint mode (A1S) 


For adjoint mode, the generic adjoint type (gaits) is instantiated, allowing the definition of 
Foam: :scalar as dco: :gais<double>: :type. A chunk tape is used, allowing the tape to gradually 
grow in chunks. ‘Tape callbacks are needed for the symbolic differentiation of linear solvers. Activity 
analysis is enabled, reducing the tape size for regions not dependant on the registered inputs. 
For cases which require tape indices bigger than 2°!, a 64 bit datatype has to be requested to 
enumerate the tape entries with DDCO_TAPE_USE_LONG_INT. 








export DOF_DCO_FLAGS="\ 
-DDCO_NO_DEFAULT \ 
-DOF_DCO_MODE_A1S \ 
-DOF_DCO_A1IS_CONT_LINEAR \ 
-DDCO_GA1iS \ 
-DDCO_CHUNK_TAPE \ 
-DDCO-TAPE_CALLBACKS \ 
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A.2 Compile Options 


-DDCO_TAPE_USE_LONG_INT \ 
-DDCO_TAPE_ACTIVITY \ 
-DDCO_ALLOW_TAPE_SWITCH_OFF" 


A.2.3 First Order Tangent Scalar Mode (T1S) 


For scalar tangent mode, the generic tangent type (gt1s) is enabled, allowing the definition of 
Foam: :scalar as dco: :gtis<double>: : type. 


export DOF_DCO_FLAGS="\ 
-DUF SDCUSMUDEST TS 
-DDCO_NO_DEFAULT \ 
-DWCOSGIIS™. 
-DDEULT LES SACTIVITY” 


A.2.4 First Order Tangent Vector Mode (T1V) 


For the vector tangent mode, the generic tangent type (gtiv) is enabled, allowing the definition 
of Foam: :scalar as dco: :gtiv<double,d>::type. The vector size d is set to 5 and is fixed at 
compile time for performance reasons. If another vector size is required, it can be changed in 
DCO_T1V_SIZE. All object files have to be recompiled in order to apply the changes to the vector 
size. ‘The optimal vector size is dependant on cache behavior, the amount of available RAM and 
the number of needed derivatives. A reference to the i-th tangent of a variable x can be accessed 
by using the dco: :derivative(x) Li] method. 


export DOF_DCO_FLAGS="\ 
-DOP SD CULMODEST hy 5 
=DDCO_NGLDEPAULT \ 
=DUCGCOSGTIV \ 
-DDCO_T1V_ACTIVITY \ 
-DDCO_VECTOR_SIZE=5" 


A.2.5 Second Order Tangent Over Adjoint Mode (T2A1S) 


For the second order tangent over adjoint mode, both the generic adjoint and tangent types are 
instantiated. Those types can be arbitrarily nested to obtain higher order derivative models. ‘The 
scalar tangent over adjoint mode is obtained by nesting an tangent type inside an adjoint type, 
yielding dco: :gais<dco: :gt1s<double>: :type>: : type. 


export DOF_DCO_FLAGS="\ 
-DOF_DCO_MOBE_T2A135 \ 
-DDCO_NO_DEFAULT \ 
-DOF_DCO_A1S_CONT_LINEAR \ 
-DDCO 4G 15% \ 
-DDCO_GA1S \ 
-DDCO_TAPE_CALLBACKS \ 
-DDCO_TAPE_USE_LONG_INT \ 
-DDCO_TAPE_ACTIVITY \ 
SD DCOALLOW_TAPE_SWITCH_OFF" 
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A.2.6 Second Order Tangent Over Tangent Mode (T2T1S) 


The (scalar) second order tangent over tangent mode nests a tangent type inside another tangent 
type, yielding dco: : gt1is<dco::gtis<double>::type>::type. ‘The flags are identical to first 
order tangent scalar mode. 


1)}export OF_DCO_FLAGS="\ 
2 -DOFSDCOLMODESTI2T1S \ 
3 -DDCO_NO_DEFAULT \ 

4 -DDCO_GT1iS \ 

5 -PDCULTISLACTIVITY™ 
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B Full SDLS Code 


1} template <> 
2 Foam::solverPerformance Foam::fvMatrix<Foam::scalar>::solveSegregated 


3 


( 


) 
{ 


const dictionary& solverControls 


GeometricField<scalar, fvPatchField, volMesh>& psi = 
const_cast<GeometricField<scalar, fvPatchField, volMesh>&>(psi_); 


scalarField saveDiag(diag()); 
addBoundaryDiag(diag(), 0); 
scalarField totalSource(source_); 
addBoundarySource(totalSource, false); 


ADmode::global_tape->switch_to_passive() ; 


auto* D = ADmode::global_tape->create_callback_object<ADmode:: 
external_adjoint_object_t>(); 


int nu = 0; if(this->hasUpper()) nu this->upper().size(); 
int nd this->diag().size(); 


O; if (this->hasLower()) nl = this->lower().size(); 


cba ty oa mes ale E 


// register rhs as adjoint inputs 
forAll(totalSource, i) 
D->register_input (totalSourcelil]); 


// register matrix coefficients as adjoint inputs 
if (this->hasUpper () ) 
for€int i= 0; i < nu; i++) 
D->register_input (this->upper()Li]); 


if (this ->hasLower (~)) 
for(int i = 0;.8 <@ml; i++) 
D->register_input (this->lower() li]); 


for(int i= 0; i < nd; itt) 
D->register_input (this->diag()Li]); 


// register boundary coefficients if on parallel boundary 
forAll (psi. boundaryField() ,i) 
if (psi.boundaryField().types() [i] == "processor") 
forAll (boundaryCoeffs_[i],j) 
D->register_input (boundaryCoeffs_[i]lj]); 


solverPerformance solverPerf = lduMatrix::solver::New 


( 


psi.name(), 
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A8 *this, 

49 boundaryCoeffs_, 

50 internalCoeffs_, 

51 psi_.boundaryField().scalarInterfaces(), 

52 solverControls 

53 )->solve(psi.primitiveFieldRef(), totalSource) ; 

54 

55 D->write_data(psi.name()); 

56 D->write_data(Foam::direction(-1)); // dummy direction 


57 D->write_data(*this); 
58 D->write_data(psi) ; 


59 

60 forAll(psi.primitiveField() ,i) 

61 psi.primitiveFieldRef() Li] = D->register_output (dco::passive_value((psi. 
primitiveFieldRef()[il))); 

62 


63 ADmode:: global_tape->insert_callback<ADmode::external_adjoint_object_t >( 
symbolic::fil11SolverGap<Foam::scalar>,D) ; 
64 ADmode::global_tape->switch_to_active() ; 


65 

66 diag() = saveDiag; 

67 psi.correctBoundaryConditions () ; 
68 } 


Listing B.1: fvMatrix solve with creation of solver gap and checkpoints of the necessary data. 
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1} template<class Type> 
2 void fillSolverGap(typename Foam::ADmode::external_adjoint_object_t *D)f{ 
const Foam::word& fieldName = D->read_data<Foam::word>(); 

const Foam::direction& cmpt = D->read_data<Foam::direction>() ; 
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const Foam::fvMatrix<Type>& A = D->read_data<Foam::fvMatrix<Type> >(); 


const Foam::volScalarField& x_ref 


Foam::scalarField ai_x(x); 
forall Calox. i) 


ai_x[i] = D->get_output_adjoint (); 


Foam::fvMatrix<Type> A_T(A); // will hold transpose 
Foam::label nu = 0; if(A.hasUpper()) nu = A.upper () 
Foam::label nl = 0; if (A.hasLower()) nl = A.lower() 


Foam::label nd = A.diag().size(); 


// component will return this for 

Foam::FieldField<Foam::Field,Foam: 
component (cmpt) () ; 

Foam::FieldField<Foam::Field,Foam: 
component (cmpt) () ; 


// transpose matrix if necessary 
bool sym = A.symmetric(); 
if (!sym){ 
A_T.lower() = A.upper(); 
A_T.upper() = A.lower(); 


scalar field 
;>scalar> bempts % 


;>scalar> g@empts = 


D->read_data<Foam::volScalarField>(); 
const Foam::scalarField& x = x_ref.primitiveField(); 


// read incoming adjoints from tape 


of A 


. Sez () ; 
Bs ize () 4 


A_T.boundaryCoeffs(). 


A_T.internalCoeffs(). 


// switch boundary and internal coeffs for transposed 
icmpts = A_T.boundaryCoeffs().component (cmpt) (); 
bempts = A_T.internalCoeffs().component (cmpt) (); 


} 
Foam::word reverseFieldName = fieldName + Foam::word("Reverse") ; 
Foam::volScalarField ai_b(reverseFieldName ,x_ref); 
const dictionary& reverseSolverControls = al_b.mesh().solverDict(L...]); 
Foam::lduMatrix::solver solver = Foam::lduMatrix::solver:: New 
( 

fieldNameCmpt + Foam::word("Reverse"), 

A_T, 

bempts, 

icmpts, 


ai_b.boundaryField().scalarInterfaces(), 


reverseSolverControls 


ye: 


// solve Waor bi 
Foam::solverPerformance solverPerf 
aul a) : 


ae. ey / Bontinued in next listing 


= solver->solve(ai_b.primitiveFieldRef (), 


Listing B.2: Adjoint callback routine, solution of the adjoint system. 
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1 \-ele//7 continuation “Of previous listing 

2 // increment input adjoint for b 

3 for(int i = 0: 1 < all-b.size(): i++) 

4 D->increment_input_adjoint (dco::passive_value(ai_bli])); 
5 

6 // Adressing for upper and lower half of matrix 

7 const Foam::label* uPtr = A_T.lduAddr().upperAddr().begin() ; 
8 const Foam::label* 1Ptr = A_T.1lduAddr().lowerAddr().begin() ; 
9 

10 // increment input adjoint for A (upper part) 

11 double tmp = QO; 

1 if (A.hasUpper()){ 

13 for(int i = 0; i < nu; itt+)f{ 

14 tmp = dco::passive_value(-ai_b[1Ptr[iJ]*x[uPtr[i]]); 
15 if (1Ptrli] != uPtr[i] && sym) 

16 tmp += dco::passive_value(-ai_b[uPtr[li]]*x[1Ptr[li]]); 
ih D->increment_input_adjoint (tmp) ; 

18 } 

i9 = Ss 

20 // increment input adjoint for A (lower part) 

21 if (A.hasLower()){ 

22 for€int i = 0; i < nl; i++)f{ 

23 tmp = dco::passive_value(-ail_bluPtr[i]]*x[1Ptr[il]); 
24 if (1Ptrli] != uPtr[i] && sym) 

25 tmp += dco::passive_value(-ai_b[1Ptr[li]]*x[uPtr[li]]); 
26 D->increment_input_adjoint (tmp) ; 

og Ir 

23, 

29 // increment input adjoint for A (diag part) 

30 for€int i = 0; i < nd; itt) 

31 D->increment_input_adjoint (-ai_b[li]*xli]); 

32 

33 Foam::volScalarField xSF(x_ref); 

34 forAll(ai_b.boundaryField() ,i)f 

35 if(al_b.boundaryField().types() Li] == "processor"){ 

36 ai_b.boundaryFieldRef () Li].initEvaluate() ; 

37 ai_b.boundaryFieldRef () Li].evaluate() ; 

38 xSF.boundaryFieldRef () [i].initEvaluate () ; 

39 xSF.boundaryFieldRef () [i].evaluate() ; 

40 

41 forAl1(A_T.boundaryCoeffs()lil,j)f 

42 Foam::Field<Foam::scalar> x_other = 

43 xSF.boundaryField() [i].patchNeighbourField() (); 
44 Foam::Field<Foam::scalar> al_b_this = 

45 ai_b.boundaryField() [i].patchInternalField() (); 
46 tmp = dco::passive_value(x_other[j]*ai_b_this[j]); 
AT D->increment_input_adjoint (tmp) ; 

48 i 

49 i 

50| + 

51| } 


Listing B.3: Continuation of adjoint callback routine, incrementation of input adjoints. 
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C Reference Solver and Case 


C.1 Reverse Accumulation for Incompressible Steady Flows 


(ae eee 


#include 
#include 
#include 
#include 
#include 


#include 
#include 
#include 


GPLv3 License Header 


PE wer De. 
"“singlePhaseTransportModel.H" 
"turbulent TIransportModel .H” 
Peamplecontrol. i! 
PEvapetous i! 


"CheckInterface.H" 
mCweckDarc ti 
VSOsOPUunceLoOnldDrary i. 


13, int main(int argc, char x*argv[]) 


14| { 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
a7 
38 


#include 
#include 
#include 


"setRootCase.H" 
"createTime.H" 
"createMesh.H" 


simpleControl simple(mesh) ; 


#include 
#include 
#include 


#include 


 enrcoarer we sh 
"createFvOptions.H" 
PinitContanuitybrrs., Te 


"adjointSettings @h" 


CheckInterface check(runTime) ; 
CheckDict checkDict (&runTime) ; 


CheckDatabase checkDB(&runTime ,&checkDict) ; 


turbulence ->validate(); 


dco::gais<double>::global_tape 


auto zero_to = global_tape->get_position() ; 
auto interpret_to = global_tape->get_position(); 


bool frozenTurbulence = false; 
scalar adjointEps = ie-6; 


while (simple.loop()){ // iterate passive until primal convergence 
Pressure-velocity SIMPLE corrector 


“/ <- 


#include "UEqn.H" 
#include "pEqn.H" 


dco; :gals<double>:: tapest:.: create ()'; 
autok& global_tape = dco::gais<double>::global_tape; 


global_tape->switch_to_passive(); // iterate passive until 


primal convergence 
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C’ Reference Solver and Case 


46 laminarTransport.correct() ; 

AT turbulence ->correct(); 

48 

49 if (simple.criteriaSatisfied()) 
50 break; 

51 } 

52 

53 // run and tape one active step 


54 global_tape ->switch_to_active() ; 

55 global_tape ->register_variable(alpha.begin() ,alpha.end()); 
56 

57 zero_to = global_tape->get_position(); // don't zero alphas 
58 checkDB.registerAdjoints(); // register state 

59 interpret_to = global_tape->get_position() ; 

60 

61 #include "UEqn.H" 

62 #include "pEqn.H" 


63 laminarTransport.correct() ; 

64 if (frozenTurbulence) 

65 global_tape->switch_to_passive() ; 

66 turbulence ->correct(); 

67 global_tape ->switch_to_active(); 

68 

69 J = CostFunction(mesh).eval(); // eval cost, seed later 
70 checkDB.registerAsOutput () ; 

ral global_tape->switch_to_passive(); // no more recording needed 
7 

73 iM 2b = Oe 


74. scalar eps = Foam::GREAT; 
75 while(eps > adjointEps ) 


76| 

ei if(iter == 0 && Pstream::master ()) 

78 dco::derivative(J) = 1; 

79 ete 

80 checkDB.restoreAdjoints() ; 

81 

82 global_tape ->interpret_adjoint_to(interpret_to) ; 

83 checkDB.storeAdjoints () ; 

84 

85 static scalar firstEps = checkDB.calcNormO0fStoredAdjoints(); 
86 eps = checkDB.calcNorm0fStoredAdjoints() / firstEps; // normalize eps 
87 global_tape ->zero_adjoints_to(zero_to) ; 

88 aay ee tt 

so} 

90 forAll(alpha,i) // extract sensitivities after reverse acc converged 
91 sens[i] = dco::derivative(alphali]) / mesh.V() (il; 

92 runTime.write(); 

93 dco::gais<double>::tape_t::remove(global_tape) ; 

94 return 0; 

95 } 

96 
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Listing C.1: Source of reverseAccSimpleFoam 
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C.2 Example Case Configuration 


C.2 Example Case Configuration 


1 dimensions [0 1-100 0 0]; 

2 internalField uniform (0 O 0); 

3 

4 boundaryField{ 

5 inlet{ 

6 type fixedValue; 

7 value uniform (100 O 0); 
8 } 

9 walls{ 

10 type fixedValue; 

1 value uniform (0 O 0); 
12 } 

iS outlet{ 

14 type zeroGradient ; 

15 } 

16 defaultFaces{ 

17 type empty ; 

18 } 

19 } 


Listing C.2: Velocity boundary conditions 0/U 


1 dimensions [O22 0) 0 001 
2 internalField uniform 0; 

3 

4 boundaryField{ 

5 inlet{ 

6 type zeroGradient ; 
7 } 

8 walls{ 

9 type zeroGradient ; 
10 } 

11 outlet{ 

12 type fixedValue; 
13 value uniform 0; 

14 } 

15 defaultFaces{ 

16 type empty ; 

17 } 

18, } 


Listing C.3: Pressure boundary conditions 0/p 
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C’ Reference Solver and Case 


transportModel Newtonian; 
nu nu [0 2 -1 000 0] 1; 


Listing C.4: Viscosity and transport model constant/transportProperties 


SimulationType laminar ; 


RAS 

{ 
RASModel kEpsilon; 
turbulence Oust 
printCoeffs on; 

t 


Listing C.5: Turbulence model constant/turbulenceProperties 


checkpointSettings{ 
checkpointingMethod revolve; // equidistant, revolve or none 
nCheckpoints 10; 
nTapesteps ills 

t 


checkpointRequired{ 
U; 
Pp; 
phi: 

} 


Listing C.6: General settings system/checkpointingDict 
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C.2 Example Case Configuration 


/*-------------------------------- *- C++ -*---------------------------------- *\ 
| SSSSSS5=5 | | 
NS / F ield | OpenFOAM: The Open Source CFD Toolbox | 
| \\ i O peration | Version: plus | 
| Nee, A nd | Web: www.OQOpenFOAM.com | 
| wy M anipulation | | 
A ee ee ee ee a ee ee ee ee * / 


FoamFile{ [...] } 


application 
startFrom 


SimpleFoam; 
startTime; 


startTime OF 
stopAt endTime; 
endTime Bor 
deltaT flee 
writeControl timestep; 
writeInterval 100; 
purgeWrite OF 
writeFormat ascil; 
writePrecision 6; 
writeCompression off; 
timeFormat general; 
timePrecision Ge 
runTimeModifiable true; 

Listing C.7: system/controlDict 
/*-------------------------------- *- C++ -*---------------------------------- *\ 
| sssss=s== | | 
FOS / F ield | OpenFOAM: The Open Source CFD Toolbox | 
b ONS /, O peration | Vegrstion : 3.0.x | 
| Ae A A nd | Web: www.OQOpenFOAM.org | 
| aw, M anipulation | | 
Wh SS 9 5 5 9 5 5 5 8 6 5 5 5 59 5 5 5 5 5 5 5 9 5 5 5 5 5 5 5 9959 5 5 5 5 Se * / 
oevmeadber la eoll 
ddtSchemes{ default steadyState; } 
gradSchemes{ default Gauss linear; } 
divSchemes{ 

default none; 


div (phi ,U) 


t 


bounded Gauss upwind; 
div ((nuEff*dev2(T(grad(U))))) Gauss linear; 


laplacianSchemes{ default Gauss linear corrected; } 


interpolationSchemes{ default linear; 


snGradSchemes{ default corrected; } 
wallDist{ method meshWave; } 


t 


Listing C.8: system/fvSchemes 
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C’ Reference Solver and Case 


/*-------------------------------- *- C++ -*---------------------------------- * \ 
| SSSSSS==5 | | 
ENN / F ield | OpenFOAM: The Open Source CFD Toolbox | 
| \\ / O peration | Version: plus | 
| Wy A nd | Web: www.OQOpenFOAM.com | 
| N/ M anipulation | | 
WW SS 9 5 5 5 5 5 5 5 5 9 5 5 5 55 5 5 5 559 8 5 5 555 99 5 55 5 5 9 5 5 5 5 Se * / 


FoamFile{ [...] } 
// * * * © * K * * * K K HK K KK K KK  K KK K K KK He KK K KF K K WE ke ke ORY // 


SDLS yes; // global flag to enable or disable SDLS 


solvers 
i 
"C.*)" // catch all equations 
{ 
solver smoothSolver ; 
smoother symGaussseidel ; 
tolerance 1le-05; 
relTol OF 
SDLS SH SDHL IS) & 
i; 


"(plpReverse)" // specialize for pressure correction equation 


{ 


solver GAMG; 
tolerance 1e-06; 
relTol 0; 
smoother DANG 2 
SDLS $SDLS ; 
t 
t 
SIMPLE 
{ 
nNonOrthogonalCorrectors 0; 
consistent yes; 
costFunctionPatches (inet % 
costFunction "“pressureLoss"; 
adjointEps te-5; // treshold for reverse acc / piggyback 
t 
relaxationFactors 
{ 
equations 
{ 
U 0.9; // 0.9 is more stable but 0.95 more convergent 
Wael 0.9; // 0.9 is more stable but 0.95 more convergent 
t 
t 


Va) 2K OK OK OK OK OK KOK OK KK KK KK OK OK OK OK OOK OOK OK OK OK OK OK OK KK KK KOK OK OK OK KK OK OK OOK OK OK OK OK OK OK OK OK KK OK OK OK KOK OK OK OK OK OOK OK OK OK OK OK OK OK OK OK OK OK OK V7, 


Listing C.9: system/fvSolution 
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22 


23 


24 


25 


27 
28 


[Peo Seeese se ese eee Se ee eee eee eee x= 

| ssssscse= | 

os / F ield | OpenFOAM: 

| \\ ii O peration | Version: 1.7.1 

| NS, A nd | Web: 

| Wy M anipulation | 

ee ne ee ee a ee ee a 

FoamFile{ [...] } 

J] FR ee kk ee 

convertToMeters 1; 

vertices ( 
(O 0 0) (2 0 0) (4 0 0) (5 0 0) (0 1 0) (2 1 
Ce ah CD) (C2 OD) eC SO ES 1) GE Ss 0) a & 
CO 0 Sel) G2 0 al 4 Ol) CS Oe) CO le) C2 al 
(Cy al) Cy i Ty Cs eI el) Cs 

); 

m4_define(lvl ,1) 

blocks ( 
hex (0 15 4 13 14 18 17) ( m4_eval (lv1*2) 


hex 
hex 
hex 
hex 
hex 


ie 


simpleGrading (1 1 1) 
simpleGrading (1 1 1) 
simpleGrading (1 1 1) 
simpleGrading (1 1 1) 
simpleGrading (1 1 1) 


simpleGrading (1 1 1) 


(1265 14 15 19 18) 


(2555 PO lo 1620 619) 


CSG 59e cals 19N2252 1) 


C6? 10 Some 2 0523522) 


(9 10 12 11 22 23 25 24) ( 


patches ( 
patch inlet ( 


( 
) 
wal 

( 

( 

( 
) 


patch outlet ( 


0 


1 
0 
3 
5 


4 17 13) 


walls ( 

i 14313) 
7 20 16) 
sy ON alts) 


(1 2 15 14) 
(7 10 23 20) 
(S@i9 2X 21) 


(11 12 @5 24) 


) 
Ee 


(2 
(10 
C8 


m4_eval(lv1*2) 


m4_eval(lvl*1) 


m4 eval (lvl*2) 


m4_eval(lvl*1) 


m4 _eval(lvl*1) 


0) 
0) 
1) 
1) 


C.2 Example Case Configuration 


The Open Source CFD Toolbox 


www.OpenFOAM.com 


(4 1 0) 


(4 1%) 


m4 eval (lvl*1) 


m4_eval(lvl*1) 


m4 _eval(lvl*1) 


m4 eval (lvl1l*2) 


m4_eval(lv1l*2) 


m4_eval(lv1l*2) 


3 16 15) (4 5 18 17) 


127525323) 
11 24 22) 


x ok ke ok ok ek &k Ae ek k ok Ok Me ke ok // 


Vile a RE Wa ee ek See ee ee ae eae Sole ke ee cae ae 


Listing C.10: system/blockMeshDict 
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CoO vrnrn a BP Ww NY F&F 


LD Demonstator AD Tool 


H#Hinclude <iostream> 
#include <vector> 
#include <cmath> 


typedef std::vector<double> adjoint_vector; 


struct partial_edgef 
partial_edge(double v=0, int i=-1) : partial_val(v),target_idx(i){} 
double partial_val; 
int target_idx; 


tee 


struct adjoint_stack : std::vector<std::vector<partial_edge>>f{ 
int longest_edge; 
int n_params; 

Le 


adjoint_stack s; 


struct adjoint_typef 
double val; 
int tape_idx; 
adjoint_type(const double& x): val(x), tape_idx(-1) {} 


void operator=(const adjoint_type& x)f{ 


thas=> val = x yal 
std::vector<partial_edge> p(1,partial_edge(1.0,x.tape_idx)); 
this->tape_idx = s.size(); 
s.emplace_back(1,partial_edge(1.0,x.tape_idx)); 
} 
}; 


void register_variable(adjoint_type& adt)f{ 
S.l_pabams ++ ; 
s.emplace_back(); // push empty entry to tape 
adt.tape_idx = s.size() -1; 


t 


Listing D.1: Data structures for tape and partial edges; Interface. 


adjoint_type operator*(const adjoint_type& xi,const adjoint_type& x2){ 
adjoint_type y(x1.val*x2.val); 
std::vector<partial_edge> partials; 
if(x1.tape_idx >= 0) partials.emplace_back(x2.val,x1.tape_idx) ; 
if (x2.tape_idx >= 0) partials.emplace_back(x1.val,x2.tape_idx) ; 
y.tape_Wax =™s.size() ; 
s.push_back(partials) ; 
returayy,; 
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| 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 


oO wana a FP Ww DY 


WwW wwwnynn nd WN NY NY) NY NY NY NY HP RP RP RP RP RP RP RF RS 
WN F OO DAN DoH FPF WwW nn FPF ODO HOT WON DT FP Ww NY FF OO 


t 


adjoint_type operator+(const adjoint_type& xi,const adjoint_type& x2){ 


adjoint_type y(x1.val+x2.val); 

std::vector<partial_edge> partials; 

if (x1.tape_idx >= 0) partials.emplace_back(1.0,x1.tape_idx) ; 
if (x2.tape_idx >= 0) partials.emplace_back(1.0,x2.tape_idx) ; 
y.tape_idx = s.size(); 

s.push_back(partials) ; 

ieee bieag es 


t 


adjoint_type operator/(const adjoint_type& xi,const adjoint_type& x2){ 


adjoint_type y(x1.val/x2.val); 

std::vector<partial_edge> partials; 

if (x1.tape_idx >= 0) partials.emplace_back(1.0/x2.val,x1.tape_idx) ; 

if (x2.tape_idx >= 0) 
partials.emplace_back(-x1.val/(x2.val*x2.val) ,x2.tape_idx) ; 

y.tape_idx = s.size(); 

s.push_back(partials) ; 

EEuurm yy; 


Listing D.2: Operators 


int find_longest_edge(const adjoint_stack& s){ 
int max_length = 0; 
fom (int a = S.size ()-1. 1>=0-1-—-) 
for(const partial_edge& p : sLil]) 
if (p.target_idx >=s.n_params) 
max_length = std::max(max_length ,i-p.target_idx) ; 
return max_length; 


t 


void interpret_tape(const adjoint_stack& s, adjoint_vector& v)f{ 
for(int i = s.size()-1; i>=0;i--) 
for(const partial_edge& p: sLil]) 
vlp.target_idx] += p.partial_val * vl[il]; 
} 


void interpret_tape_mod(const adjoint_stack& s, adjoint_vector& v)f{ 
const int nC = v.s@ime() ; 


const int nP = s.m_p@rams ; 

const int 1 = s.longest_edge; 

for(int i = 0; @i< @&. size () -nP);i++){ 
const int ti =&s.Saze()-1-i; // position in tape 
const ime j = n@i-(i 4% (1+1)); // position in mod adjoint vector 
std::vector<partial_edge> partial_edges = s[til; 


for(partial_edge& p : partial_edges){ 
if(p.target_idx >= 0O)f{ 

me (emtarcet_idx < nP){ 
vi[p.target_idx] += p.partial_val * vIjl; 

selse{ 
@Ppnuskt int tot — ((j-ne)-(ti-p,targvetsidx)+cltl))y (lel) nr’, 
v[tgt] += p.partial_val * vIjl]; 

} 
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34 } 

35 v[j] = 0; // reset adjoint after all tape edges adjoined 
36 $ 

37| } 


Listing D.3: (Compressed) Tape interpretation 


1,int main(){ 

2 adjoint_type a = 2; 

3 register_variable(a) ; 

4 adjoint_type x = a; 

A) ee ing a, OR aL Si 9 al aeee al 
6 x = 0.5*(a/xtx) ; 

4 

8 

9 


i 
std::cout << “f(x) = " << x.val << Std: endl. 
10 { // interpret compressed 
11 s.longest_edge = find_longest_edge(s) ; 
12 adjoint_vector av(s.n_params + s.longest_edget1l) ; 
13 avlavecize)=1))s= 94100. 9// seed 
14 interpret_tape_mod(s,av); 
15 std::cout << "df/da = " << avla.tape_idx] << std::endl; 
16 = Sg 
7 17/7 anterpret full 
18 adjoint_vector av(s.size()); 
19 av[lav.eize()=1]) = 1.0; // seed 
20 interpret_tape(s,av); 
21 std::cout << "df/da = " << avla.tape_idx]W&< std::endl; 
22 } 
23 return 70; 
24) } 


25 |e, weenie ona 

26|// £(x) = 1.41422 
27 | 7 wt dial = 307 350566 
28/7 dh/ da = 0. 353566 


Listing D.4: Driver calculating derivative of approximation to /x using full and compressed 
adjoint vector representations. 
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Abstract: Open-wheeled race car aerodynamics is unquestionably challenging insofar as it involves many physical phenomena, such 


as slender and blunt body aerodynamics, ground effect, vortex management and interaction between different sophisticated aero 


devices. In the current work, a 2017 Fl car aerodynamics has been investigated from a numerical point of view by using an 


open-source code. The vehicle project was developed by PERRINN (Copyright©2011—Present PERRINN), an engineering 


community founded by Nicolas Perrin in 2011. The racing car performance is quantitatively evaluated in terms of drag, downforce, 


efficiency and front balance. The goals of the present CFD (computational fluid dynamics)-based research are the following: 


analyzing the capabilities of the open-source software OpenFOAM in dealing with complex meshes and external aerodynamics 


calculation, and developing a reliable workflow from CAD (computer aided design) model to the post-processing of the results, in 


order to meet production demands. 


Key words: External aerodynamics, open-source CFD, 2017 F1 car, drag, downforce, efficiency, front balance. 


1. Introduction 


Nowadays CFD (computational fluid dynamics) 


and Motorsport are closely connected and 


interdependent: on the one hand, aerodynamic 
simulations are crucial for designing and developing 
increasingly fast vehicles; on the other hand, the 
extreme research of performance in motor racing 1s 
the catalyst behind the development of sophisticated 
and reliable numerical procedures and innovative 
CAE (computer aided engineering) tools. 

The impact of CFD on motorsport has grown up in 
tandem with computer hardware advances: looking at 
the Formula 1 experience during the period from 1990 
to 2010, simulations evolved from the inviscid panel 
method to one billion cell calculations of entire cars, 
including analysis of transient behaviour and 
overtaking [1]. As witnessed by the Formula | team 
Sauber Petronas, the CFD technology is applied in 
many stages of the vehicle development: early concept 


phase, system design (engine and brake cooling, brake 
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systems), single component design and complete 
Although 


open-wheeled racing car aerodynamics 1s basically an 


system design and interactions [2]. 
unusual field of research, there are a few publications 
from academic world as well as private industries: as 
an example of partnership between university and 
motorsport teams, Zhang, Toet and Zerihan [3] 
reviewed the progress made during the last 30 years 
on ground effect aerodynamics. 

From the point of view of the required 
computational resources, the complexity of the 
geometry and the resulting numerical issues, the 
simulation of realistic open-wheeled cars is really 
challenging: for this reason, Fl teams and researchers 
often rely on commercial software that provide 
user-friendliness, flexibility and reliability: the more 
you spend time on pre-processing and debugging, the 
lesser you can focus on design and_ physics 
comprehension. Examples of this type of study can be 
found in Refs. [4] and [5]: ANSYS software package 
is used to investigate the impact of 2009 FIA technical 
regulations on the aerodynamic performance of F1 


Cars. 
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In addition to CFD commercial solutions, there are 
open-source codes able to execute both the meshing 
phase and the fluid dynamic calculation: one of the 
most popular is OpenFOAM®. This free-license tool 
is successfully used and developed by academic 
researchers [6] and automotive industries [7] in order 
to predict the aerodynamic performance of road cars; 
however, due to some criticalities connected to 
meshing accuracy and numerical stability, it is not 
widespread in high level motorsport applications. In 
the current study, the highly complex aerodynamics of 
a 2017 Fl car has been numerically investigated by 
OpenFOAM®. Prediction reliability has been tested 
against reference data of drag, downforce, efficiency 
and front balance, provided by PERRINN. 


2. CFD Workflow: Software and Hardware 
Tools 


The first step of this study consists of developing a 
standard and reliable CFD workflow (from meshing to 
calculation) for external aerodynamic analysis of very 
of the 
open-source software OpenFOAM. Many degrees of 


complex geometries, through the use 
freedom are available in the case setup: the user can 
decide time and space discretization schemes of the 
Navier-Stokes equation terms as well as the solver for 
each variable [8]. 

The volume mesh is performed by SnappyHexMesh, 
an OpenFOAM utility providing a non-graphical, fast 
and flexible procedure for every kind of geometry, 
especially in external aerodynamics applications. This 
implies a huge time and resource saving in 
comparison with a traditional meshing software 
without a batch mesh utility. On the other hand, it is 
less accurate than some commercial meshing software, 
for instance in adding layers and tracing the edges of 
complex surfaces [8]. 

Both the meshing and the calculations were carried 
out using Galileo, the Italian Tier-1 cluster for 
industrial and public research, available at CINECA 


SCAI (supercomputing applications and innovations). 


The meshing processes were executed by means of 6 
computational nodes, each of which is composed of 
16 cores (8 GB/core); the calculations were instead 
performed using 14 nodes. About 1,500 iterations 
were required to get convergence on the basis of 


residuals lower than 10%. 


3. Pre-processing and Numerical Setup 
3.1 Geometry 


The input file of the geometry must be in 
STereoLithography format (stl). Many commercial 
CAD softwares are able to convert the original model 
in this format, but it is preferable to use only those 
providing a detailed control on the output file, since 
the quality of the stl model is directly connected to the 
quality of the volume mesh and the accuracy of the 
final fluid dynamic results. 

A final check of the .stl file 1s recommended in 
order to control orientation, closure of the surfaces, 
quality of triangles and edges: Netfabb Basic, a free 
software, was used in the current study. 

The stl file of the Fl car, obtained from the original 
project by PERRINN (Fig. 1), contains a lot of 
interesting features and _ challenges from _ the 
perspective of meshing. The full-scale Fl car model, 
whose wheelbase (WB) is 3.475 m long, presents 
many small realistic details such as winglets, fences, 
vortex generators and slots: the smallest elements are 
1.5 mm thick. 

Proximity problems can be found among. the 
suspension arms, the front wing flaps and between the 
underbody and the ground: with the baseline setup 
(front ride height = 20 mm; rear ride height = 50 mm), 
the minimum distance between the plank and the 
ground is 13 mm. A contact patch between the tires 
and the ground, established by the front and rear ride 
height of the vehicle, needs to be defined in order to 
avoid problems of cell skewness. 

Before starting the meshing phase’ with 
SnappyHexMesh, the car model is divided into 


components, so as to analyse separately the behaviour 
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Fig. 1 Rendering of the 2017 F1 car by PERRINN (image from gpupdate.net). 


FRONT WING 


REAR CRASH 
FRONT BODYWORK 





Fig. 2 Geometry in stl format: (a) top view; (b) bottom view. 


of each part of the car body (Fig. 2). 
3.2 Mesh and Simulation Setup 


The domain length is about 18 times the WB of the 
vehicle: the distance between the inlet and the front 
axle is about 4.6 times the WB, while the outlet of the 
virtual tunnel, where the atmospheric pressure is 
imposed, is located well downstream of the car, 1.e. 
13.8 times the WB (Fig. 3). Since the simulation 1s 
steady and the vehicle is perfectly symmetrical, only 
half car is taken into account: the distance between the 
longitudinal symmetry plane and the sidewall is about 
16 times the half-width of the car. The height of the 
domain is 16 times the height (4) of the vehicle. Slip 


condition is imposed on the side wall and the ceiling 


of the wind tunnel, while the ground is moving at the 
same speed imposed at the inlet, for the purpose of 
comparison with the reference calculation made by 
PERRINN. Angular velocity and rotational axis of the 
wheels need to be defined. 

The main features of the mesh are as follows: the 
height of the first cell at all solid surfaces is 0.6 mm 
and the layer expansion ratio is 1.2. The resulting 
average value of y is about 40: this number obliges to 
use wall functions, as is currently done in industrial 
applications. 

Due to the complexity of the geometry and the 
related physical phenomena, many refinement boxes 
need to be defined. Special attention must be given to 


the huge wake region and the parts responsible for 
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downforce: the multi-component ground effect front 
wing, the rear wing composed of a high-cambered 
main plane and a high angle-of-attack flap, and finally 
the underbody, where the flow reaches its highest 
velocity. 

As suggested by preliminary study [8], two 
turbulence models were taken into account: the 
kOmegaSST (k@SST) and the SpalartAllmaras (S-A). 
Physical data needed to define the numerical setup 
of both 


simulations are summarized in Table 1. The car WB, 


and initialize the turbulent variables 
representing the size of the largest eddy, was chosen 
as turbulent length scale. The incompressible RANS 
simulations were performed by the coupled version of 
the simpleFoam algorithm, which is faster and more 
stable than the segregated one, at the cost of more 
computational resources. The GAMG (geometric 
algebraic multi grid) solver was used for the pressure 
equation, whilst smoothSolver was applied for 
velocity and turbulent variables. The entire calculation 
was executed with 2nd order discretization schemes. 
Convergence was considered to be reached whenever 
the scaled pressure and velocity residuals were lower 
than 10° and the aerodynamic coefficients remained 
stable (+ 1% in the last 500 iterations). 

Three different meshes were tested (140 mln, 120 
mln, 90 mln cells): since the results in terms of global 
performance did not change significantly, the coarsest 


one was chosen for the research. 


iniaeeaeee 
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4. Results and Discussion 


The comparison between numerical predictions and 
reference data from PERRINN database deals with 
drag (SCx), downforce (SCz), aerodynamic efficiency 
(Cz/Cx) and front balance (/'B), where S' is the frontal 
area of the car and FB is the ratio between the 
downforce on the front axle and the total downforce. 

The k@SST turbulence model predicted a premature 
separation of the flow on the suction side of the wings 
and along the diffuser: as a result, both downforce and 
drag coefficients were underestimated respectively by 
20% and 11%. 

On the opposite, S-A showed a better behaviour for 
boundary layer in adverse pressure gradient [8]: as 
summarized in Table 2, the results of the coupled 
RANS simulation with S-A model are consistent with 
the reference data. The percentage errors in prediction 
of drag and downforce are respectively 6% and 7%, 
whilst the front balance coefficient differs by 10% 
from reference datum. After proper validation, Table 3 
summarizes the contribution of the main vehicle 
components to the vertical load (SCz). The bottom of 
the car, composed of the underbody and the plank, 
generates more or less the 58% of the overall 
downforce, whilst the front and the rear wing provide 
respectively the 26.3% and the 27.5% of the total 
contribution. Also the front bodywork has a beneficial 


effect in terms of downforce; on the contrary, sidepod 





Fig.3 Volume mesh: (a) symmetry plane; (b) details of the refinement boxes around the car. 


Table 1 Physical conditions of the simulation. 


Variable 

Freestream velocity (u) 

Air density (p) 

Turbulent intensity (/) 
Wheelbase of the vehicle (WB) 
Reynolds Number (Reyz) 


Value 

50 m/s 
1.225 kg/m* 
0.15% 
3.475 m 
12x10° 
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Table 2 Comparison between numerical results and reference data (S-A model). 


SCx [m7] SCz [m°*] 
Reference data 1.23 -3.59 
S-A results 1.16 -3.35 
Error % a) 6.7 


Cz/Cx FB 
2.92 0.448 
2.89 0.403 
1.0 10.0 


Table 3. Contribution to downforce of the main components of the car. 


Component SCz [m7] 
Cockpit +0.04 
Driver +0.02 
Front bodywork -0.10 
Front suspension +0.06 
Front wing -0.88 
Plank -0.45 
Rear suspension -0.01 
Rear wing -0.92 
Sidepod +0.2 
Underbody -1.49 
Upper bodywork +0.18 
Full car -3.35 


Contribution (%) 
Tie 
+0.6 
-3.0 
+1.8 
-26.3 
-13.4 
-0.3 
-27.5 
+6.0 
-44,5 
+5.4 
~-100 


Table 4 Contribution to drag of the main components of the car. 


Component SCx [m7] 
Front tire +0.115 
Rear tire +0.226 
Front bodywork +0.055 
Front suspension +0.012 
Front wing +0.159 
Rear suspension +0.024 
Rear wing +0.235 
Sidepod +0.035 
Underbody +0.181 
Upper bodywork +0.021 
Bargeboard +0.047 
Other parts +0.05 
Full car +1.16 


and upper bodywork generate undesirable lift because 
they deflect the flow downwards. 

Concerning SCx, one can see in Table 4 that wheels 
are responsible for approximately 30% of total drag. 

The underbody is the most efficient aerodynamic 
device, because it makes extensive use of ground 
effect and Venturi effect to generate downforce, in 
contrast to rear wing. Despite the complexity of the 
suspension geometry, its contribution to drag is only 
3%, owing to the fact that arm sections are 


streamlined like a wing profile. 


Contribution (%) 
+9.9 
+19.5 
+4.7 
+1.0 
+13.7 
+2.1 
+20.3 
+3.0 
+15.6 
+1.8 
+4.1 
+4.3 
~-100 


Fig. 4 illustrates the pressure coefficient (Cp) on the 
surface of the car. The bottom of the bodywork is 
characterized by typical low-pressure cores which are 
located at the beginning of the plank, where the 
ground clearance is smallest, at the entrance of 
underbody and rear diffuser. In close proximity to the 
rear tire disturbance, the pressure increases and the 
ground effect benefits are lost. The upper view shows 
the contribution to downforce of the front bodywork, 
due to the shape of the nose cone and the stagnation 


area in front of the cockpit. As regards the wings, it 
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can be noted that the rear wing generates downforce 
mainly due to the high camber of the airfoil design; on 
the contrary the front wing makes use of ground effect 
to accelerate the flow on the suction side. 

Both generation of downforce and induced drag are 
strictly connected with the management of axial 
vorticity: an overall view of these three-dimensional 
identified by the 


iso-surface of the scalar QO [1/s”]: this variable, defined 


rotational structures can be 


as the second invariant of the velocity gradient tensor, 


allows detection of the regions where the Euclidian 
norm of the vorticity tensor prevails over that of the 
rate of strain [9]. 

As illustrated in Fig. 5, the front wing generates not 
only downforce but also vortices that bypass the front 
tires. The bargeboard, apart from shielding the 
underbody from the tire wake, generates a pair of 
counter-rotating vortices: the upper one travels down 
the sidepod and acts as an aerodynamic skirt, sealing 


the low-pressure area under the underbody; the lower 
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Fig.4 Pressure coefficient contours: (a) top view; (b) bottom view. 





Fig.5 Iso-contour of Q = 50,000 1/s’: (a) top view; (b) bottom view. 
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vortex energizes the flow at the bottom of the car. 
Looking at the rear part of the vehicle, one can see 
the Venturi vortex developing at the inlet of the 
diffuser due to the difference of pressure between the 
underbody and the region at the side of the diffuser itself. 
Finally, the typical wingtip vortices detach from the 
rear wing, despite the presence of endplates and slots. 
Further details about the vortical structures around 
the single components of the vehicle bodywork can be 
examined by plotting axial vorticity contours. Fig. 6a 
clearly shows the presence of the so called Y250 
vortex: it develops between the neutral middle section 
and remaining profiles of the front wing and governs 
the flow towards the underbody inlet. The outwash 
endplate and the channel underneath the end of the 
wing (called Venturi channel) generate a couple of 
vortices, with negative vorticity, which help the flow 
to bypass the front tire. The interaction between the 
front wing and the front tire represents a really 


complex phenomenon due the unsteadiness of the 


flow and the 


parameters (camber, steering angle and toe angle) on 


influence of the tire alignment 


vortex development. Fig. 6b shows three main vortex 
cores: aS mentioned before, two of them are related to 
the bargeboard; the third one, located between the 
bargeboard itself and the plank, is generated by the 
delta-shaped part of the underbody. As witnessed by 
Fig. 6c, the vortex tubes develop along the entire step 
plane: the low pressure core of these fluid dynamic 
structures contributes to generation of downforce, in 
absence of side skirts that isolate the underbody flow. 

Concerning the rear region of the car (Fig. 6d), one 
can see the two vortices generated by the diffuser 
fences into the side of the main Venturi vortex. 
Looking at the flow underneath the car, each main 
vortex is coupled with a secondary structure, because 
of its interaction with the ground boundary layer. 

For the purpose of concluding the qualitative 
analysis of the flow, streamlines around the main 


components of the vehicle are plotted in Fig. 7. The dual 





Fig. 6 Axial vorticity contours: (a) front wing; (b) bargeboard; (c) middle underbody; (d) diffuser. 
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Fig. 7 Streamlines colored by axial vorticity: (a) front wing,; (b) bargeboard; (c) underbody (bottom view); (d) diffuser; (e) 


underbody (side view). 


task of the front wing is best highlighted by Fig. 7a: 
apart from the primary function of generating 
downforce, it energizes the flow for better feeding the 
downstream aerodynamic devices, such as_ the 
underfloor. 

Fig. 7b puts in evidence the interaction between 
bargeboard and sidepod panel: the first acts like a 
huge vortex generator; the latter maintains the 
energized flow attached to the sides of the vehicle. 
The downforce generated by the underbody is a direct 
result of the vehicle front-end design: in fact, the 


diffuser is partially fed by the flow bypassing the front 


tires (Fig. 7c). As witnessed by Fig. 7d, the entire 
highly 


three-dimensional streamlines strengthened by the 


underfloor 1S characterized by 
diffuser activity: it accelerates the flow at its inlet and 
creates additional downforce by means of strong 
vortices. Fig. 7e gives evidence of the diffuser impact 
on the outflow: the streamlines are deflected upwards, 
giving rise to a huge wake. Besides the flow coming 
from the underfloor, the overall wake consists of 
several contributions, including that of rear wing and 
rear tires: this explains why a modern FI car is not 


able to generate an adequate amount of downforce in 
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slipstream. 


5. Conclusions 


Aerodynamic performance of a Fl 2017 car 
designed by PERRINN_ was 
open-source software OpenFOAM. The meshing phase 


analysed by the 


was particularly tricky because of the sophisticated 
geometry of the vehicle. SnappyHexMesh, the 
dedicated OpenFOAM tool, despite some challenges 
such as the layer addition algorithm, provided a fast 
and automatic meshing procedure. 

In view of the simulation complexity, a coupled 
approach was chosen to avoid numerical instability in 
the very first iterations and reduce the number of 
iterations to reach convergence. The results of the S-A 
RANS incompressible calculation were found to be in 
good agreement with the reference data in terms of 
drag, downforce efficiency and front balance. As a 
general comment, OpenFOAM is a good instrument 
for external aerodynamic investigations, considering 
that it is license-free and it is particularly suitable for 
parallel computing. 

Regarding the Fl car aerodynamics, it can be 
concluded that the flow field involves slender and 
blunt bodies interacting with each other. A F1 car 
generates downforce in different ways: by inverted 
multi-element wings, ground effect of the underbody, 
effect of the diffuser 


management. The contributions of front and rear wing 


Venturi and vorticity 
to downforce are 26.3% and 27.5%, respectively; most 
of the remaining percentage is attributable to the 
underbody. Front and rear wheels are the main source 
of pressure drag, because they generate a huge wake. 
The rear wing is primarily responsible for induced 
drag, as a result of axial vortices detaching from the 
wingtips. The front wing deserves a comment of its 
own: its axial vortices can be used to improve the 


performance of the underbody and consequently 


generate more downforce. 

It appears that ground effect is the most convenient 
way to generate downforce: for this reason underbody 
is more efficient than front wing, and front wing is in 


turn more efficient than rear wing. 
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